TOP ACTIVATIONS
MAX = 0.085

again ? Can we trade shells back ?" L
" Can I borrow your shovel ?" The neighbor thought for
She said , " A br ac ad ab ra !"
egg ! He watched in amaz ement as the egg began
go ! Everyone watched in amaz ement as it took off
wand and say " A br ac ad ab ra !"
. Can we still have tea and cookies after ?"
and said , " A br ac ad ab ra ,
sand . She used her shovel to dig it up and
Sam gasped in amaz ement . He had never
her mom my watched in amaz ement as the machine flew
air . He watched with amaz ement as the z ig
water . He gasped in amaz ement as he saw the
ly . Jill nodded in amaz ement .
ears . Can we have some , please ?" Tom says
another wand . " A br ac ad ab ra !"
" Can we have some cake , Mom my ?"
said together , " A br ac ad ab ra !"
. Now she had a brand new brush and she would
Little Jack gasped in amaz ement . Dad pointed to

MIN ACTIVATIONS
MIN = -0.083

the noise until he stumbled upon a big surprise .
what was around the river â s bend .
exploring , the cat stumbled upon a large shadow . The
felt like hours before stumbling upon a strange house . The
, it 's too dangerous â . Max
forest home until he came upon a cottage .
walking along , he stumbled upon an old tower . Sir
was searching he soon stumbled upon a large bull . The
and the rain was falling heavily . The little bird was
the sound until she stumbled upon a beautiful bird perched on
particularly tall tree and stumbled upon a hard , shiny thing
case sitting in the neighbor â s garden .
his journey , he stumbled upon a tree with a funny
hours when he suddenly came upon a big , tall fence
the house and soon stumbled upon an old chest . Peter
the woods when she stumbled upon a big dark billboard .
and it started to rain heavily . The little girl cried
and low until they came upon the most unexpected place .
it seemed that it wouldn â t be allowed
find and suddenly he stumbled upon some old rocks . In

INTERVAL 0.043 - 0.085
CONTAINS 0.381%

so pretty . Can I paint with you ?"
you can also have some cake and ice cream for dessert
made him happy . <|endoftext|> Once upon a time there was
was so excited . He looked around , but he didn
!" Her dad looked at her . He laughed

INTERVAL 0.001 - 0.043
CONTAINS 46.668%

. As they walked through the zoo ,
mom . She was making a y ummy snack in the
serious , but that they would be okay . Tom felt
are proud of her . They tell her she is a
. They also saw some new or nam ents that Santa

INTERVAL -0.041 - 0.001
CONTAINS 52.327%

Mia saw that the tiger was not mean , just hungry
it used its strong claws to pick up the garbage and
be useful again !" And <|endoftext|> Once upon
He thought it might help him . He wished and wished
. Sally saw her friend Tim . Tim said , "

INTERVAL -0.083 - -0.041
CONTAINS 0.624%

about not chewing things that weren 't food .
it went . <|endoftext|> Once upon a time there was a
robot friend . <|endoftext|> Once upon a time , there was
it safe to go outside when it is raining ?" Lily
playing too much in the dark . Bun Bun promised to
Token again
Token ID757
Feature activation+2.28e-02

Pos logit contributions
ades+0.082
 penn+0.073
uddle+0.071
igs+0.071
 sho+0.070
Neg logit contributions
 You-0.066
|-0.057
You-0.054
 seem-0.052
 Listen-0.049
Token?
Token ID30
Feature activation-8.40e-03

Pos logit contributions
ades+0.037
brush+0.032
aid+0.032
 sho+0.032
uddle+0.031
Neg logit contributions
 You-0.039
|-0.036
 seem-0.034
 Today-0.034
 generation-0.034
Token Can
Token ID1680
Feature activation-2.64e-03

Pos logit contributions
|+0.007
 You+0.006
 seem+0.006
Warning+0.005
 generation+0.005
Neg logit contributions
ades-0.020
 sho-0.018
brush-0.018
uddle-0.018
aid-0.018
Token we
Token ID356
Feature activation+4.26e-02

Pos logit contributions
|+0.003
 You+0.003
 seem+0.003
Warning+0.003
 generation+0.003
Neg logit contributions
ades-0.005
 sho-0.005
uddle-0.005
brush-0.005
aid-0.005
Token trade
Token ID3292
Feature activation+7.16e-02

Pos logit contributions
ades+0.101
 tant+0.093
ins+0.092
usted+0.090
 handful+0.090
Neg logit contributions
 You-0.043
 seem-0.040
|-0.035
 Listen-0.031
 Today-0.031
Token shells
Token ID19679
Feature activation+8.24e-02

Pos logit contributions
ades+0.122
usted+0.116
uddle+0.113
brush+0.113
 sho+0.111
Neg logit contributions
 You-0.109
You-0.085
|-0.084
Warning-0.077
 That-0.076
Token back
Token ID736
Feature activation+2.80e-02

Pos logit contributions
ades+0.146
 sudden+0.137
 hairst+0.134
 sho+0.134
 brand+0.127
Neg logit contributions
 You-0.117
|-0.099
 seem-0.092
 Today-0.090
You-0.087
Token?"
Token ID1701
Feature activation+9.25e-03

Pos logit contributions
ades+0.048
 sho+0.043
aid+0.042
brush+0.042
 brand+0.041
Neg logit contributions
 You-0.043
|-0.041
keeping-0.036
 Today-0.036
 generation-0.035
Token
Token ID198
Feature activation+7.62e-03

Pos logit contributions
ades+0.016
uddle+0.014
aid+0.014
 sho+0.014
brush+0.014
Neg logit contributions
|-0.014
 You-0.014
 seem-0.013
 generation-0.013
Warning-0.013
Token
Token ID198
Feature activation+6.75e-03

Pos logit contributions
ades+0.016
 sho+0.015
brush+0.014
uddle+0.014
 penn+0.014
Neg logit contributions
 You-0.009
|-0.009
 seem-0.008
Warning-0.007
 generation-0.007
TokenL
Token ID43
Feature activation+7.34e-04

Pos logit contributions
ades+0.011
 sho+0.010
uddle+0.010
brush+0.010
 penn+0.010
Neg logit contributions
 You-0.011
|-0.010
Warning-0.009
You-0.009
 seem-0.009
Token "
Token ID366
Feature activation+2.26e-03

Pos logit contributions
ades+0.042
uddle+0.038
brush+0.037
 sho+0.036
aid+0.036
Neg logit contributions
 You-0.036
|-0.034
 seem-0.032
Warning-0.031
 generation-0.030
TokenCan
Token ID6090
Feature activation-6.64e-03

Pos logit contributions
ades+0.004
 sho+0.004
brush+0.004
uddle+0.004
aid+0.004
Neg logit contributions
 You-0.003
|-0.003
 seem-0.003
Warning-0.003
 generation-0.003
Token I
Token ID314
Feature activation+2.19e-02

Pos logit contributions
|+0.008
 You+0.007
Warning+0.007
 seem+0.007
 generation+0.007
Neg logit contributions
ades-0.014
 sho-0.012
brush-0.012
uddle-0.012
aid-0.012
Token borrow
Token ID8804
Feature activation+5.07e-02

Pos logit contributions
ades+0.049
uddle+0.045
ins+0.044
usted+0.044
udd+0.043
Neg logit contributions
 You-0.025
 seem-0.025
|-0.023
 Have-0.019
 Today-0.019
Token your
Token ID534
Feature activation+1.24e-02

Pos logit contributions
ades+0.097
uddle+0.092
udd+0.087
brush+0.086
ade+0.085
Neg logit contributions
 You-0.079
 seem-0.060
|-0.058
 That-0.057
You-0.056
Token shovel
Token ID33677
Feature activation+8.20e-02

Pos logit contributions
ades+0.018
aid+0.016
brush+0.016
uddle+0.015
usted+0.015
Neg logit contributions
 You-0.022
 seem-0.020
|-0.020
 generation-0.019
Warning-0.019
Token?"
Token ID1701
Feature activation+1.33e-02

Pos logit contributions
ades+0.144
 hairst+0.137
aid+0.137
 sudden+0.134
 swapped+0.126
Neg logit contributions
 You-0.118
 Today-0.101
|-0.094
 seem-0.093
 That-0.085
Token The
Token ID383
Feature activation+8.78e-03

Pos logit contributions
ades+0.024
 sho+0.022
uddle+0.021
brush+0.021
aid+0.021
Neg logit contributions
 You-0.019
|-0.019
 seem-0.017
Warning-0.017
 generation-0.016
Token neighbor
Token ID4780
Feature activation+1.77e-02

Pos logit contributions
ades+0.011
uddle+0.009
 sho+0.009
brush+0.009
aid+0.008
Neg logit contributions
 You-0.018
|-0.018
 seem-0.017
Warning-0.017
 generation-0.017
Token thought
Token ID1807
Feature activation+3.66e-02

Pos logit contributions
ades+0.029
uddle+0.024
 sunscreen+0.023
 penn+0.023
brush+0.022
Neg logit contributions
 You-0.031
|-0.029
 seem-0.028
elope-0.027
 Today-0.026
Token for
Token ID329
Feature activation-1.13e-02

Pos logit contributions
ades+0.062
uddle+0.054
 sho+0.054
brush+0.053
aid+0.052
Neg logit contributions
 You-0.058
|-0.057
 seem-0.053
Warning-0.051
 generation-0.050
Token She
Token ID1375
Feature activation+2.09e-02

Pos logit contributions
ades+0.017
uddle+0.015
brush+0.014
 sho+0.014
aid+0.014
Neg logit contributions
|-0.014
 You-0.013
rated-0.013
keeping-0.013
 generation-0.013
Token said
Token ID531
Feature activation+2.43e-02

Pos logit contributions
ades+0.041
uddle+0.036
 sunscreen+0.035
 brand+0.034
ade+0.034
Neg logit contributions
 You-0.027
|-0.026
 seem-0.025
Warning-0.024
You-0.024
Token,
Token ID11
Feature activation-3.68e-04

Pos logit contributions
ades+0.039
uddle+0.037
 brand+0.033
udd+0.033
ade+0.032
Neg logit contributions
 You-0.043
 seem-0.039
 Listen-0.035
 That-0.034
 Come-0.034
Token "
Token ID366
Feature activation-9.11e-03

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
Warning+0.001
 generation+0.001
Neg logit contributions
ades-0.001
 sho-0.001
uddle-0.001
brush-0.001
aid-0.001
TokenA
Token ID32
Feature activation+2.11e-02

Pos logit contributions
|+0.015
 You+0.014
keeping+0.014
rated+0.014
 seem+0.013
Neg logit contributions
ades-0.014
aid-0.013
brush-0.013
 sho-0.012
 scissors-0.012
Tokenbr
Token ID1671
Feature activation+8.17e-02

Pos logit contributions
ades+0.030
uddle+0.026
 sho+0.026
brush+0.025
aid+0.025
Neg logit contributions
 You-0.039
|-0.038
 seem-0.036
Warning-0.035
 generation-0.035
Tokenac
Token ID330
Feature activation+1.95e-02

Pos logit contributions
ades+0.118
 brand+0.118
 sunscreen+0.117
 sho+0.108
 helmet+0.104
Neg logit contributions
 You-0.133
|-0.129
 Today-0.125
 seem-0.123
Warning-0.118
Tokenad
Token ID324
Feature activation+4.24e-03

Pos logit contributions
ades+0.036
uddle+0.032
 sho+0.032
brush+0.031
aid+0.031
Neg logit contributions
 You-0.028
|-0.028
 seem-0.025
Warning-0.025
 generation-0.024
Tokenab
Token ID397
Feature activation+3.03e-02

Pos logit contributions
ades+0.006
 sho+0.005
uddle+0.005
brush+0.005
aid+0.005
Neg logit contributions
 You-0.008
|-0.008
 seem-0.007
Warning-0.007
 generation-0.007
Tokenra
Token ID430
Feature activation+2.86e-02

Pos logit contributions
ades+0.047
uddle+0.040
 sho+0.040
brush+0.040
aid+0.040
Neg logit contributions
 You-0.052
|-0.051
 seem-0.047
Warning-0.046
 generation-0.045
Token!"
Token ID2474
Feature activation+4.15e-03

Pos logit contributions
ades+0.039
uddle+0.034
 sho+0.033
aid+0.032
 brand+0.032
Neg logit contributions
 You-0.055
|-0.052
 seem-0.050
Warning-0.048
 generation-0.047
Token egg
Token ID5935
Feature activation+1.37e-02

Pos logit contributions
ades+0.010
 sho+0.009
uddle+0.009
brush+0.008
aid+0.008
Neg logit contributions
 You-0.013
|-0.012
 seem-0.012
Warning-0.011
 generation-0.011
Token!
Token ID0
Feature activation-1.60e-02

Pos logit contributions
ades+0.020
brush+0.017
 sho+0.017
uddle+0.017
aid+0.017
Neg logit contributions
 You-0.024
|-0.024
 seem-0.022
Warning-0.022
 generation-0.022
Token He
Token ID679
Feature activation+1.09e-05

Pos logit contributions
|+0.019
 seem+0.018
rated+0.017
 You+0.017
Warning+0.017
Neg logit contributions
ades-0.030
brush-0.028
 sho-0.027
aid-0.027
uddle-0.027
Token watched
Token ID7342
Feature activation+1.29e-02

Pos logit contributions
ades+0.000
uddle+0.000
 sho+0.000
brush+0.000
aid+0.000
Neg logit contributions
 You-0.000
|-0.000
 seem-0.000
 generation-0.000
Warning-0.000
Token in
Token ID287
Feature activation-8.00e-03

Pos logit contributions
ades+0.020
uddle+0.018
aid+0.017
 sho+0.017
brush+0.017
Neg logit contributions
 You-0.023
|-0.021
 seem-0.021
 generation-0.020
Warning-0.019
Token amaz
Token ID40642
Feature activation+8.13e-02

Pos logit contributions
 You+0.014
|+0.013
 seem+0.013
Warning+0.012
 generation+0.012
Neg logit contributions
ades-0.012
uddle-0.011
 sho-0.011
brush-0.011
aid-0.010
Tokenement
Token ID972
Feature activation-6.78e-03

Pos logit contributions
 sho+0.116
 sunscreen+0.113
 dads+0.112
aid+0.109
uddle+0.107
Neg logit contributions
 You-0.139
 seem-0.118
 generation-0.112
 Today-0.111
Warning-0.108
Token as
Token ID355
Feature activation-1.60e-02

Pos logit contributions
|+0.013
 You+0.013
 seem+0.012
Warning+0.012
 generation+0.012
Neg logit contributions
ades-0.009
 sho-0.008
uddle-0.008
brush-0.008
aid-0.007
Token the
Token ID262
Feature activation-2.66e-03

Pos logit contributions
 You+0.028
|+0.027
 seem+0.025
 generation+0.024
Warning+0.024
Neg logit contributions
ades-0.025
uddle-0.022
 sho-0.021
brush-0.021
aid-0.021
Token egg
Token ID5935
Feature activation-5.82e-03

Pos logit contributions
 You+0.005
|+0.005
 seem+0.005
Warning+0.005
 generation+0.005
Neg logit contributions
ades-0.003
 sho-0.003
uddle-0.003
brush-0.003
aid-0.003
Token began
Token ID2540
Feature activation-1.56e-02

Pos logit contributions
 You+0.010
|+0.010
 seem+0.009
Warning+0.009
 generation+0.009
Neg logit contributions
ades-0.009
uddle-0.008
 sho-0.008
brush-0.007
aid-0.007
Token go
Token ID467
Feature activation+5.46e-03

Pos logit contributions
ades+0.009
uddle+0.008
aid+0.007
brush+0.007
 sho+0.007
Neg logit contributions
 You-0.007
|-0.007
 seem-0.007
 generation-0.006
keeping-0.006
Token!
Token ID0
Feature activation-1.02e-02

Pos logit contributions
ades+0.007
uddle+0.006
 sho+0.006
brush+0.006
aid+0.006
Neg logit contributions
 You-0.010
|-0.010
Warning-0.010
 seem-0.010
 generation-0.009
Token Everyone
Token ID11075
Feature activation+1.39e-03

Pos logit contributions
|+0.013
 seem+0.012
rated+0.012
 You+0.012
Warning+0.012
Neg logit contributions
ades-0.018
brush-0.017
 sho-0.016
uddle-0.016
aid-0.016
Token watched
Token ID7342
Feature activation+7.97e-03

Pos logit contributions
ades+0.002
 sho+0.002
uddle+0.002
brush+0.002
aid+0.002
Neg logit contributions
 You-0.002
|-0.002
 seem-0.002
Warning-0.002
 generation-0.002
Token in
Token ID287
Feature activation-8.90e-03

Pos logit contributions
ades+0.011
 sho+0.009
uddle+0.009
brush+0.009
aid+0.009
Neg logit contributions
 You-0.015
|-0.015
 seem-0.014
Warning-0.014
 generation-0.013
Token amaz
Token ID40642
Feature activation+8.08e-02

Pos logit contributions
|+0.016
 You+0.016
 seem+0.015
Warning+0.015
 generation+0.015
Neg logit contributions
ades-0.012
 sho-0.011
brush-0.011
uddle-0.010
 scissors-0.010
Tokenement
Token ID972
Feature activation-5.48e-03

Pos logit contributions
 sho+0.108
aid+0.103
 sunscreen+0.100
 dads+0.096
 handful+0.096
Neg logit contributions
 You-0.145
 seem-0.127
|-0.124
 generation-0.121
 Today-0.120
Token as
Token ID355
Feature activation-1.56e-02

Pos logit contributions
|+0.010
 You+0.010
Warning+0.009
 seem+0.009
 generation+0.009
Neg logit contributions
ades-0.008
uddle-0.006
brush-0.006
 sho-0.006
 penn-0.006
Token it
Token ID340
Feature activation-1.87e-02

Pos logit contributions
 You+0.028
|+0.028
 seem+0.026
Warning+0.026
 generation+0.025
Neg logit contributions
ades-0.022
 sho-0.019
brush-0.019
uddle-0.019
aid-0.018
Token took
Token ID1718
Feature activation+7.81e-03

Pos logit contributions
 You+0.031
|+0.030
 seem+0.028
Warning+0.027
 generation+0.027
Neg logit contributions
ades-0.030
 sho-0.026
uddle-0.026
brush-0.026
aid-0.025
Token off
Token ID572
Feature activation-1.92e-03

Pos logit contributions
ades+0.012
uddle+0.011
 sho+0.010
brush+0.010
aid+0.010
Neg logit contributions
 You-0.014
|-0.013
 seem-0.012
 generation-0.012
Warning-0.012
Token wand
Token ID11569
Feature activation+2.99e-02

Pos logit contributions
ades+0.012
aid+0.010
ade+0.010
brush+0.010
uddle+0.010
Neg logit contributions
 You-0.014
|-0.013
 seem-0.012
 generation-0.012
keeping-0.012
Token and
Token ID290
Feature activation-2.18e-03

Pos logit contributions
ades+0.048
uddle+0.042
aid+0.041
 penn+0.039
 sho+0.038
Neg logit contributions
 You-0.049
 seem-0.046
|-0.046
 generation-0.044
 Today-0.042
Token say
Token ID910
Feature activation+2.46e-02

Pos logit contributions
 You+0.003
|+0.003
 seem+0.003
 generation+0.003
Warning+0.003
Neg logit contributions
ades-0.004
uddle-0.003
aid-0.003
 sho-0.003
brush-0.003
Token "
Token ID366
Feature activation-2.11e-02

Pos logit contributions
ades+0.037
uddle+0.036
aid+0.033
ade+0.032
brush+0.032
Neg logit contributions
 You-0.044
 seem-0.041
|-0.036
 Welcome-0.036
 Today-0.035
TokenA
Token ID32
Feature activation+2.48e-02

Pos logit contributions
 You+0.032
|+0.031
 seem+0.030
ses+0.029
keeping+0.028
Neg logit contributions
ades-0.035
aid-0.032
brush-0.031
 sho-0.030
is-0.030
Tokenbr
Token ID1671
Feature activation+8.02e-02

Pos logit contributions
ades+0.030
 sho+0.025
uddle+0.025
brush+0.025
aid+0.024
Neg logit contributions
 You-0.050
|-0.050
 seem-0.047
 generation-0.046
Warning-0.046
Tokenac
Token ID330
Feature activation+2.56e-02

Pos logit contributions
 sunscreen+0.121
 brand+0.120
 sho+0.110
 cream+0.109
uddle+0.106
Neg logit contributions
|-0.122
 You-0.117
 seem-0.115
 Today-0.113
 generation-0.104
Tokenad
Token ID324
Feature activation-2.21e-03

Pos logit contributions
ades+0.049
uddle+0.046
 sho+0.045
 brand+0.044
brush+0.044
Neg logit contributions
 You-0.033
|-0.033
 seem-0.029
Warning-0.029
 generation-0.028
Tokenab
Token ID397
Feature activation+2.64e-02

Pos logit contributions
 You+0.004
|+0.004
 seem+0.004
Warning+0.004
 generation+0.004
Neg logit contributions
ades-0.003
 sho-0.002
brush-0.002
uddle-0.002
aid-0.002
Tokenra
Token ID430
Feature activation+3.16e-02

Pos logit contributions
ades+0.040
uddle+0.035
 sho+0.035
aid+0.034
brush+0.034
Neg logit contributions
 You-0.045
|-0.044
 seem-0.041
 generation-0.040
Warning-0.040
Token!"
Token ID2474
Feature activation-2.35e-03

Pos logit contributions
ades+0.053
uddle+0.048
 sho+0.047
aid+0.047
 penn+0.045
Neg logit contributions
 You-0.050
|-0.047
 seem-0.045
Warning-0.042
 generation-0.042
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
ades+0.029
 sho+0.025
uddle+0.025
brush+0.025
aid+0.024
Neg logit contributions
 You-0.039
|-0.038
 seem-0.036
Warning-0.035
 generation-0.035
Token Can
Token ID1680
Feature activation-5.39e-03

Pos logit contributions
|+0.019
rated+0.018
 seem+0.018
Warning+0.017
keeping+0.017
Neg logit contributions
brush-0.044
 sho-0.042
ades-0.042
aid-0.042
 moms-0.041
Token we
Token ID356
Feature activation+3.80e-02

Pos logit contributions
|+0.008
keeping+0.007
 You+0.007
rated+0.007
Warning+0.007
Neg logit contributions
ades-0.009
brush-0.009
 sho-0.008
uddle-0.008
aid-0.008
Token still
Token ID991
Feature activation+8.04e-03

Pos logit contributions
ades+0.086
ins+0.081
 tant+0.076
uddle+0.075
atted+0.075
Neg logit contributions
 You-0.042
 seem-0.041
|-0.036
 Today-0.034
 have-0.031
Token have
Token ID423
Feature activation+4.50e-02

Pos logit contributions
ades+0.016
uddle+0.014
 sho+0.014
 penn+0.014
ade+0.014
Neg logit contributions
 You-0.010
|-0.010
 seem-0.010
 generation-0.009
 Today-0.008
Token tea
Token ID8887
Feature activation+7.98e-02

Pos logit contributions
ades+0.102
uddle+0.101
as+0.096
bles+0.093
uing+0.091
Neg logit contributions
 You-0.039
 seem-0.035
 been-0.035
|-0.033
 generation-0.032
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
ades+0.135
 sudden+0.131
isel+0.128
atted+0.127
 penn+0.126
Neg logit contributions
 You-0.107
|-0.100
 Today-0.091
 seem-0.087
You-0.082
Token cookies
Token ID14746
Feature activation+6.35e-02

Pos logit contributions
ades+0.030
uddle+0.025
ade+0.023
in+0.023
 sho+0.023
Neg logit contributions
 You-0.032
|-0.031
 seem-0.030
 generation-0.028
keeping-0.027
Token after
Token ID706
Feature activation-1.21e-02

Pos logit contributions
ades+0.118
 sho+0.103
uddle+0.101
 penn+0.100
 handful+0.098
Neg logit contributions
 You-0.094
|-0.083
 Today-0.077
 seem-0.076
You-0.074
Token?"
Token ID1701
Feature activation+1.45e-02

Pos logit contributions
 You+0.022
|+0.022
 seem+0.020
Warning+0.020
 generation+0.020
Neg logit contributions
ades-0.017
 sho-0.015
uddle-0.015
brush-0.015
aid-0.014
Token
Token ID198
Feature activation+1.71e-02

Pos logit contributions
ades+0.026
 sho+0.023
uddle+0.023
brush+0.023
aid+0.022
Neg logit contributions
|-0.021
 You-0.021
 seem-0.019
Warning-0.019
 generation-0.019
Token and
Token ID290
Feature activation-4.75e-03

Pos logit contributions
 You+0.032
|+0.031
 seem+0.029
 generation+0.028
Warning+0.028
Neg logit contributions
ades-0.031
uddle-0.027
 sho-0.026
aid-0.026
brush-0.026
Token said
Token ID531
Feature activation+2.36e-02

Pos logit contributions
 You+0.008
|+0.008
 seem+0.008
Warning+0.008
 generation+0.008
Neg logit contributions
ades-0.007
uddle-0.006
brush-0.006
 sho-0.006
aid-0.006
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
ades+0.033
uddle+0.029
aid+0.028
 brand+0.028
brush+0.028
Neg logit contributions
 You-0.045
 seem-0.041
 Welcome-0.037
 Today-0.037
 Listen-0.037
Token "
Token ID366
Feature activation-1.34e-02

Pos logit contributions
|+0.023
 You+0.022
 seem+0.021
Warning+0.020
 generation+0.020
Neg logit contributions
ades-0.023
 sho-0.020
brush-0.020
uddle-0.020
aid-0.020
TokenA
Token ID32
Feature activation+1.21e-02

Pos logit contributions
|+0.019
 You+0.019
 seem+0.018
keeping+0.018
rated+0.018
Neg logit contributions
ades-0.022
aid-0.022
brush-0.021
 sho-0.020
is-0.020
Tokenbr
Token ID1671
Feature activation+7.94e-02

Pos logit contributions
ades+0.016
uddle+0.013
 sho+0.013
brush+0.013
aid+0.012
Neg logit contributions
 You-0.024
|-0.024
 seem-0.022
 generation-0.022
Warning-0.021
Tokenac
Token ID330
Feature activation+1.89e-02

Pos logit contributions
 brand+0.120
ades+0.119
 sunscreen+0.118
 sho+0.110
 helmet+0.105
Neg logit contributions
|-0.124
 You-0.123
 Today-0.117
ses-0.103
Warning-0.102
Tokenad
Token ID324
Feature activation-4.47e-03

Pos logit contributions
ades+0.035
uddle+0.031
 sho+0.031
brush+0.031
 brand+0.030
Neg logit contributions
 You-0.027
|-0.026
 seem-0.024
Warning-0.023
 generation-0.023
Tokenab
Token ID397
Feature activation+2.21e-02

Pos logit contributions
 You+0.009
|+0.009
 seem+0.008
Warning+0.008
 generation+0.008
Neg logit contributions
ades-0.006
brush-0.005
 sho-0.005
uddle-0.005
aid-0.005
Tokenra
Token ID430
Feature activation+2.56e-02

Pos logit contributions
ades+0.034
uddle+0.029
 sho+0.029
brush+0.029
aid+0.028
Neg logit contributions
 You-0.038
|-0.037
 seem-0.035
Warning-0.034
 generation-0.034
Token,
Token ID11
Feature activation+5.03e-03

Pos logit contributions
ades+0.034
uddle+0.030
 sho+0.029
aid+0.028
 brand+0.028
Neg logit contributions
 You-0.050
|-0.047
 seem-0.045
Warning-0.043
 generation-0.043
Token sand
Token ID6450
Feature activation-1.30e-03

Pos logit contributions
 You+0.032
|+0.032
 seem+0.029
 generation+0.028
Warning+0.028
Neg logit contributions
ades-0.039
uddle-0.034
brush-0.034
 sho-0.034
aid-0.034
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
 You+0.002
|+0.002
 seem+0.002
Warning+0.002
 generation+0.002
Neg logit contributions
ades-0.002
 sho-0.002
uddle-0.002
brush-0.002
aid-0.002
Token She
Token ID1375
Feature activation+2.46e-02

Pos logit contributions
|+0.003
 You+0.003
 generation+0.003
 seem+0.003
Warning+0.003
Neg logit contributions
ades-0.007
 sho-0.006
uddle-0.006
brush-0.006
aid-0.006
Token used
Token ID973
Feature activation+2.68e-02

Pos logit contributions
ades+0.049
brush+0.043
uddle+0.043
aid+0.041
 sunscreen+0.041
Neg logit contributions
|-0.031
 You-0.031
 seem-0.028
Warning-0.027
 generation-0.027
Token her
Token ID607
Feature activation+2.75e-02

Pos logit contributions
ades+0.058
uddle+0.057
udd+0.052
ins+0.049
in+0.049
Neg logit contributions
 seem-0.032
 You-0.029
 generation-0.026
Warning-0.026
keeping-0.025
Token shovel
Token ID33677
Feature activation+7.90e-02

Pos logit contributions
ades+0.044
ins+0.038
udd+0.036
is+0.036
aid+0.036
Neg logit contributions
 You-0.047
 seem-0.046
|-0.043
ses-0.043
 generation-0.043
Token to
Token ID284
Feature activation+3.16e-03

Pos logit contributions
ades+0.156
 cowboy+0.147
aid+0.142
uddle+0.138
 hairst+0.138
Neg logit contributions
 You-0.088
 seem-0.083
 She-0.077
keeping-0.077
 Today-0.071
Token dig
Token ID3100
Feature activation+2.41e-02

Pos logit contributions
ades+0.007
uddle+0.006
ins+0.006
udd+0.006
ip+0.005
Neg logit contributions
 seem-0.004
|-0.004
 generation-0.004
 You-0.004
Warning-0.004
Token it
Token ID340
Feature activation-1.97e-02

Pos logit contributions
ades+0.036
uddle+0.031
aid+0.030
 sho+0.030
brush+0.030
Neg logit contributions
 You-0.042
|-0.042
 seem-0.039
 generation-0.038
Warning-0.038
Token up
Token ID510
Feature activation-1.11e-02

Pos logit contributions
 You+0.036
|+0.035
 seem+0.033
 generation+0.032
Warning+0.032
Neg logit contributions
ades-0.029
aid-0.024
uddle-0.024
brush-0.024
 sho-0.024
Token and
Token ID290
Feature activation-6.49e-03

Pos logit contributions
 You+0.021
|+0.021
 seem+0.019
Warning+0.019
 generation+0.019
Neg logit contributions
ades-0.015
 sho-0.013
uddle-0.013
brush-0.013
aid-0.013
Token
Token ID198
Feature activation+3.71e-03

Pos logit contributions
|+0.030
 generation+0.030
Warning+0.029
 You+0.029
rated+0.028
Neg logit contributions
ades-0.028
uddle-0.025
brush-0.024
 sho-0.023
ip-0.022
Token
Token ID198
Feature activation+2.01e-03

Pos logit contributions
ades+0.007
uddle+0.006
brush+0.006
 sho+0.006
aid+0.006
Neg logit contributions
 You-0.005
|-0.005
 seem-0.005
 generation-0.005
Warning-0.005
TokenSam
Token ID16305
Feature activation+4.94e-03

Pos logit contributions
ades+0.003
 sho+0.002
uddle+0.002
brush+0.002
aid+0.002
Neg logit contributions
 You-0.004
|-0.004
 seem-0.004
Warning-0.004
 generation-0.004
Token gasped
Token ID45236
Feature activation-1.61e-03

Pos logit contributions
ades+0.008
uddle+0.007
brush+0.007
 sho+0.006
aid+0.006
Neg logit contributions
 You-0.009
|-0.008
 seem-0.008
Warning-0.008
 generation-0.007
Token in
Token ID287
Feature activation+9.89e-03

Pos logit contributions
|+0.003
 You+0.003
 seem+0.003
Warning+0.003
 generation+0.003
Neg logit contributions
ades-0.002
 sho-0.002
uddle-0.002
brush-0.002
 penn-0.002
Token amaz
Token ID40642
Feature activation+7.86e-02

Pos logit contributions
ades+0.015
uddle+0.013
brush+0.012
aid+0.012
 sho+0.012
Neg logit contributions
 You-0.018
|-0.018
 seem-0.017
Warning-0.016
 generation-0.016
Tokenement
Token ID972
Feature activation+2.65e-03

Pos logit contributions
 sho+0.114
 sunscreen+0.112
 dads+0.111
aid+0.108
 handful+0.107
Neg logit contributions
 You-0.126
 seem-0.100
keeping-0.097
 MacDonald-0.096
Warning-0.095
Token.
Token ID13
Feature activation-6.94e-03

Pos logit contributions
ades+0.004
uddle+0.004
 sho+0.004
brush+0.004
 penn+0.004
Neg logit contributions
|-0.004
 You-0.004
Warning-0.004
 seem-0.004
 generation-0.004
Token He
Token ID679
Feature activation+9.67e-03

Pos logit contributions
|+0.009
 You+0.008
 seem+0.008
 generation+0.008
Warning+0.008
Neg logit contributions
ades-0.013
 sho-0.012
uddle-0.012
brush-0.012
 brand-0.012
Token had
Token ID550
Feature activation-6.90e-04

Pos logit contributions
ades+0.017
aid+0.016
brush+0.015
 penn+0.015
uddle+0.015
Neg logit contributions
 You-0.014
|-0.014
 seem-0.013
Warning-0.012
 generation-0.012
Token never
Token ID1239
Feature activation-1.67e-02

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
 generation+0.001
Warning+0.001
Neg logit contributions
ades-0.001
uddle-0.001
aid-0.001
 penn-0.001
ip-0.001
Token her
Token ID607
Feature activation-1.32e-03

Pos logit contributions
ades+0.008
uddle+0.007
brush+0.006
 sho+0.006
 sunscreen+0.006
Neg logit contributions
 You-0.008
 seem-0.007
|-0.007
 generation-0.007
rated-0.007
Token mom
Token ID1995
Feature activation-4.48e-04

Pos logit contributions
 You+0.003
|+0.003
 seem+0.002
 generation+0.002
Warning+0.002
Neg logit contributions
ades-0.002
uddle-0.001
brush-0.001
 sho-0.001
aid-0.001
Tokenmy
Token ID1820
Feature activation-8.12e-03

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
Warning+0.001
 generation+0.001
Neg logit contributions
ades-0.001
 sho-0.001
uddle-0.001
brush-0.001
aid-0.001
Token watched
Token ID7342
Feature activation+5.14e-04

Pos logit contributions
|+0.015
 You+0.014
 seem+0.013
Warning+0.013
 generation+0.013
Neg logit contributions
ades-0.012
 sho-0.010
brush-0.010
uddle-0.010
aid-0.010
Token in
Token ID287
Feature activation-7.50e-03

Pos logit contributions
ades+0.001
uddle+0.001
 sho+0.001
brush+0.001
aid+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
 generation-0.001
Warning-0.001
Token amaz
Token ID40642
Feature activation+7.85e-02

Pos logit contributions
 You+0.014
|+0.014
 seem+0.013
Warning+0.012
 generation+0.012
Neg logit contributions
ades-0.011
 sho-0.009
uddle-0.009
brush-0.009
aid-0.009
Tokenement
Token ID972
Feature activation-6.09e-03

Pos logit contributions
 sho+0.113
 dads+0.107
aid+0.104
 sunscreen+0.103
 handful+0.100
Neg logit contributions
 You-0.132
 seem-0.112
|-0.109
 Today-0.108
 generation-0.108
Token as
Token ID355
Feature activation-2.07e-02

Pos logit contributions
|+0.011
 You+0.010
Warning+0.010
 seem+0.009
 generation+0.009
Neg logit contributions
ades-0.009
uddle-0.008
brush-0.008
 sho-0.008
 penn-0.008
Token the
Token ID262
Feature activation+2.36e-03

Pos logit contributions
 You+0.033
|+0.032
 seem+0.030
 generation+0.029
Warning+0.028
Neg logit contributions
ades-0.035
uddle-0.031
 sho-0.030
brush-0.030
aid-0.030
Token machine
Token ID4572
Feature activation-7.25e-03

Pos logit contributions
ades+0.003
 sho+0.003
uddle+0.003
brush+0.003
aid+0.003
Neg logit contributions
 You-0.005
|-0.004
 seem-0.004
Warning-0.004
 generation-0.004
Token flew
Token ID13112
Feature activation+6.28e-04

Pos logit contributions
 You+0.013
|+0.012
 seem+0.012
Warning+0.011
 generation+0.011
Neg logit contributions
ades-0.011
 sho-0.009
uddle-0.009
brush-0.009
aid-0.009
Token air
Token ID1633
Feature activation-1.84e-02

Pos logit contributions
ades+0.009
uddle+0.009
brush+0.008
 sho+0.008
ade+0.008
Neg logit contributions
 You-0.007
|-0.007
 seem-0.006
Warning-0.006
 generation-0.006
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
 You+0.031
|+0.030
 seem+0.028
Warning+0.027
 generation+0.027
Neg logit contributions
ades-0.029
 sho-0.025
uddle-0.025
brush-0.025
aid-0.024
Token He
Token ID679
Feature activation+6.54e-03

Pos logit contributions
|+0.020
 seem+0.019
rated+0.019
Warning+0.018
keeping+0.018
Neg logit contributions
ades-0.033
brush-0.030
 sho-0.029
uddle-0.029
 sunscreen-0.028
Token watched
Token ID7342
Feature activation+9.62e-03

Pos logit contributions
ades+0.011
uddle+0.009
aid+0.009
brush+0.009
 sho+0.009
Neg logit contributions
 You-0.010
|-0.010
 seem-0.010
 generation-0.009
Warning-0.009
Token with
Token ID351
Feature activation+1.20e-02

Pos logit contributions
ades+0.015
uddle+0.014
aid+0.013
 sho+0.013
brush+0.013
Neg logit contributions
 You-0.017
|-0.015
 seem-0.015
 generation-0.014
 Today-0.014
Token amaz
Token ID40642
Feature activation+7.81e-02

Pos logit contributions
ades+0.021
uddle+0.019
ade+0.018
brush+0.018
ip+0.017
Neg logit contributions
 You-0.019
|-0.018
 seem-0.017
 generation-0.017
keeping-0.016
Tokenement
Token ID972
Feature activation-4.06e-03

Pos logit contributions
 sho+0.113
 sunscreen+0.109
 dads+0.107
 handful+0.101
aid+0.101
Neg logit contributions
 You-0.132
 seem-0.115
 generation-0.107
Warning-0.105
|-0.103
Token as
Token ID355
Feature activation-1.74e-02

Pos logit contributions
|+0.007
 You+0.007
 seem+0.007
Warning+0.007
 generation+0.007
Neg logit contributions
ades-0.006
uddle-0.005
 sho-0.005
brush-0.005
aid-0.005
Token the
Token ID262
Feature activation+3.51e-03

Pos logit contributions
 You+0.029
|+0.028
 seem+0.027
 generation+0.026
Warning+0.025
Neg logit contributions
ades-0.028
uddle-0.025
 sho-0.024
brush-0.024
aid-0.024
Token z
Token ID1976
Feature activation+1.95e-02

Pos logit contributions
ades+0.005
 sho+0.004
uddle+0.004
brush+0.004
aid+0.004
Neg logit contributions
 You-0.007
|-0.007
 seem-0.006
Warning-0.006
 generation-0.006
Tokenig
Token ID328
Feature activation-5.97e-03

Pos logit contributions
ades+0.019
 sho+0.016
 brand+0.015
aid+0.014
brush+0.014
Neg logit contributions
 You-0.042
|-0.041
 seem-0.039
 generation-0.038
Warning-0.038
Token water
Token ID1660
Feature activation-9.48e-03

Pos logit contributions
 You+0.006
|+0.006
 seem+0.006
 generation+0.005
 Today+0.005
Neg logit contributions
ades-0.005
aid-0.005
 sho-0.005
brush-0.005
uddle-0.004
Token.
Token ID13
Feature activation-5.21e-03

Pos logit contributions
|+0.015
 You+0.014
Warning+0.013
 seem+0.013
 generation+0.013
Neg logit contributions
ades-0.016
brush-0.014
uddle-0.014
 sho-0.014
ade-0.014
Token He
Token ID679
Feature activation+1.27e-02

Pos logit contributions
|+0.007
rated+0.007
Warning+0.007
 You+0.006
keeping+0.006
Neg logit contributions
ades-0.009
 brand-0.008
brush-0.008
uddle-0.008
 sho-0.008
Token gasped
Token ID45236
Feature activation+4.07e-04

Pos logit contributions
ades+0.023
aid+0.020
brush+0.020
 penn+0.019
uddle+0.019
Neg logit contributions
|-0.018
 seem-0.017
 You-0.017
elope-0.017
 generation-0.016
Token in
Token ID287
Feature activation+6.35e-03

Pos logit contributions
ades+0.001
uddle+0.000
 sho+0.000
brush+0.000
aid+0.000
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
Warning-0.001
 generation-0.001
Token amaz
Token ID40642
Feature activation+7.79e-02

Pos logit contributions
ades+0.010
uddle+0.009
brush+0.008
 sho+0.008
aid+0.008
Neg logit contributions
 You-0.011
|-0.011
 seem-0.010
Warning-0.010
 generation-0.010
Tokenement
Token ID972
Feature activation+9.64e-03

Pos logit contributions
 dads+0.116
 sho+0.115
 sunscreen+0.113
aid+0.112
uddle+0.108
Neg logit contributions
 You-0.120
 seem-0.102
Warning-0.099
 generation-0.094
You-0.094
Token as
Token ID355
Feature activation-1.63e-02

Pos logit contributions
ades+0.014
 sho+0.012
brush+0.012
uddle+0.012
aid+0.012
Neg logit contributions
 You-0.017
|-0.016
 seem-0.016
 generation-0.015
Warning-0.015
Token he
Token ID339
Feature activation+1.88e-02

Pos logit contributions
|+0.029
 You+0.029
 seem+0.026
Warning+0.026
 generation+0.026
Neg logit contributions
ades-0.024
 sho-0.021
brush-0.021
uddle-0.020
aid-0.020
Token saw
Token ID2497
Feature activation+1.63e-02

Pos logit contributions
ades+0.033
aid+0.029
brush+0.029
 penn+0.029
uddle+0.028
Neg logit contributions
 You-0.026
|-0.026
 seem-0.025
Warning-0.024
elope-0.023
Token the
Token ID262
Feature activation-5.07e-03

Pos logit contributions
ades+0.030
uddle+0.029
brush+0.026
aj+0.026
aid+0.025
Neg logit contributions
 You-0.023
 seem-0.021
 generation-0.020
Warning-0.019
keeping-0.019
Tokenly
Token ID306
Feature activation+3.54e-03

Pos logit contributions
ades+0.044
 sho+0.039
uddle+0.038
brush+0.037
 sunscreen+0.037
Neg logit contributions
 You-0.047
 seem-0.045
|-0.044
rated-0.041
Warning-0.040
Token.
Token ID13
Feature activation+9.19e-03

Pos logit contributions
ades+0.006
uddle+0.006
 sho+0.005
aid+0.005
brush+0.005
Neg logit contributions
 You-0.005
|-0.005
 seem-0.005
 generation-0.004
Warning-0.004
Token Jill
Token ID22772
Feature activation+3.31e-02

Pos logit contributions
ades+0.017
uddle+0.015
brush+0.015
 sho+0.015
ade+0.014
Neg logit contributions
|-0.013
 You-0.012
 seem-0.011
Warning-0.011
 generation-0.011
Token nodded
Token ID14464
Feature activation+4.07e-04

Pos logit contributions
ades+0.059
uddle+0.052
aid+0.051
 sunscreen+0.050
brush+0.050
Neg logit contributions
 You-0.047
|-0.047
 seem-0.046
 generation-0.041
Warning-0.040
Token in
Token ID287
Feature activation-6.09e-03

Pos logit contributions
ades+0.001
uddle+0.001
 sho+0.001
brush+0.001
aid+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
 generation-0.001
Warning-0.001
Token amaz
Token ID40642
Feature activation+7.78e-02

Pos logit contributions
|+0.011
 You+0.011
 seem+0.010
 generation+0.010
Warning+0.010
Neg logit contributions
ades-0.009
 sho-0.008
brush-0.008
uddle-0.008
aid-0.007
Tokenement
Token ID972
Feature activation+1.77e-03

Pos logit contributions
 sho+0.114
 sunscreen+0.110
 dads+0.109
uddle+0.106
 handful+0.105
Neg logit contributions
 You-0.125
 seem-0.108
Warning-0.103
 generation-0.103
keeping-0.101
Token.
Token ID13
Feature activation+8.68e-04

Pos logit contributions
ades+0.003
 sho+0.002
uddle+0.002
brush+0.002
aid+0.002
Neg logit contributions
|-0.003
 You-0.003
 seem-0.003
Warning-0.003
 generation-0.003
Token
Token ID220
Feature activation-5.02e-03

Pos logit contributions
ades+0.002
uddle+0.002
brush+0.001
 sho+0.001
 brand+0.001
Neg logit contributions
|-0.001
 seem-0.001
 You-0.001
rated-0.001
Warning-0.001
Token
Token ID198
Feature activation+5.34e-03

Pos logit contributions
|+0.007
 You+0.007
 seem+0.007
 generation+0.007
Warning+0.007
Neg logit contributions
ades-0.009
uddle-0.008
brush-0.008
 sho-0.008
aid-0.008
Token
Token ID198
Feature activation+4.46e-03

Pos logit contributions
ades+0.010
 sho+0.009
uddle+0.009
brush+0.009
aid+0.009
Neg logit contributions
 You-0.007
|-0.007
 seem-0.006
Warning-0.006
 generation-0.006
Tokenears
Token ID4127
Feature activation+2.90e-02

Pos logit contributions
ades+0.020
 sho+0.020
brush+0.019
aid+0.019
 scissors+0.018
Neg logit contributions
|-0.053
 You-0.052
 Today-0.048
keeping-0.047
 seem-0.047
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
ades+0.047
 sho+0.042
uddle+0.041
brush+0.041
aid+0.041
Neg logit contributions
 You-0.047
|-0.046
 seem-0.042
Warning-0.042
 generation-0.042
Token Can
Token ID1680
Feature activation-8.96e-03

Pos logit contributions
|+0.019
rated+0.019
keeping+0.017
 seem+0.017
Warning+0.016
Neg logit contributions
brush-0.034
ades-0.034
aid-0.032
 sho-0.032
 moms-0.031
Token we
Token ID356
Feature activation+3.03e-02

Pos logit contributions
|+0.013
rated+0.013
keeping+0.012
Warning+0.012
 generation+0.012
Neg logit contributions
ades-0.015
brush-0.014
 sho-0.013
 moms-0.013
 penn-0.013
Token have
Token ID423
Feature activation+4.08e-02

Pos logit contributions
ades+0.079
uddle+0.068
 sho+0.068
aj+0.067
 brand+0.067
Neg logit contributions
 seem-0.027
|-0.026
 You-0.026
 Today-0.019
 Have-0.018
Token some
Token ID617
Feature activation+7.78e-02

Pos logit contributions
uddle+0.088
ades+0.088
uing+0.081
as+0.077
udd+0.077
Neg logit contributions
 You-0.044
 seem-0.042
|-0.040
 been-0.038
You-0.036
Token,
Token ID11
Feature activation+2.91e-02

Pos logit contributions
ades+0.141
uing+0.125
ming+0.125
clip+0.124
 sho+0.118
Neg logit contributions
 You-0.104
 Today-0.094
 seem-0.091
|-0.083
 been-0.081
Token please
Token ID3387
Feature activation+1.83e-02

Pos logit contributions
ades+0.045
uddle+0.040
ade+0.038
udd+0.036
act+0.036
Neg logit contributions
 You-0.055
 seem-0.049
|-0.047
 Today-0.046
You-0.043
Token?"
Token ID1701
Feature activation+1.53e-02

Pos logit contributions
ades+0.025
 sho+0.021
uddle+0.021
brush+0.021
aid+0.020
Neg logit contributions
 You-0.036
|-0.034
 seem-0.033
 Today-0.031
 generation-0.031
Token Tom
Token ID4186
Feature activation-2.08e-03

Pos logit contributions
ades+0.019
 moms+0.016
 sho+0.016
 penn+0.016
 scissors+0.016
Neg logit contributions
|-0.030
rated-0.029
keeping-0.029
 You-0.028
 generation-0.027
Token says
Token ID1139
Feature activation+4.95e-03

Pos logit contributions
|+0.004
 You+0.004
 generation+0.004
 seem+0.004
Warning+0.004
Neg logit contributions
ades-0.003
 sho-0.002
brush-0.002
uddle-0.002
 scissors-0.002
Token another
Token ID1194
Feature activation+2.80e-04

Pos logit contributions
ades+0.005
uddle+0.005
aj+0.004
udd+0.004
pron+0.004
Neg logit contributions
 You-0.005
|-0.004
 seem-0.004
 generation-0.004
 Time-0.004
Token wand
Token ID11569
Feature activation+2.99e-02

Pos logit contributions
ades+0.000
uddle+0.000
aid+0.000
 sho+0.000
 penn+0.000
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
 Today-0.000
 generation-0.000
Token.
Token ID13
Feature activation-2.49e-03

Pos logit contributions
ades+0.054
uddle+0.048
aid+0.047
 sho+0.047
brush+0.047
Neg logit contributions
 You-0.043
|-0.041
 seem-0.039
 generation-0.037
Warning-0.037
Token "
Token ID366
Feature activation-1.38e-02

Pos logit contributions
 You+0.003
|+0.002
 Today+0.002
 seem+0.002
 Time+0.002
Neg logit contributions
ades-0.006
brush-0.005
 sho-0.005
 penn-0.005
aid-0.005
TokenA
Token ID32
Feature activation+1.47e-02

Pos logit contributions
 You+0.022
 seem+0.021
|+0.021
rated+0.020
 Today+0.020
Neg logit contributions
aid-0.021
ades-0.021
brush-0.020
 sho-0.019
 scissors-0.018
Tokenbr
Token ID1671
Feature activation+7.72e-02

Pos logit contributions
ades+0.016
 penn+0.012
uddle+0.012
aid+0.011
 sho+0.011
Neg logit contributions
 You-0.033
|-0.032
 seem-0.030
 generation-0.029
 Today-0.029
Tokenac
Token ID330
Feature activation+1.55e-02

Pos logit contributions
ades+0.112
 brand+0.111
 sunscreen+0.109
 sho+0.104
 cream+0.101
Neg logit contributions
|-0.124
 You-0.121
 Today-0.118
 seem-0.111
ses-0.104
Tokenad
Token ID324
Feature activation-4.44e-04

Pos logit contributions
ades+0.030
uddle+0.026
 sho+0.026
brush+0.026
 brand+0.025
Neg logit contributions
 You-0.021
|-0.021
 seem-0.019
Warning-0.018
 generation-0.018
Tokenab
Token ID397
Feature activation+2.41e-02

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
 generation+0.001
Warning+0.001
Neg logit contributions
ades-0.001
uddle-0.001
 sho-0.001
brush-0.001
aid-0.001
Tokenra
Token ID430
Feature activation+2.49e-02

Pos logit contributions
ades+0.038
uddle+0.032
 sho+0.032
brush+0.032
aid+0.032
Neg logit contributions
 You-0.041
|-0.040
 seem-0.037
 generation-0.036
Warning-0.036
Token!"
Token ID2474
Feature activation-1.14e-02

Pos logit contributions
ades+0.039
uddle+0.035
 sho+0.033
aid+0.033
 brand+0.033
Neg logit contributions
 You-0.043
|-0.040
 seem-0.039
Warning-0.036
 generation-0.035
Token
Token ID198
Feature activation-3.29e-03

Pos logit contributions
ades+0.000
 sho+0.000
uddle+0.000
brush+0.000
aid+0.000
Neg logit contributions
 You-0.000
|-0.000
 seem-0.000
Warning-0.000
 generation-0.000
Token"
Token ID1
Feature activation+4.69e-03

Pos logit contributions
 You+0.006
|+0.006
 seem+0.006
Warning+0.006
 generation+0.005
Neg logit contributions
ades-0.005
 sho-0.004
uddle-0.004
brush-0.004
 brand-0.004
TokenCan
Token ID6090
Feature activation-7.60e-03

Pos logit contributions
ades+0.009
 sho+0.008
brush+0.008
uddle+0.008
aid+0.008
Neg logit contributions
 You-0.006
|-0.006
 seem-0.006
Warning-0.005
 generation-0.005
Token we
Token ID356
Feature activation+2.49e-02

Pos logit contributions
 You+0.008
|+0.008
 seem+0.007
Warning+0.007
 generation+0.007
Neg logit contributions
ades-0.016
 sho-0.015
uddle-0.015
brush-0.015
aid-0.014
Token have
Token ID423
Feature activation+4.53e-02

Pos logit contributions
ades+0.062
uddle+0.057
udd+0.054
aj+0.053
 sho+0.053
Neg logit contributions
 You-0.024
 seem-0.023
|-0.022
 Today-0.020
 Time-0.017
Token some
Token ID617
Feature activation+7.71e-02

Pos logit contributions
uddle+0.108
ades+0.102
uing+0.096
udd+0.094
brush+0.092
Neg logit contributions
 You-0.046
 seem-0.038
|-0.034
You-0.034
 This-0.032
Token cake
Token ID12187
Feature activation+6.96e-02

Pos logit contributions
ades+0.132
 sho+0.123
aid+0.122
uing+0.121
clip+0.120
Neg logit contributions
 You-0.107
 Today-0.103
 seem-0.087
 night-0.080
 day-0.079
Token,
Token ID11
Feature activation+3.16e-02

Pos logit contributions
ades+0.121
 sho+0.108
aid+0.108
usted+0.107
 sudden+0.105
Neg logit contributions
 You-0.106
 Today-0.097
 seem-0.090
|-0.088
 That-0.079
Token Mom
Token ID11254
Feature activation+2.40e-03

Pos logit contributions
ades+0.059
aid+0.052
uddle+0.051
ade+0.050
act+0.050
Neg logit contributions
 You-0.053
 Today-0.043
 seem-0.042
You-0.038
 Come-0.038
Tokenmy
Token ID1820
Feature activation+2.63e-03

Pos logit contributions
ades+0.004
uddle+0.003
 sho+0.003
aid+0.003
 penn+0.003
Neg logit contributions
 You-0.004
|-0.004
 seem-0.004
 Today-0.004
rated-0.004
Token?"
Token ID1701
Feature activation+2.77e-03

Pos logit contributions
ades+0.004
uddle+0.003
 sho+0.003
aid+0.003
brush+0.003
Neg logit contributions
 You-0.005
|-0.005
 seem-0.005
 Today-0.004
rated-0.004
Token said
Token ID531
Feature activation+2.23e-02

Pos logit contributions
ades+0.058
uddle+0.056
 penn+0.053
 brand+0.052
 cream+0.051
Neg logit contributions
 seem-0.030
|-0.028
Warning-0.028
 You-0.026
 generation-0.026
Token together
Token ID1978
Feature activation+1.42e-02

Pos logit contributions
ades+0.032
uddle+0.031
brush+0.028
 sho+0.028
ade+0.028
Neg logit contributions
 You-0.041
 seem-0.036
 Listen-0.034
 Come-0.033
 That-0.033
Token,
Token ID11
Feature activation-6.45e-03

Pos logit contributions
ades+0.022
uddle+0.020
 sho+0.020
aid+0.019
brush+0.019
Neg logit contributions
 You-0.023
|-0.022
 seem-0.022
Warning-0.021
 generation-0.021
Token "
Token ID366
Feature activation-1.04e-02

Pos logit contributions
|+0.011
 You+0.010
 seem+0.010
Warning+0.009
 generation+0.009
Neg logit contributions
ades-0.010
 sho-0.009
brush-0.009
uddle-0.009
aid-0.009
TokenA
Token ID32
Feature activation+1.61e-02

Pos logit contributions
|+0.015
 You+0.015
 seem+0.014
rated+0.014
keeping+0.014
Neg logit contributions
ades-0.018
aid-0.016
brush-0.016
 sho-0.015
 scissors-0.015
Tokenbr
Token ID1671
Feature activation+7.71e-02

Pos logit contributions
ades+0.026
 sho+0.023
uddle+0.023
brush+0.022
aid+0.022
Neg logit contributions
|-0.026
 You-0.026
 seem-0.024
Warning-0.024
 generation-0.023
Tokenac
Token ID330
Feature activation+2.02e-02

Pos logit contributions
 brand+0.113
 sunscreen+0.106
ades+0.105
 sho+0.100
 handsome+0.098
Neg logit contributions
 You-0.123
|-0.123
 Today-0.119
 seem-0.115
ses-0.111
Tokenad
Token ID324
Feature activation-4.39e-03

Pos logit contributions
ades+0.039
uddle+0.035
 sho+0.035
 brand+0.034
brush+0.034
Neg logit contributions
 You-0.027
|-0.027
 seem-0.024
Warning-0.024
 generation-0.023
Tokenab
Token ID397
Feature activation+2.57e-02

Pos logit contributions
 You+0.008
|+0.008
 seem+0.008
Warning+0.008
 generation+0.008
Neg logit contributions
ades-0.006
 sho-0.005
brush-0.005
uddle-0.005
aid-0.005
Tokenra
Token ID430
Feature activation+2.49e-02

Pos logit contributions
ades+0.039
uddle+0.034
 sho+0.034
brush+0.034
aid+0.033
Neg logit contributions
 You-0.044
|-0.043
 seem-0.040
Warning-0.039
 generation-0.039
Token!"
Token ID2474
Feature activation+3.07e-03

Pos logit contributions
ades+0.036
uddle+0.032
 sho+0.031
aid+0.031
brush+0.030
Neg logit contributions
 You-0.045
|-0.043
 seem-0.041
Warning-0.039
 generation-0.039
Token.
Token ID13
Feature activation-7.34e-03

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
Warning+0.001
 generation+0.001
Neg logit contributions
ades-0.001
 sho-0.001
brush-0.001
uddle-0.001
aid-0.001
Token Now
Token ID2735
Feature activation-5.61e-03

Pos logit contributions
|+0.010
 seem+0.010
Warning+0.010
rated+0.009
 You+0.009
Neg logit contributions
ades-0.012
brush-0.011
uddle-0.011
 sho-0.011
 brand-0.011
Token she
Token ID673
Feature activation+3.14e-03

Pos logit contributions
 You+0.010
|+0.010
 seem+0.009
Warning+0.009
 generation+0.009
Neg logit contributions
ades-0.008
 sho-0.007
uddle-0.007
brush-0.007
aid-0.007
Token had
Token ID550
Feature activation+8.56e-03

Pos logit contributions
ades+0.005
uddle+0.005
 sho+0.005
brush+0.005
aid+0.005
Neg logit contributions
 You-0.005
|-0.005
 seem-0.004
 generation-0.004
Warning-0.004
Token a
Token ID257
Feature activation+3.24e-02

Pos logit contributions
ades+0.016
uddle+0.014
aid+0.014
uing+0.013
brush+0.012
Neg logit contributions
 You-0.012
|-0.012
 seem-0.011
keeping-0.011
 generation-0.011
Token brand
Token ID4508
Feature activation+7.69e-02

Pos logit contributions
ades+0.044
usted+0.037
aid+0.036
ins+0.035
brush+0.034
Neg logit contributions
|-0.060
 You-0.060
 generation-0.058
 seem-0.056
 Time-0.055
Token new
Token ID649
Feature activation+2.38e-02

Pos logit contributions
usted+0.097
 swapped+0.095
brush+0.094
ades+0.094
umbers+0.093
Neg logit contributions
 You-0.122
|-0.105
 Today-0.105
 moment-0.103
 generation-0.101
Token brush
Token ID14093
Feature activation+3.33e-02

Pos logit contributions
ades+0.031
aid+0.026
brush+0.024
usted+0.024
uddle+0.024
Neg logit contributions
 You-0.046
|-0.044
 generation-0.043
 Today-0.042
 seem-0.040
Token and
Token ID290
Feature activation-8.52e-03

Pos logit contributions
ades+0.052
uddle+0.044
 sho+0.043
aid+0.043
 penn+0.042
Neg logit contributions
 You-0.057
|-0.055
 seem-0.052
 generation-0.050
keeping-0.049
Token she
Token ID673
Feature activation+2.68e-03

Pos logit contributions
 You+0.013
|+0.012
 seem+0.012
 generation+0.011
keeping+0.011
Neg logit contributions
ades-0.015
uddle-0.013
aid-0.012
 sho-0.012
brush-0.012
Token would
Token ID561
Feature activation+9.26e-03

Pos logit contributions
ades+0.005
uddle+0.004
aid+0.004
 sho+0.004
brush+0.004
Neg logit contributions
 You-0.004
|-0.004
 seem-0.003
 generation-0.003
Warning-0.003
Token
Token ID198
Feature activation-7.74e-05

Pos logit contributions
ades+0.003
 sho+0.003
uddle+0.003
brush+0.003
aid+0.003
Neg logit contributions
 You-0.002
|-0.002
 seem-0.002
Warning-0.002
 generation-0.002
TokenLittle
Token ID22253
Feature activation-6.08e-03

Pos logit contributions
 You+0.000
|+0.000
 seem+0.000
Warning+0.000
 generation+0.000
Neg logit contributions
ades-0.000
 sho-0.000
uddle-0.000
brush-0.000
aid-0.000
Token Jack
Token ID3619
Feature activation+8.29e-03

Pos logit contributions
|+0.010
 You+0.010
 generation+0.010
Warning+0.009
 seem+0.009
Neg logit contributions
ades-0.009
uddle-0.008
 sho-0.008
brush-0.008
 scissors-0.008
Token gasped
Token ID45236
Feature activation-1.58e-03

Pos logit contributions
ades+0.015
aid+0.013
uddle+0.013
brush+0.013
 penn+0.013
Neg logit contributions
|-0.012
 You-0.012
 seem-0.011
 generation-0.011
Warning-0.011
Token in
Token ID287
Feature activation+1.16e-02

Pos logit contributions
 You+0.003
|+0.003
 seem+0.003
 generation+0.003
Warning+0.003
Neg logit contributions
ades-0.002
uddle-0.002
 sho-0.002
brush-0.002
aid-0.002
Token amaz
Token ID40642
Feature activation+7.69e-02

Pos logit contributions
ades+0.018
uddle+0.016
brush+0.015
aid+0.015
 sho+0.015
Neg logit contributions
 You-0.021
|-0.020
 seem-0.019
 generation-0.018
Warning-0.018
Tokenement
Token ID972
Feature activation+7.29e-03

Pos logit contributions
 sho+0.118
 dads+0.116
aid+0.115
 sunscreen+0.114
ip+0.110
Neg logit contributions
 You-0.112
 seem-0.097
 Today-0.090
 generation-0.089
Warning-0.089
Token.
Token ID13
Feature activation-8.14e-03

Pos logit contributions
ades+0.011
 sho+0.010
uddle+0.010
brush+0.010
aid+0.010
Neg logit contributions
 You-0.012
|-0.012
 seem-0.011
 generation-0.011
Warning-0.011
Token Dad
Token ID17415
Feature activation+1.50e-02

Pos logit contributions
|+0.011
 seem+0.009
 You+0.009
Warning+0.009
keeping+0.009
Neg logit contributions
ades-0.015
 brand-0.014
 sho-0.014
uddle-0.014
brush-0.013
Token pointed
Token ID6235
Feature activation+2.99e-02

Pos logit contributions
ades+0.025
aid+0.023
 penn+0.023
uddle+0.023
 sunscreen+0.022
Neg logit contributions
 You-0.023
|-0.022
 seem-0.021
elope-0.019
 Time-0.019
Token to
Token ID284
Feature activation-6.10e-03

Pos logit contributions
ades+0.056
uddle+0.054
aid+0.054
igs+0.051
brush+0.051
Neg logit contributions
 You-0.042
 Today-0.036
 seem-0.035
 That-0.034
 Welcome-0.033
Token the
Token ID262
Feature activation-1.13e-02

Pos logit contributions
 You+0.015
|+0.015
Warning+0.014
 seem+0.013
rated+0.013
Neg logit contributions
ades-0.007
 sho-0.006
 scissors-0.006
brush-0.006
aid-0.005
Token noise
Token ID7838
Feature activation-3.98e-02

Pos logit contributions
|+0.017
 You+0.017
Warning+0.015
 seem+0.015
 generation+0.015
Neg logit contributions
ades-0.020
 sho-0.017
uddle-0.017
brush-0.017
aid-0.017
Token until
Token ID1566
Feature activation-5.98e-03

Pos logit contributions
|+0.065
 You+0.065
Warning+0.058
 seem+0.057
 generation+0.057
Neg logit contributions
ades-0.065
 sho-0.057
brush-0.056
uddle-0.055
aid-0.054
Token he
Token ID339
Feature activation-1.60e-05

Pos logit contributions
|+0.012
 You+0.012
Warning+0.011
keeping+0.011
rated+0.011
Neg logit contributions
ades-0.007
 sho-0.006
brush-0.006
 penn-0.006
 scissors-0.006
Token stumbled
Token ID24241
Feature activation-1.39e-03

Pos logit contributions
|+0.000
 seem+0.000
 You+0.000
Warning+0.000
 generation+0.000
Neg logit contributions
ades-0.000
 penn-0.000
aid-0.000
 sunscreen-0.000
 apr-0.000
Token upon
Token ID2402
Feature activation-8.02e-02

Pos logit contributions
 You+0.003
|+0.002
 seem+0.002
 generation+0.002
Warning+0.002
Neg logit contributions
ades-0.002
brush-0.002
 sho-0.002
uddle-0.002
ade-0.002
Token a
Token ID257
Feature activation+1.83e-03

Pos logit contributions
|+0.212
rated+0.197
elope+0.196
 You+0.193
Warning+0.192
Neg logit contributions
 off-0.043
 more-0.041
 crown-0.040
 exactly-0.038
ades-0.038
Token big
Token ID1263
Feature activation+1.12e-02

Pos logit contributions
ades+0.003
aid+0.002
usted+0.002
brush+0.002
 penn+0.002
Neg logit contributions
 You-0.003
|-0.003
 seem-0.003
 generation-0.003
Warning-0.003
Token surprise
Token ID5975
Feature activation+1.26e-02

Pos logit contributions
ades+0.016
aid+0.014
brush+0.013
uddle+0.013
 sho+0.013
Neg logit contributions
 You-0.021
|-0.020
 seem-0.019
 generation-0.018
Warning-0.018
Token.
Token ID13
Feature activation-7.11e-03

Pos logit contributions
ades+0.021
 sho+0.018
brush+0.018
uddle+0.018
 cream+0.017
Neg logit contributions
|-0.021
 You-0.020
 seem-0.019
Warning-0.018
 generation-0.018
Token
Token ID220
Feature activation-8.23e-03

Pos logit contributions
|+0.011
 You+0.010
Warning+0.010
rated+0.010
 seem+0.009
Neg logit contributions
ades-0.012
 sho-0.011
 brand-0.010
brush-0.010
uddle-0.010
Token what
Token ID644
Feature activation-1.80e-03

Pos logit contributions
ades+0.027
pron+0.027
brush+0.026
uddle+0.026
ip+0.025
Neg logit contributions
 You-0.022
 seem-0.021
 generation-0.020
You-0.018
 temptation-0.018
Token was
Token ID373
Feature activation-1.77e-02

Pos logit contributions
 You+0.003
|+0.003
 seem+0.003
Warning+0.002
 generation+0.002
Neg logit contributions
ades-0.003
uddle-0.003
brush-0.003
aid-0.003
 sho-0.003
Token around
Token ID1088
Feature activation-3.21e-02

Pos logit contributions
 You+0.026
|+0.024
 seem+0.023
 generation+0.021
Warning+0.021
Neg logit contributions
ades-0.033
uddle-0.030
brush-0.029
aid-0.028
 sho-0.028
Token the
Token ID262
Feature activation-2.65e-02

Pos logit contributions
|+0.064
 You+0.060
Warning+0.056
 seem+0.056
 generation+0.055
Neg logit contributions
ades-0.039
 sho-0.036
brush-0.034
 scissors-0.033
 penn-0.033
Token river
Token ID7850
Feature activation-3.55e-02

Pos logit contributions
 You+0.042
|+0.040
 seem+0.038
Warning+0.037
 generation+0.037
Neg logit contributions
ades-0.044
aid-0.040
brush-0.040
 sho-0.038
uddle-0.038
Tokenâ
Token ID22940
Feature activation-7.96e-02

Pos logit contributions
|+0.060
 You+0.058
Warning+0.054
 generation+0.053
 seem+0.051
Neg logit contributions
ades-0.056
uddle-0.048
brush-0.048
 sho-0.048
ade-0.047
Token
Token ID26391
Feature activation-6.16e-02

Pos logit contributions
ses+0.093
keeping+0.085
Warning+0.082
 seem+0.082
 gag+0.081
Neg logit contributions
brush-0.126
is-0.126
up-0.122
wing-0.121
 swapped-0.119
Token
Token ID8151
Feature activation-1.34e-02

Pos logit contributions
Warning+0.102
 You+0.102
rated+0.099
|+0.098
 generation+0.096
Neg logit contributions
ades-0.098
uddle-0.084
brush-0.079
udd-0.076
ade-0.075
Tokens
Token ID82
Feature activation-2.30e-02

Pos logit contributions
|+0.023
 You+0.022
 generation+0.021
Warning+0.021
 seem+0.020
Neg logit contributions
ades-0.021
uddle-0.017
 sho-0.017
brush-0.017
ade-0.017
Token bend
Token ID19396
Feature activation-1.62e-02

Pos logit contributions
 You+0.033
|+0.032
 seem+0.030
 generation+0.029
Warning+0.029
Neg logit contributions
ades-0.041
aid-0.037
 sho-0.037
brush-0.036
uddle-0.036
Token.
Token ID13
Feature activation-3.02e-03

Pos logit contributions
 You+0.028
|+0.028
 seem+0.026
Warning+0.025
 generation+0.025
Neg logit contributions
ades-0.025
 sho-0.021
uddle-0.021
brush-0.021
aid-0.020
Token exploring
Token ID13504
Feature activation+2.43e-02

Pos logit contributions
|+0.063
rated+0.061
 You+0.059
keeping+0.057
 seem+0.056
Neg logit contributions
ades-0.041
 friends-0.039
 penn-0.037
 sho-0.037
 scissors-0.037
Token,
Token ID11
Feature activation-1.58e-02

Pos logit contributions
ades+0.028
brush+0.022
aid+0.022
uddle+0.022
 sho+0.022
Neg logit contributions
 You-0.052
|-0.051
 seem-0.049
 generation-0.047
Warning-0.047
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
|+0.036
rated+0.034
keeping+0.033
 You+0.033
Warning+0.033
Neg logit contributions
 scissors-0.015
ades-0.014
 sho-0.014
 sunscreen-0.014
 penn-0.013
Token cat
Token ID3797
Feature activation-2.68e-02

Pos logit contributions
|+0.025
 You+0.024
 seem+0.023
Warning+0.022
 generation+0.022
Neg logit contributions
ades-0.014
 sho-0.013
uddle-0.013
brush-0.013
 scissors-0.012
Token stumbled
Token ID24241
Feature activation-4.65e-03

Pos logit contributions
 You+0.050
|+0.050
 seem+0.046
Warning+0.045
 generation+0.045
Neg logit contributions
ades-0.037
 sho-0.031
uddle-0.031
brush-0.031
aid-0.030
Token upon
Token ID2402
Feature activation-7.86e-02

Pos logit contributions
 You+0.009
|+0.009
 seem+0.008
 generation+0.008
Warning+0.008
Neg logit contributions
ades-0.006
 sho-0.005
brush-0.005
uddle-0.005
ade-0.005
Token a
Token ID257
Feature activation+1.17e-02

Pos logit contributions
|+0.209
keeping+0.202
elope+0.198
rated+0.197
 MacDonald+0.196
Neg logit contributions
 more-0.040
 all-0.038
 butter-0.038
 big-0.037
 scissors-0.035
Token large
Token ID1588
Feature activation+7.30e-03

Pos logit contributions
ades+0.017
aid+0.016
usted+0.015
brush+0.014
ins+0.014
Neg logit contributions
 You-0.020
 seem-0.019
|-0.019
 generation-0.018
 Today-0.017
Token shadow
Token ID9082
Feature activation-2.63e-02

Pos logit contributions
ades+0.011
aid+0.010
brush+0.009
uddle+0.009
 sho+0.009
Neg logit contributions
 You-0.013
|-0.012
 generation-0.012
 seem-0.011
 Today-0.011
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
|+0.054
 You+0.052
keeping+0.050
 generation+0.050
 seem+0.048
Neg logit contributions
ades-0.031
brush-0.029
 sho-0.028
ade-0.026
uddle-0.026
Token The
Token ID383
Feature activation-5.97e-03

Pos logit contributions
|+0.020
keeping+0.019
rated+0.019
 generation+0.019
Warning+0.018
Neg logit contributions
ades-0.018
 sho-0.018
brush-0.017
 brand-0.017
uddle-0.017
Token felt
Token ID2936
Feature activation+3.83e-03

Pos logit contributions
ades+0.019
uddle+0.017
aid+0.017
brush+0.017
 sho+0.017
Neg logit contributions
 You-0.019
|-0.018
 seem-0.017
 generation-0.016
Warning-0.016
Token like
Token ID588
Feature activation-1.12e-02

Pos logit contributions
ades+0.007
uddle+0.006
brush+0.006
 sho+0.006
aid+0.006
Neg logit contributions
 You-0.006
|-0.006
 seem-0.005
 generation-0.005
Warning-0.005
Token hours
Token ID2250
Feature activation-4.83e-03

Pos logit contributions
 You+0.019
|+0.017
 seem+0.017
 generation+0.017
Warning+0.016
Neg logit contributions
ades-0.018
uddle-0.016
aid-0.015
 sho-0.015
brush-0.015
Token before
Token ID878
Feature activation-4.18e-02

Pos logit contributions
 You+0.009
|+0.009
 seem+0.008
Warning+0.008
 generation+0.008
Neg logit contributions
ades-0.007
 sho-0.006
brush-0.006
uddle-0.006
aid-0.005
Token stumbling
Token ID37829
Feature activation-1.74e-03

Pos logit contributions
|+0.083
 You+0.076
rated+0.073
Warning+0.069
 generation+0.068
Neg logit contributions
ades-0.057
 brand-0.048
 exactly-0.047
 sunscreen-0.046
 friends-0.045
Token upon
Token ID2402
Feature activation-7.81e-02

Pos logit contributions
|+0.003
 You+0.003
Warning+0.003
 seem+0.003
 generation+0.003
Neg logit contributions
ades-0.003
 sho-0.002
brush-0.002
uddle-0.002
ade-0.002
Token a
Token ID257
Feature activation+6.19e-03

Pos logit contributions
|+0.164
 You+0.151
rated+0.149
ses+0.147
Warning+0.146
Neg logit contributions
ades-0.084
 exactly-0.076
 sho-0.069
 friends-0.068
 cream-0.067
Token strange
Token ID6283
Feature activation-3.25e-03

Pos logit contributions
ades+0.010
aid+0.008
brush+0.008
 sho+0.008
 penn+0.008
Neg logit contributions
 You-0.010
|-0.010
 seem-0.010
 generation-0.010
Warning-0.009
Token house
Token ID2156
Feature activation+4.48e-03

Pos logit contributions
 You+0.006
|+0.006
Warning+0.005
 seem+0.005
 generation+0.005
Neg logit contributions
ades-0.005
 sho-0.004
uddle-0.004
brush-0.004
ade-0.004
Token.
Token ID13
Feature activation+5.32e-04

Pos logit contributions
ades+0.008
brush+0.007
 sho+0.007
uddle+0.007
aid+0.007
Neg logit contributions
|-0.007
 You-0.007
 seem-0.006
Warning-0.006
 generation-0.006
Token The
Token ID383
Feature activation+4.85e-04

Pos logit contributions
ades+0.001
 brand+0.001
 exactly+0.001
uddle+0.001
 friends+0.001
Neg logit contributions
|-0.001
keeping-0.001
 You-0.001
rated-0.001
Warning-0.001
Token,
Token ID11
Feature activation-4.45e-03

Pos logit contributions
|+0.054
 You+0.053
 generation+0.050
 seem+0.049
Warning+0.049
Neg logit contributions
ades-0.039
 sho-0.035
uddle-0.034
brush-0.034
aid-0.032
Token it
Token ID340
Feature activation-2.12e-02

Pos logit contributions
 You+0.007
|+0.006
 seem+0.006
Warning+0.006
 generation+0.006
Neg logit contributions
ades-0.008
uddle-0.007
 sho-0.007
brush-0.007
aid-0.007
Token's
Token ID338
Feature activation-2.42e-02

Pos logit contributions
|+0.036
 You+0.035
 generation+0.034
keeping+0.034
Warning+0.033
Neg logit contributions
ades-0.031
 sho-0.030
uddle-0.028
brush-0.028
ade-0.027
Token too
Token ID1165
Feature activation-2.12e-02

Pos logit contributions
|+0.049
keeping+0.046
 generation+0.045
Warning+0.045
 You+0.045
Neg logit contributions
 sho-0.027
 friends-0.026
 exactly-0.025
 helmet-0.025
 brand-0.025
Token dangerous
Token ID4923
Feature activation-2.82e-02

Pos logit contributions
 You+0.045
|+0.044
keeping+0.044
 generation+0.043
Warning+0.042
Neg logit contributions
ades-0.024
ade-0.020
 sho-0.020
 cream-0.020
uddle-0.020
Tokenâ
Token ID22940
Feature activation-7.69e-02

Pos logit contributions
|+0.062
 generation+0.058
Warning+0.056
elope+0.055
 seem+0.053
Neg logit contributions
ades-0.027
 cream-0.026
 off-0.024
,-0.023
 sho-0.023
Token
Token ID26391
Feature activation-3.89e-02

Pos logit contributions
ses+0.097
sing+0.088
elope+0.087
keeping+0.086
 generation+0.084
Neg logit contributions
brush-0.120
up-0.117
ade-0.117
aid-0.116
 cream-0.116
Token.
Token ID13
Feature activation-6.66e-03

Pos logit contributions
|+0.065
Warning+0.064
rated+0.064
 You+0.063
 generation+0.062
Neg logit contributions
ades-0.059
uddle-0.052
brush-0.049
ip-0.048
 sho-0.047
Token
Token ID220
Feature activation-9.54e-03

Pos logit contributions
|+0.006
 seem+0.006
 generation+0.006
rated+0.006
Warning+0.005
Neg logit contributions
ades-0.014
 sho-0.013
brush-0.013
uddle-0.013
aid-0.013
Token
Token ID198
Feature activation+7.26e-03

Pos logit contributions
|+0.016
 generation+0.015
keeping+0.015
rated+0.014
 You+0.014
Neg logit contributions
ades-0.015
uddle-0.013
brush-0.012
ade-0.012
 sho-0.012
TokenMax
Token ID11518
Feature activation+1.07e-03

Pos logit contributions
ades+0.013
 sho+0.012
uddle+0.012
brush+0.012
aid+0.012
Neg logit contributions
 You-0.010
|-0.010
 seem-0.009
Warning-0.009
 generation-0.009
Token forest
Token ID8222
Feature activation-3.41e-04

Pos logit contributions
ades+0.016
aid+0.014
brush+0.013
ade+0.012
usted+0.012
Neg logit contributions
 You-0.016
|-0.016
 generation-0.015
elope-0.015
 seem-0.015
Token home
Token ID1363
Feature activation+3.27e-04

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
Warning+0.001
 generation+0.001
Neg logit contributions
ades-0.000
 sho-0.000
uddle-0.000
brush-0.000
 penn-0.000
Token until
Token ID1566
Feature activation-2.63e-03

Pos logit contributions
ades+0.001
 sho+0.000
uddle+0.000
brush+0.000
aid+0.000
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
Warning-0.000
 generation-0.000
Token he
Token ID339
Feature activation-4.01e-03

Pos logit contributions
|+0.005
 You+0.005
Warning+0.005
keeping+0.005
rated+0.005
Neg logit contributions
ades-0.003
 sho-0.003
 sunscreen-0.003
 penn-0.003
brush-0.003
Token came
Token ID1625
Feature activation-1.53e-02

Pos logit contributions
 You+0.007
|+0.007
 seem+0.006
Warning+0.006
 generation+0.006
Neg logit contributions
ades-0.006
brush-0.005
 sho-0.005
uddle-0.005
aid-0.005
Token upon
Token ID2402
Feature activation-7.65e-02

Pos logit contributions
|+0.033
 You+0.032
Warning+0.030
keeping+0.030
 seem+0.030
Neg logit contributions
ades-0.016
 friends-0.015
 sho-0.014
 friend-0.013
 exactly-0.013
Token a
Token ID257
Feature activation+3.43e-03

Pos logit contributions
elope+0.190
keeping+0.189
|+0.183
Warning+0.182
 MacDonald+0.178
Neg logit contributions
 more-0.046
 off-0.042
 cream-0.039
 exactly-0.038
 big-0.038
Token cottage
Token ID36856
Feature activation-5.23e-03

Pos logit contributions
ades+0.004
aid+0.003
brush+0.003
 sho+0.003
 penn+0.003
Neg logit contributions
 You-0.007
|-0.007
 seem-0.007
 generation-0.007
Warning-0.007
Token.
Token ID13
Feature activation-1.24e-03

Pos logit contributions
|+0.010
 You+0.009
 seem+0.009
Warning+0.009
 generation+0.009
Neg logit contributions
ades-0.007
 cream-0.006
brush-0.006
 sho-0.006
uddle-0.006
Token
Token ID198
Feature activation+3.59e-03

Pos logit contributions
|+0.002
rated+0.002
Warning+0.002
keeping+0.002
 You+0.002
Neg logit contributions
ades-0.002
 sho-0.001
 brand-0.001
 handful-0.001
uddle-0.001
Token
Token ID198
Feature activation-2.48e-04

Pos logit contributions
ades+0.007
 sho+0.006
uddle+0.006
brush+0.006
aid+0.006
Neg logit contributions
 You-0.005
|-0.005
 seem-0.004
Warning-0.004
 generation-0.004
Token walking
Token ID6155
Feature activation+2.16e-02

Pos logit contributions
 You+0.007
|+0.007
 seem+0.006
 generation+0.006
Warning+0.005
Neg logit contributions
ades-0.014
uddle-0.013
brush-0.012
aid-0.012
 sho-0.012
Token along
Token ID1863
Feature activation-1.25e-03

Pos logit contributions
ades+0.031
uddle+0.026
aid+0.025
brush+0.025
 sho+0.024
Neg logit contributions
 You-0.040
|-0.039
 seem-0.037
 Today-0.037
 generation-0.036
Token,
Token ID11
Feature activation-1.93e-02

Pos logit contributions
 You+0.003
|+0.003
 seem+0.003
Warning+0.003
 generation+0.003
Neg logit contributions
ades-0.001
uddle-0.001
 sho-0.001
brush-0.001
aid-0.001
Token he
Token ID339
Feature activation-8.36e-03

Pos logit contributions
|+0.036
 You+0.035
keeping+0.032
rated+0.031
Warning+0.031
Neg logit contributions
ades-0.026
 sho-0.024
 penn-0.022
brush-0.022
 brand-0.022
Token stumbled
Token ID24241
Feature activation-1.79e-03

Pos logit contributions
|+0.013
 You+0.013
 seem+0.012
 generation+0.012
Warning+0.011
Neg logit contributions
ades-0.014
aid-0.012
uddle-0.012
 penn-0.012
brush-0.012
Token upon
Token ID2402
Feature activation-7.62e-02

Pos logit contributions
 You+0.003
|+0.003
 seem+0.003
 generation+0.003
Warning+0.003
Neg logit contributions
ades-0.003
 sho-0.002
brush-0.002
uddle-0.002
aid-0.002
Token an
Token ID281
Feature activation+3.75e-03

Pos logit contributions
|+0.189
 You+0.178
keeping+0.178
elope+0.177
rated+0.177
Neg logit contributions
 sho-0.051
 crown-0.049
 more-0.047
 exactly-0.047
 scissors-0.046
Token old
Token ID1468
Feature activation-1.31e-02

Pos logit contributions
ades+0.006
uddle+0.005
aid+0.005
brush+0.005
 sho+0.005
Neg logit contributions
 You-0.006
|-0.006
 seem-0.006
 generation-0.006
Warning-0.006
Token tower
Token ID10580
Feature activation-1.69e-02

Pos logit contributions
 You+0.025
|+0.024
 generation+0.023
 seem+0.022
Warning+0.022
Neg logit contributions
ades-0.018
aid-0.015
uddle-0.015
brush-0.014
 sho-0.014
Token.
Token ID13
Feature activation-9.40e-03

Pos logit contributions
|+0.033
 You+0.031
 generation+0.030
keeping+0.030
Warning+0.029
Neg logit contributions
ades-0.022
brush-0.021
 sho-0.020
ade-0.019
 cream-0.019
Token Sir
Token ID7361
Feature activation-8.62e-03

Pos logit contributions
|+0.014
 You+0.012
rated+0.012
keeping+0.012
 seem+0.011
Neg logit contributions
ades-0.016
 sho-0.015
brush-0.015
 brand-0.015
 helmet-0.014
Token was
Token ID373
Feature activation+4.64e-03

Pos logit contributions
ades+0.034
brush+0.030
 penn+0.030
 sho+0.029
uddle+0.029
Neg logit contributions
|-0.026
 You-0.024
 seem-0.024
Warning-0.022
elope-0.021
Token searching
Token ID10342
Feature activation+4.02e-02

Pos logit contributions
ades+0.011
uddle+0.010
brush+0.010
aid+0.010
 penn+0.010
Neg logit contributions
|-0.004
 You-0.004
 seem-0.004
 generation-0.003
Warning-0.003
Token he
Token ID339
Feature activation+2.26e-03

Pos logit contributions
ades+0.052
brush+0.043
uddle+0.042
aid+0.042
 sho+0.042
Neg logit contributions
 You-0.080
|-0.078
 seem-0.076
Warning-0.072
 generation-0.071
Token soon
Token ID2582
Feature activation-4.54e-03

Pos logit contributions
ades+0.003
brush+0.003
aid+0.003
uddle+0.003
 sho+0.003
Neg logit contributions
 You-0.004
|-0.004
 seem-0.004
Warning-0.004
 generation-0.004
Token stumbled
Token ID24241
Feature activation+6.60e-03

Pos logit contributions
 seem+0.007
 You+0.007
|+0.007
 generation+0.007
Warning+0.007
Neg logit contributions
ades-0.007
uddle-0.006
aid-0.006
 sho-0.006
brush-0.006
Token upon
Token ID2402
Feature activation-7.59e-02

Pos logit contributions
ades+0.010
uddle+0.009
brush+0.008
 sho+0.008
ade+0.008
Neg logit contributions
 You-0.012
|-0.011
 seem-0.011
 generation-0.010
Warning-0.010
Token a
Token ID257
Feature activation+5.99e-03

Pos logit contributions
|+0.192
rated+0.181
 You+0.178
Warning+0.177
elope+0.174
Neg logit contributions
ades-0.045
 crown-0.044
 exactly-0.044
 more-0.042
 friends-0.042
Token large
Token ID1588
Feature activation+5.91e-03

Pos logit contributions
ades+0.009
aid+0.009
usted+0.008
brush+0.008
 penn+0.008
Neg logit contributions
 You-0.010
|-0.010
 seem-0.010
 generation-0.009
 Time-0.009
Token bull
Token ID6473
Feature activation-3.52e-02

Pos logit contributions
ades+0.009
aid+0.008
brush+0.008
 sho+0.008
uddle+0.007
Neg logit contributions
 You-0.010
|-0.010
 generation-0.009
 seem-0.009
 Today-0.009
Token.
Token ID13
Feature activation-9.56e-04

Pos logit contributions
|+0.068
rated+0.064
Warning+0.062
 You+0.062
 seem+0.062
Neg logit contributions
ades-0.042
uddle-0.038
 sho-0.037
 else-0.035
 friends-0.035
Token The
Token ID383
Feature activation-2.30e-03

Pos logit contributions
|+0.001
Warning+0.001
 You+0.001
keeping+0.001
 seem+0.001
Neg logit contributions
ades-0.002
 sho-0.001
 brand-0.001
 handful-0.001
uddle-0.001
Token and
Token ID290
Feature activation-3.38e-02

Pos logit contributions
|+0.044
 You+0.043
Warning+0.041
 generation+0.040
 seem+0.039
Neg logit contributions
ades-0.020
 sho-0.017
 penn-0.017
uddle-0.016
brush-0.016
Token the
Token ID262
Feature activation-1.67e-02

Pos logit contributions
 You+0.065
|+0.064
 seem+0.060
Warning+0.058
 generation+0.058
Neg logit contributions
ades-0.046
 sho-0.038
uddle-0.038
brush-0.038
aid-0.037
Token rain
Token ID6290
Feature activation-2.86e-02

Pos logit contributions
 You+0.030
|+0.030
 seem+0.028
Warning+0.027
 generation+0.027
Neg logit contributions
ades-0.024
uddle-0.020
 sho-0.020
brush-0.020
aid-0.020
Token was
Token ID373
Feature activation-3.08e-02

Pos logit contributions
 You+0.054
|+0.052
 generation+0.049
Warning+0.049
 seem+0.048
Neg logit contributions
ades-0.040
uddle-0.036
brush-0.035
 sho-0.035
ade-0.034
Token falling
Token ID7463
Feature activation-3.10e-02

Pos logit contributions
|+0.040
 You+0.039
 seem+0.038
rated+0.036
Warning+0.033
Neg logit contributions
ades-0.060
uddle-0.054
 penn-0.052
aid-0.051
ingu-0.050
Token heavily
Token ID7272
Feature activation-7.57e-02

Pos logit contributions
|+0.063
 You+0.062
 generation+0.059
Warning+0.058
keeping+0.057
Neg logit contributions
ades-0.037
 sho-0.032
brush-0.032
uddle-0.031
usted-0.030
Token.
Token ID13
Feature activation-4.90e-03

Pos logit contributions
|+0.143
 You+0.141
Warning+0.136
 generation+0.131
ses+0.130
Neg logit contributions
ades-0.090
brush-0.083
 sho-0.079
 brand-0.077
 penn-0.077
Token The
Token ID383
Feature activation-1.50e-05

Pos logit contributions
|+0.008
keeping+0.008
rated+0.008
Warning+0.007
 generation+0.007
Neg logit contributions
ades-0.007
uddle-0.007
 sho-0.006
 brand-0.006
brush-0.006
Token little
Token ID1310
Feature activation-9.96e-03

Pos logit contributions
 You+0.000
|+0.000
 seem+0.000
 generation+0.000
Warning+0.000
Neg logit contributions
ades-0.000
uddle-0.000
brush-0.000
 sho-0.000
aid-0.000
Token bird
Token ID6512
Feature activation-1.20e-02

Pos logit contributions
 You+0.020
|+0.020
 seem+0.018
Warning+0.018
 generation+0.018
Neg logit contributions
ades-0.013
uddle-0.010
brush-0.010
 sho-0.010
aid-0.010
Token was
Token ID373
Feature activation-4.91e-03

Pos logit contributions
|+0.017
 You+0.017
 seem+0.015
rated+0.014
Warning+0.014
Neg logit contributions
ades-0.022
brush-0.019
 sho-0.019
uddle-0.019
aid-0.019
Token the
Token ID262
Feature activation-5.58e-03

Pos logit contributions
ades+0.005
 sho+0.005
brush+0.004
uddle+0.004
aid+0.004
Neg logit contributions
 You-0.008
|-0.008
 seem-0.008
Warning-0.008
 generation-0.007
Token sound
Token ID2128
Feature activation-2.43e-02

Pos logit contributions
 You+0.010
|+0.010
 seem+0.010
Warning+0.009
 generation+0.009
Neg logit contributions
ades-0.008
 sho-0.007
uddle-0.007
brush-0.007
aid-0.006
Token until
Token ID1566
Feature activation+4.21e-03

Pos logit contributions
 seem+0.041
 You+0.041
|+0.039
 generation+0.037
 Welcome+0.035
Neg logit contributions
ades-0.037
uddle-0.033
aid-0.032
ingu-0.031
 sho-0.031
Token she
Token ID673
Feature activation+5.25e-03

Pos logit contributions
ades+0.005
brush+0.005
 sho+0.005
 penn+0.005
 scissors+0.005
Neg logit contributions
 You-0.008
|-0.008
Warning-0.007
keeping-0.007
rated-0.007
Token stumbled
Token ID24241
Feature activation+4.70e-03

Pos logit contributions
ades+0.010
 penn+0.008
aid+0.008
 sunscreen+0.008
 cream+0.008
Neg logit contributions
|-0.008
 seem-0.007
 You-0.007
 generation-0.007
Warning-0.007
Token upon
Token ID2402
Feature activation-7.56e-02

Pos logit contributions
ades+0.007
 sho+0.006
uddle+0.006
brush+0.006
aid+0.006
Neg logit contributions
 You-0.008
|-0.008
 seem-0.008
 generation-0.007
Warning-0.007
Token a
Token ID257
Feature activation+2.27e-03

Pos logit contributions
|+0.184
 You+0.173
rated+0.171
Warning+0.170
keeping+0.167
Neg logit contributions
 off-0.053
 exactly-0.051
 sho-0.050
 more-0.048
 crown-0.047
Token beautiful
Token ID4950
Feature activation-5.12e-03

Pos logit contributions
ades+0.003
aid+0.003
usted+0.003
 penn+0.003
 sho+0.003
Neg logit contributions
 You-0.004
|-0.004
 seem-0.004
 generation-0.004
Warning-0.004
Token bird
Token ID6512
Feature activation-3.42e-02

Pos logit contributions
 You+0.009
|+0.009
 generation+0.008
 seem+0.008
 Today+0.008
Neg logit contributions
ades-0.007
aid-0.007
 sho-0.006
brush-0.006
 penn-0.006
Token perched
Token ID49264
Feature activation-6.41e-03

Pos logit contributions
 You+0.059
|+0.058
Warning+0.055
 seem+0.055
 generation+0.053
Neg logit contributions
ades-0.050
brush-0.045
uddle-0.045
 sho-0.044
 brand-0.042
Token on
Token ID319
Feature activation-3.55e-02

Pos logit contributions
|+0.014
 You+0.014
 seem+0.013
 generation+0.013
Warning+0.012
Neg logit contributions
ades-0.007
 sho-0.006
brush-0.006
uddle-0.006
ade-0.005
Token particularly
Token ID3573
Feature activation-2.58e-02

Pos logit contributions
ades+0.003
aid+0.002
usted+0.002
brush+0.002
 penn+0.002
Neg logit contributions
|-0.003
 You-0.003
 seem-0.003
 generation-0.003
 Today-0.003
Token tall
Token ID7331
Feature activation-2.81e-04

Pos logit contributions
 You+0.056
|+0.056
Warning+0.052
keeping+0.051
 seem+0.050
Neg logit contributions
ades-0.029
uddle-0.025
 sho-0.024
ade-0.024
brush-0.023
Token tree
Token ID5509
Feature activation-2.57e-02

Pos logit contributions
 You+0.001
|+0.001
 seem+0.000
Warning+0.000
 generation+0.000
Neg logit contributions
ades-0.000
 sho-0.000
uddle-0.000
brush-0.000
aid-0.000
Token and
Token ID290
Feature activation-2.06e-02

Pos logit contributions
|+0.046
 You+0.044
Warning+0.042
 seem+0.041
 generation+0.040
Neg logit contributions
ades-0.037
brush-0.033
uddle-0.033
 sho-0.033
ade-0.032
Token stumbled
Token ID24241
Feature activation-1.64e-03

Pos logit contributions
 You+0.034
|+0.034
 seem+0.032
 generation+0.031
Warning+0.030
Neg logit contributions
ades-0.033
uddle-0.029
brush-0.028
 sho-0.028
aid-0.028
Token upon
Token ID2402
Feature activation-7.52e-02

Pos logit contributions
 You+0.003
|+0.003
 generation+0.003
 seem+0.003
Warning+0.002
Neg logit contributions
ades-0.003
brush-0.002
 sho-0.002
uddle-0.002
ade-0.002
Token a
Token ID257
Feature activation+7.38e-03

Pos logit contributions
|+0.189
rated+0.177
elope+0.175
keeping+0.175
 You+0.171
Neg logit contributions
 sho-0.052
 cream-0.048
 scissors-0.047
 exactly-0.047
 more-0.047
Token hard
Token ID1327
Feature activation-9.96e-03

Pos logit contributions
ades+0.011
aid+0.010
usted+0.009
ins+0.009
 apr+0.009
Neg logit contributions
 You-0.013
|-0.012
 seem-0.012
 generation-0.012
 Time-0.011
Token,
Token ID11
Feature activation-9.66e-03

Pos logit contributions
 You+0.019
|+0.019
 seem+0.017
Warning+0.017
 generation+0.017
Neg logit contributions
ades-0.013
 sho-0.012
uddle-0.012
brush-0.011
aid-0.011
Token shiny
Token ID22441
Feature activation+1.73e-02

Pos logit contributions
|+0.020
 You+0.018
keeping+0.018
 generation+0.017
Warning+0.017
Neg logit contributions
 sho-0.011
 brand-0.011
ades-0.011
 helmet-0.010
brush-0.010
Token thing
Token ID1517
Feature activation-2.13e-02

Pos logit contributions
ades+0.026
aid+0.022
usted+0.020
 apr+0.019
brush+0.019
Neg logit contributions
 You-0.032
|-0.029
 generation-0.028
 seem-0.028
 Today-0.028
Token case
Token ID1339
Feature activation-4.61e-03

Pos logit contributions
|+0.014
 You+0.014
 seem+0.013
Warning+0.013
 generation+0.013
Neg logit contributions
ades-0.009
brush-0.008
 sho-0.008
uddle-0.008
aid-0.007
Token sitting
Token ID5586
Feature activation-7.73e-03

Pos logit contributions
|+0.009
 You+0.009
 generation+0.008
keeping+0.008
Warning+0.008
Neg logit contributions
ades-0.006
brush-0.005
 sho-0.005
uddle-0.005
ade-0.005
Token in
Token ID287
Feature activation-3.16e-02

Pos logit contributions
|+0.017
 You+0.016
 seem+0.016
 generation+0.015
Warning+0.015
Neg logit contributions
ades-0.009
 sho-0.007
brush-0.007
uddle-0.007
 cream-0.007
Token the
Token ID262
Feature activation-2.83e-02

Pos logit contributions
|+0.072
 You+0.071
rated+0.069
 seem+0.066
ses+0.065
Neg logit contributions
 scissors-0.025
ades-0.025
 sho-0.025
brush-0.023
 handful-0.023
Token neighbor
Token ID4780
Feature activation-3.32e-02

Pos logit contributions
 You+0.043
|+0.043
 seem+0.041
Warning+0.038
 generation+0.038
Neg logit contributions
ades-0.047
 sho-0.044
brush-0.043
uddle-0.042
aid-0.041
Tokenâ
Token ID22940
Feature activation-7.51e-02

Pos logit contributions
|+0.055
 You+0.053
 seem+0.049
Warning+0.049
rated+0.049
Neg logit contributions
ades-0.050
brush-0.047
 sho-0.046
uddle-0.045
 scissors-0.044
Token
Token ID26391
Feature activation-5.92e-02

Pos logit contributions
ses+0.098
 You+0.096
 seem+0.096
 Nice+0.091
keeping+0.087
Neg logit contributions
brush-0.109
 swapped-0.108
 swap-0.103
ep-0.101
 apr-0.100
Token
Token ID8151
Feature activation+1.56e-02

Pos logit contributions
rated+0.103
Warning+0.100
 You+0.099
|+0.095
 temptation+0.093
Neg logit contributions
ades-0.087
uddle-0.078
aj-0.072
brush-0.072
udd-0.070
Tokens
Token ID82
Feature activation-1.14e-02

Pos logit contributions
ades+0.025
 sho+0.022
uddle+0.022
brush+0.022
aid+0.022
Neg logit contributions
 You-0.025
|-0.025
 seem-0.023
Warning-0.022
 generation-0.022
Token garden
Token ID11376
Feature activation+1.96e-02

Pos logit contributions
 You+0.019
|+0.019
 seem+0.018
 generation+0.017
keeping+0.017
Neg logit contributions
ades-0.018
aid-0.015
uddle-0.015
 sho-0.015
brush-0.015
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
ades+0.035
 sho+0.031
brush+0.031
uddle+0.031
 penn+0.030
Neg logit contributions
|-0.029
 You-0.028
 seem-0.026
Warning-0.025
 generation-0.025
Token his
Token ID465
Feature activation-1.22e-02

Pos logit contributions
 You+0.021
 seem+0.019
|+0.019
 generation+0.019
Warning+0.018
Neg logit contributions
ades-0.020
aid-0.018
uddle-0.018
brush-0.017
udd-0.017
Token journey
Token ID7002
Feature activation+2.15e-03

Pos logit contributions
 You+0.022
|+0.022
 seem+0.020
 generation+0.020
Warning+0.020
Neg logit contributions
ades-0.018
brush-0.015
aid-0.015
uddle-0.015
 sho-0.015
Token,
Token ID11
Feature activation-2.43e-02

Pos logit contributions
ades+0.002
aid+0.002
uddle+0.002
 sho+0.002
brush+0.002
Neg logit contributions
 You-0.005
|-0.004
 seem-0.004
 generation-0.004
Warning-0.004
Token he
Token ID339
Feature activation-6.72e-03

Pos logit contributions
|+0.045
 You+0.045
Warning+0.041
 seem+0.040
keeping+0.040
Neg logit contributions
ades-0.033
 sho-0.029
brush-0.028
 penn-0.028
 scissors-0.028
Token stumbled
Token ID24241
Feature activation+1.99e-03

Pos logit contributions
 seem+0.009
|+0.009
 You+0.009
 generation+0.008
Warning+0.008
Neg logit contributions
ades-0.012
aid-0.011
 penn-0.011
brush-0.010
uddle-0.010
Token upon
Token ID2402
Feature activation-7.48e-02

Pos logit contributions
ades+0.003
 sho+0.003
brush+0.003
uddle+0.003
aid+0.002
Neg logit contributions
 You-0.003
|-0.003
 seem-0.003
 generation-0.003
Warning-0.003
Token a
Token ID257
Feature activation+3.84e-03

Pos logit contributions
|+0.179
 You+0.168
keeping+0.167
elope+0.166
rated+0.165
Neg logit contributions
 sho-0.060
 scissors-0.057
ades-0.056
 cream-0.055
 crown-0.054
Token tree
Token ID5509
Feature activation-1.89e-02

Pos logit contributions
ades+0.006
aid+0.005
usted+0.005
ins+0.005
aj+0.005
Neg logit contributions
 You-0.006
 seem-0.006
|-0.006
 generation-0.006
 Time-0.006
Token with
Token ID351
Feature activation-1.68e-02

Pos logit contributions
|+0.040
 You+0.038
 generation+0.037
Warning+0.036
keeping+0.036
Neg logit contributions
 sho-0.021
brush-0.020
ades-0.020
ade-0.018
 cream-0.018
Token a
Token ID257
Feature activation+6.94e-03

Pos logit contributions
 You+0.025
|+0.023
 seem+0.023
Warning+0.022
 generation+0.022
Neg logit contributions
ades-0.030
uddle-0.028
brush-0.025
uing-0.025
ip-0.025
Token funny
Token ID8258
Feature activation-1.47e-02

Pos logit contributions
ades+0.010
aid+0.009
ins+0.008
ingu+0.008
usted+0.008
Neg logit contributions
 You-0.012
 seem-0.011
 Today-0.011
 Time-0.010
Warning-0.010
Token hours
Token ID2250
Feature activation-5.72e-04

Pos logit contributions
 You+0.017
|+0.017
 seem+0.016
Warning+0.016
 generation+0.015
Neg logit contributions
ades-0.014
 sho-0.012
brush-0.012
aid-0.012
uddle-0.012
Token when
Token ID618
Feature activation-2.35e-02

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
Warning+0.001
 generation+0.001
Neg logit contributions
ades-0.001
 sho-0.001
uddle-0.001
brush-0.001
aid-0.001
Token he
Token ID339
Feature activation+3.53e-03

Pos logit contributions
|+0.052
 You+0.049
keeping+0.048
rated+0.048
Warning+0.047
Neg logit contributions
ades-0.022
 friends-0.021
 helmet-0.019
 exactly-0.018
 sho-0.018
Token suddenly
Token ID6451
Feature activation+7.98e-03

Pos logit contributions
ades+0.005
aid+0.005
brush+0.005
 sho+0.005
 penn+0.004
Neg logit contributions
|-0.006
 You-0.006
 seem-0.006
Warning-0.006
 generation-0.005
Token came
Token ID1625
Feature activation-1.78e-02

Pos logit contributions
ades+0.012
uddle+0.010
 sho+0.010
brush+0.010
aid+0.010
Neg logit contributions
 You-0.014
|-0.014
 seem-0.013
Warning-0.013
 generation-0.013
Token upon
Token ID2402
Feature activation-7.48e-02

Pos logit contributions
|+0.039
 You+0.037
 generation+0.035
Warning+0.035
keeping+0.035
Neg logit contributions
ades-0.019
 friends-0.016
 sho-0.016
 exactly-0.015
 friend-0.014
Token a
Token ID257
Feature activation+4.24e-03

Pos logit contributions
elope+0.180
|+0.177
keeping+0.175
Warning+0.169
 MacDonald+0.165
Neg logit contributions
 off-0.053
 cream-0.046
 exactly-0.045
ades-0.043
 more-0.043
Token big
Token ID1263
Feature activation+6.12e-03

Pos logit contributions
ades+0.005
aid+0.004
brush+0.004
usted+0.004
 sho+0.004
Neg logit contributions
 You-0.008
|-0.008
 seem-0.008
 generation-0.008
Warning-0.007
Token,
Token ID11
Feature activation+7.82e-04

Pos logit contributions
ades+0.010
aid+0.008
brush+0.008
uddle+0.008
 sho+0.008
Neg logit contributions
 You-0.011
|-0.010
 seem-0.009
 generation-0.009
Warning-0.009
Token tall
Token ID7331
Feature activation-1.12e-03

Pos logit contributions
 sho+0.001
ades+0.001
 brand+0.001
brush+0.001
 scissors+0.001
Neg logit contributions
|-0.002
 You-0.002
keeping-0.001
 seem-0.001
 generation-0.001
Token fence
Token ID13990
Feature activation-1.70e-02

Pos logit contributions
|+0.002
 You+0.002
 seem+0.002
Warning+0.002
 generation+0.002
Neg logit contributions
ades-0.001
 sho-0.001
uddle-0.001
brush-0.001
aid-0.001
Token the
Token ID262
Feature activation-1.28e-02

Pos logit contributions
 You+0.034
|+0.034
 seem+0.032
 generation+0.031
Warning+0.031
Neg logit contributions
ades-0.026
uddle-0.022
aid-0.022
brush-0.022
 sho-0.022
Token house
Token ID2156
Feature activation-2.48e-03

Pos logit contributions
 You+0.022
|+0.022
 seem+0.020
Warning+0.020
 generation+0.019
Neg logit contributions
ades-0.019
 sho-0.017
uddle-0.017
brush-0.017
 penn-0.016
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
 You+0.005
|+0.005
 seem+0.005
Warning+0.005
 generation+0.005
Neg logit contributions
ades-0.003
 sho-0.002
uddle-0.002
brush-0.002
aid-0.002
Token soon
Token ID2582
Feature activation-8.31e-03

Pos logit contributions
 You+0.026
|+0.026
 seem+0.024
 generation+0.024
Warning+0.023
Neg logit contributions
ades-0.027
uddle-0.024
aid-0.023
brush-0.023
 sho-0.023
Token stumbled
Token ID24241
Feature activation+2.52e-03

Pos logit contributions
 You+0.014
 seem+0.014
|+0.014
 generation+0.013
keeping+0.012
Neg logit contributions
ades-0.013
aid-0.011
uddle-0.011
brush-0.011
 sho-0.011
Token upon
Token ID2402
Feature activation-7.46e-02

Pos logit contributions
ades+0.004
 sho+0.004
uddle+0.004
brush+0.004
aid+0.003
Neg logit contributions
 You-0.004
|-0.004
 seem-0.004
Warning-0.004
 generation-0.004
Token an
Token ID281
Feature activation+2.10e-04

Pos logit contributions
|+0.170
 You+0.158
rated+0.156
Warning+0.152
keeping+0.151
Neg logit contributions
ades-0.069
 exactly-0.064
 scissors-0.062
 sho-0.061
 brand-0.058
Token old
Token ID1468
Feature activation-1.36e-02

Pos logit contributions
ades+0.000
uddle+0.000
 sho+0.000
brush+0.000
aid+0.000
Neg logit contributions
 You-0.000
|-0.000
 seem-0.000
Warning-0.000
 generation-0.000
Token chest
Token ID7721
Feature activation+1.78e-03

Pos logit contributions
 You+0.025
|+0.025
 seem+0.023
Warning+0.022
 generation+0.022
Neg logit contributions
ades-0.019
 sho-0.017
uddle-0.017
brush-0.016
aid-0.016
Token.
Token ID13
Feature activation-3.42e-03

Pos logit contributions
ades+0.003
 sho+0.003
brush+0.003
uddle+0.003
aid+0.003
Neg logit contributions
 You-0.003
|-0.003
 seem-0.002
Warning-0.002
 generation-0.002
Token Peter
Token ID5613
Feature activation+4.65e-03

Pos logit contributions
|+0.006
rated+0.005
 You+0.005
keeping+0.005
 seem+0.005
Neg logit contributions
ades-0.005
 brand-0.005
 exactly-0.004
usted-0.004
 scissors-0.004
Token the
Token ID262
Feature activation-3.52e-03

Pos logit contributions
rated+0.058
 You+0.056
|+0.055
 seem+0.053
ses+0.052
Neg logit contributions
 scissors-0.011
 sho-0.010
 friends-0.009
 sunscreen-0.009
 brand-0.007
Token woods
Token ID16479
Feature activation-1.18e-02

Pos logit contributions
 You+0.005
|+0.005
 generation+0.004
 seem+0.004
Warning+0.004
Neg logit contributions
ades-0.007
brush-0.006
aid-0.006
 sho-0.006
uddle-0.006
Token when
Token ID618
Feature activation-2.44e-02

Pos logit contributions
|+0.028
 You+0.028
Warning+0.027
rated+0.026
keeping+0.026
Neg logit contributions
ades-0.008
 friends-0.008
 exactly-0.007
 cream-0.007
 else-0.007
Token she
Token ID673
Feature activation-5.61e-03

Pos logit contributions
rated+0.047
|+0.046
 You+0.046
keeping+0.044
Warning+0.042
Neg logit contributions
ades-0.029
 scissors-0.027
 brand-0.026
 friends-0.026
 mom-0.025
Token stumbled
Token ID24241
Feature activation+1.50e-03

Pos logit contributions
|+0.010
 You+0.010
 seem+0.010
 generation+0.009
Warning+0.009
Neg logit contributions
ades-0.008
 sho-0.007
aid-0.007
brush-0.007
uddle-0.006
Token upon
Token ID2402
Feature activation-7.41e-02

Pos logit contributions
ades+0.002
brush+0.002
 sho+0.002
uddle+0.002
ade+0.002
Neg logit contributions
 You-0.003
|-0.002
 seem-0.002
 generation-0.002
Warning-0.002
Token a
Token ID257
Feature activation+1.19e-02

Pos logit contributions
|+0.203
keeping+0.195
rated+0.195
edient+0.193
 You+0.189
Neg logit contributions
 more-0.035
 big-0.030
 scissors-0.028
 all-0.027
 butter-0.025
Token big
Token ID1263
Feature activation+1.84e-02

Pos logit contributions
ades+0.017
aid+0.016
usted+0.016
ins+0.014
aj+0.014
Neg logit contributions
 You-0.020
 seem-0.019
|-0.019
 generation-0.018
 Time-0.018
Token dark
Token ID3223
Feature activation-2.45e-02

Pos logit contributions
ades+0.029
aid+0.025
usted+0.023
brush+0.023
uddle+0.022
Neg logit contributions
 You-0.032
|-0.030
 seem-0.029
 Today-0.029
 generation-0.028
Token billboard
Token ID41448
Feature activation-2.38e-02

Pos logit contributions
 You+0.042
|+0.042
 seem+0.040
Warning+0.038
 generation+0.038
Neg logit contributions
ades-0.036
uddle-0.032
brush-0.032
 sho-0.032
ade-0.031
Token.
Token ID13
Feature activation-1.32e-03

Pos logit contributions
|+0.049
rated+0.043
 seem+0.043
 You+0.043
keeping+0.042
Neg logit contributions
ades-0.031
brush-0.028
 sho-0.027
 cream-0.027
uddle-0.027
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
|+0.079
 You+0.077
Warning+0.073
 seem+0.072
 generation+0.072
Neg logit contributions
ades-0.045
brush-0.038
 sho-0.036
uddle-0.035
 brand-0.035
Token it
Token ID340
Feature activation-1.62e-02

Pos logit contributions
 You+0.030
|+0.029
 seem+0.028
 generation+0.026
Warning+0.026
Neg logit contributions
ades-0.027
uddle-0.024
aid-0.023
brush-0.023
 sho-0.023
Token started
Token ID2067
Feature activation-4.82e-02

Pos logit contributions
|+0.027
 You+0.027
 seem+0.026
Warning+0.024
 generation+0.024
Neg logit contributions
ades-0.025
aid-0.022
 sho-0.021
uddle-0.021
 penn-0.021
Token to
Token ID284
Feature activation-3.60e-02

Pos logit contributions
 You+0.123
|+0.118
 generation+0.112
Warning+0.111
keeping+0.110
Neg logit contributions
ades-0.038
 brand-0.027
 sho-0.027
usted-0.024
 scissors-0.023
Token rain
Token ID6290
Feature activation-5.05e-02

Pos logit contributions
 You+0.071
keeping+0.068
|+0.068
Warning+0.063
 generation+0.062
Neg logit contributions
ades-0.043
 sunscreen-0.040
 scissors-0.039
 fan-0.038
 butter-0.038
Token heavily
Token ID7272
Feature activation-7.40e-02

Pos logit contributions
 You+0.097
Warning+0.091
 generation+0.090
|+0.090
 Have+0.085
Neg logit contributions
ades-0.068
uddle-0.062
brush-0.062
udd-0.060
ade-0.059
Token.
Token ID13
Feature activation-2.17e-03

Pos logit contributions
 You+0.148
|+0.146
Warning+0.141
ses+0.140
 generation+0.138
Neg logit contributions
ades-0.078
brush-0.070
 off-0.069
 cream-0.066
 handful-0.064
Token The
Token ID383
Feature activation-2.89e-03

Pos logit contributions
|+0.003
 You+0.003
Warning+0.002
 seem+0.002
rated+0.002
Neg logit contributions
ades-0.004
uddle-0.004
 sho-0.004
brush-0.004
 penn-0.004
Token little
Token ID1310
Feature activation-1.51e-02

Pos logit contributions
 You+0.006
|+0.006
 seem+0.006
Warning+0.006
 generation+0.006
Neg logit contributions
ades-0.003
 sho-0.003
uddle-0.003
brush-0.003
aid-0.002
Token girl
Token ID2576
Feature activation+1.18e-02

Pos logit contributions
|+0.026
 You+0.025
 seem+0.023
Warning+0.023
 generation+0.022
Neg logit contributions
ades-0.022
 sho-0.021
 penn-0.021
brush-0.021
 scissors-0.020
Token cried
Token ID16896
Feature activation-1.04e-02

Pos logit contributions
ades+0.021
aid+0.018
brush+0.018
uddle+0.018
 sho+0.018
Neg logit contributions
 You-0.018
|-0.018
 seem-0.016
Warning-0.015
 generation-0.015
Token and
Token ID290
Feature activation-1.20e-02

Pos logit contributions
ades+0.015
aid+0.013
brush+0.013
uddle+0.013
ade+0.012
Neg logit contributions
 You-0.015
|-0.014
 seem-0.013
 generation-0.013
keeping-0.012
Token low
Token ID1877
Feature activation+1.04e-03

Pos logit contributions
 You+0.020
|+0.020
 seem+0.019
 generation+0.018
Warning+0.018
Neg logit contributions
ades-0.019
uddle-0.017
brush-0.017
 sho-0.016
aid-0.016
Token until
Token ID1566
Feature activation+4.47e-03

Pos logit contributions
ades+0.001
uddle+0.001
 sho+0.001
brush+0.001
aid+0.001
Neg logit contributions
 You-0.002
|-0.002
 seem-0.002
Warning-0.002
 generation-0.002
Token they
Token ID484
Feature activation+3.83e-03

Pos logit contributions
ades+0.006
 sho+0.005
brush+0.005
uddle+0.005
 penn+0.005
Neg logit contributions
 You-0.008
|-0.008
Warning-0.007
 seem-0.007
keeping-0.007
Token came
Token ID1625
Feature activation-1.31e-02

Pos logit contributions
ades+0.007
uddle+0.006
 cream+0.006
 penn+0.006
aid+0.006
Neg logit contributions
|-0.006
 seem-0.005
 You-0.005
 generation-0.005
Warning-0.005
Token upon
Token ID2402
Feature activation-7.36e-02

Pos logit contributions
|+0.029
 You+0.028
 seem+0.027
Warning+0.027
 generation+0.026
Neg logit contributions
ades-0.014
 sho-0.012
uddle-0.011
 brand-0.011
 friends-0.011
Token the
Token ID262
Feature activation-2.16e-02

Pos logit contributions
|+0.168
elope+0.168
Warning+0.164
keeping+0.163
 MacDonald+0.159
Neg logit contributions
 off-0.058
 exactly-0.052
ades-0.048
 cream-0.044
 sho-0.041
Token most
Token ID749
Feature activation-1.92e-03

Pos logit contributions
 You+0.038
|+0.038
 seem+0.035
Warning+0.034
 generation+0.034
Neg logit contributions
ades-0.033
 sho-0.028
uddle-0.028
brush-0.028
aid-0.027
Token unexpected
Token ID10059
Feature activation-1.65e-02

Pos logit contributions
 You+0.003
 generation+0.003
 seem+0.003
|+0.003
Warning+0.003
Neg logit contributions
ades-0.003
aid-0.002
 sunscreen-0.002
 apr-0.002
ingu-0.002
Token place
Token ID1295
Feature activation-2.67e-02

Pos logit contributions
|+0.030
 You+0.029
 seem+0.026
Warning+0.026
 Today+0.025
Neg logit contributions
ades-0.023
uddle-0.022
 cream-0.021
 sho-0.021
 penn-0.021
Token.
Token ID13
Feature activation-3.87e-03

Pos logit contributions
|+0.048
 You+0.047
Warning+0.045
 seem+0.044
 generation+0.043
Neg logit contributions
ades-0.039
uddle-0.033
 sho-0.032
brush-0.032
 brand-0.032
Token it
Token ID340
Feature activation-1.28e-02

Pos logit contributions
ades+0.018
 sho+0.017
brush+0.017
 penn+0.016
uddle+0.016
Neg logit contributions
|-0.020
 You-0.019
Warning-0.018
 seem-0.018
 generation-0.017
Token seemed
Token ID3947
Feature activation-1.38e-02

Pos logit contributions
 You+0.021
|+0.020
 generation+0.019
keeping+0.019
Warning+0.019
Neg logit contributions
ades-0.020
 sho-0.019
brush-0.018
uddle-0.018
ade-0.018
Token that
Token ID326
Feature activation-2.33e-02

Pos logit contributions
 You+0.024
|+0.023
 seem+0.022
Warning+0.021
 generation+0.021
Neg logit contributions
ades-0.021
uddle-0.018
brush-0.018
 sho-0.018
aid-0.018
Token it
Token ID340
Feature activation-2.48e-02

Pos logit contributions
|+0.040
 You+0.039
Warning+0.035
 seem+0.035
 generation+0.035
Neg logit contributions
ades-0.034
 sho-0.032
 penn-0.031
brush-0.031
uddle-0.030
Token wouldn
Token ID3636
Feature activation-3.75e-02

Pos logit contributions
 You+0.043
|+0.042
 generation+0.041
keeping+0.040
Warning+0.040
Neg logit contributions
ades-0.035
 sho-0.033
brush-0.033
ade-0.032
uddle-0.032
Tokenâ
Token ID22940
Feature activation-7.36e-02

Pos logit contributions
keeping+0.052
 generation+0.052
|+0.052
 You+0.050
ses+0.049
Neg logit contributions
ades-0.063
brush-0.059
uddle-0.055
 sho-0.055
ade-0.054
Token
Token ID26391
Feature activation-4.63e-02

Pos logit contributions
ses+0.095
sing+0.091
elope+0.086
 spiders+0.083
 breathing+0.081
Neg logit contributions
up-0.122
 swapped-0.115
brush-0.112
um-0.110
 swap-0.109
Token
Token ID8151
Feature activation-7.43e-03

Pos logit contributions
|+0.077
Warning+0.075
rated+0.074
 You+0.070
 generation+0.070
Neg logit contributions
ades-0.065
brush-0.064
uddle-0.061
ip-0.060
 sho-0.060
Tokent
Token ID83
Feature activation-3.70e-02

Pos logit contributions
 You+0.008
|+0.008
 generation+0.008
Warning+0.007
 seem+0.007
Neg logit contributions
ades-0.015
brush-0.014
uddle-0.014
ade-0.014
 sho-0.014
Token be
Token ID307
Feature activation-2.75e-02

Pos logit contributions
 You+0.064
|+0.061
 generation+0.060
Warning+0.060
keeping+0.058
Neg logit contributions
ades-0.050
brush-0.049
aid-0.048
ade-0.048
 sho-0.048
Token allowed
Token ID3142
Feature activation-2.87e-02

Pos logit contributions
 You+0.046
|+0.046
 seem+0.043
 generation+0.041
Warning+0.040
Neg logit contributions
ades-0.044
uddle-0.038
brush-0.037
 sho-0.037
aid-0.036
Token find
Token ID1064
Feature activation+1.15e-02

Pos logit contributions
 You+0.021
|+0.021
 seem+0.019
Warning+0.018
 generation+0.018
Neg logit contributions
ades-0.025
uddle-0.022
 sho-0.022
brush-0.022
aid-0.022
Token and
Token ID290
Feature activation-2.27e-03

Pos logit contributions
ades+0.021
uddle+0.018
brush+0.018
 sho+0.018
aid+0.018
Neg logit contributions
 You-0.017
|-0.016
 seem-0.015
Warning-0.015
 generation-0.014
Token suddenly
Token ID6451
Feature activation-1.16e-02

Pos logit contributions
 You+0.003
|+0.003
 seem+0.003
Warning+0.003
 generation+0.003
Neg logit contributions
ades-0.004
 sho-0.003
uddle-0.003
brush-0.003
aid-0.003
Token he
Token ID339
Feature activation-8.00e-03

Pos logit contributions
 You+0.020
|+0.020
 seem+0.018
Warning+0.018
 generation+0.018
Neg logit contributions
ades-0.017
 sho-0.015
uddle-0.015
brush-0.015
aid-0.014
Token stumbled
Token ID24241
Feature activation-1.46e-03

Pos logit contributions
|+0.013
 You+0.013
 seem+0.012
 generation+0.011
Warning+0.011
Neg logit contributions
ades-0.013
aid-0.011
brush-0.011
 penn-0.011
 sho-0.011
Token upon
Token ID2402
Feature activation-7.35e-02

Pos logit contributions
 You+0.002
|+0.002
 seem+0.002
 generation+0.002
Warning+0.002
Neg logit contributions
ades-0.002
 sho-0.002
brush-0.002
uddle-0.002
ade-0.002
Token some
Token ID617
Feature activation+1.22e-02

Pos logit contributions
|+0.185
rated+0.180
elope+0.174
 You+0.172
keeping+0.172
Neg logit contributions
 exactly-0.042
 more-0.041
 sho-0.040
 crown-0.040
ades-0.040
Token old
Token ID1468
Feature activation-1.64e-02

Pos logit contributions
ades+0.018
aid+0.015
brush+0.015
uddle+0.015
ip+0.014
Neg logit contributions
 You-0.021
 seem-0.019
|-0.019
Warning-0.019
 generation-0.018
Token rocks
Token ID12586
Feature activation+7.84e-04

Pos logit contributions
|+0.033
 You+0.033
 seem+0.030
Warning+0.029
 generation+0.029
Neg logit contributions
ades-0.021
 sho-0.019
 penn-0.018
uddle-0.018
 scissors-0.018
Token.
Token ID13
Feature activation+1.23e-03

Pos logit contributions
ades+0.001
 sho+0.001
brush+0.001
uddle+0.001
aid+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
Warning-0.001
 generation-0.001
Token In
Token ID554
Feature activation-4.35e-03

Pos logit contributions
ades+0.002
 sho+0.002
 penn+0.002
brush+0.002
uddle+0.002
Neg logit contributions
|-0.002
 You-0.002
Warning-0.001
keeping-0.001
 seem-0.001
Token so
Token ID523
Feature activation-1.09e-02

Pos logit contributions
 You+0.024
 seem+0.022
|+0.020
You+0.019
 Today+0.018
Neg logit contributions
ades-0.030
uddle-0.028
aid-0.028
ip-0.026
 penn-0.026
Token pretty
Token ID2495
Feature activation+1.35e-02

Pos logit contributions
 You+0.023
 seem+0.021
|+0.021
You+0.019
 Welcome+0.019
Neg logit contributions
ades-0.013
aid-0.012
 sho-0.012
brush-0.011
uddle-0.011
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
ades+0.021
 sho+0.018
uddle+0.018
brush+0.018
aid+0.017
Neg logit contributions
 You-0.023
|-0.023
 seem-0.021
Warning-0.021
 generation-0.021
Token Can
Token ID1680
Feature activation+4.70e-04

Pos logit contributions
|+0.018
 You+0.016
 seem+0.016
Warning+0.015
 generation+0.015
Neg logit contributions
ades-0.045
brush-0.042
 sho-0.041
aid-0.040
uddle-0.040
Token I
Token ID314
Feature activation+2.92e-02

Pos logit contributions
ades+0.001
uddle+0.001
 sho+0.001
brush+0.001
aid+0.001
Neg logit contributions
 You-0.000
|-0.000
 seem-0.000
Warning-0.000
 generation-0.000
Token paint
Token ID7521
Feature activation+5.66e-02

Pos logit contributions
ades+0.062
 penn+0.057
 sho+0.055
uddle+0.054
aj+0.054
Neg logit contributions
 seem-0.036
 You-0.036
|-0.030
 Today-0.028
You-0.026
Token with
Token ID351
Feature activation+2.35e-02

Pos logit contributions
ades+0.110
uddle+0.108
 dads+0.101
 sho+0.099
 penn+0.098
Neg logit contributions
 You-0.085
 seem-0.061
You-0.060
 That-0.057
 Today-0.057
Token you
Token ID345
Feature activation+3.41e-02

Pos logit contributions
uddle+0.050
ades+0.049
 sho+0.044
uing+0.044
udd+0.043
Neg logit contributions
 You-0.031
You-0.021
 seem-0.021
 generation-0.019
 We-0.018
Token?"
Token ID1701
Feature activation+1.04e-02

Pos logit contributions
ades+0.066
aid+0.060
uddle+0.058
 penn+0.058
 sho+0.058
Neg logit contributions
 You-0.047
 Today-0.040
 seem-0.039
|-0.036
 generation-0.033
Token
Token ID198
Feature activation+1.44e-02

Pos logit contributions
ades+0.019
 sho+0.017
uddle+0.017
brush+0.017
aid+0.017
Neg logit contributions
 You-0.015
|-0.015
 seem-0.013
Warning-0.013
 generation-0.013
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
ades+0.030
 sho+0.028
brush+0.028
uddle+0.028
 penn+0.028
Neg logit contributions
 You-0.016
|-0.016
Warning-0.013
 seem-0.013
 Today-0.012
Token you
Token ID345
Feature activation+1.66e-02

Pos logit contributions
 You+0.030
|+0.028
 seem+0.027
 generation+0.025
Warning+0.025
Neg logit contributions
ades-0.038
uddle-0.033
brush-0.032
 sho-0.032
aid-0.032
Token can
Token ID460
Feature activation+2.54e-03

Pos logit contributions
ades+0.034
uddle+0.030
 sho+0.029
 penn+0.029
 brand+0.029
Neg logit contributions
|-0.021
 You-0.021
 seem-0.021
Warning-0.018
 generation-0.017
Token also
Token ID635
Feature activation-6.35e-03

Pos logit contributions
ades+0.005
uddle+0.005
 sho+0.004
 penn+0.004
aid+0.004
Neg logit contributions
 You-0.003
|-0.003
 seem-0.003
 generation-0.003
 Today-0.003
Token have
Token ID423
Feature activation+1.09e-02

Pos logit contributions
 You+0.009
|+0.009
 seem+0.008
 generation+0.008
Warning+0.007
Neg logit contributions
ades-0.012
uddle-0.011
 sho-0.010
 penn-0.010
aid-0.010
Token some
Token ID617
Feature activation+5.14e-02

Pos logit contributions
ades+0.019
uddle+0.018
brush+0.016
aid+0.016
 sho+0.016
Neg logit contributions
 You-0.017
|-0.016
 seem-0.016
 generation-0.015
Warning-0.014
Token cake
Token ID12187
Feature activation+4.38e-02

Pos logit contributions
ades+0.076
uddle+0.063
brush+0.061
udd+0.061
usted+0.060
Neg logit contributions
 You-0.092
 seem-0.085
 Today-0.080
 temptation-0.078
|-0.078
Token and
Token ID290
Feature activation+7.04e-03

Pos logit contributions
ades+0.080
uddle+0.070
 sho+0.070
usted+0.067
aid+0.067
Neg logit contributions
 You-0.060
 seem-0.055
|-0.054
keeping-0.052
 Today-0.049
Token ice
Token ID4771
Feature activation+1.26e-02

Pos logit contributions
ades+0.010
uddle+0.009
ade+0.008
udd+0.008
 sho+0.008
Neg logit contributions
 You-0.013
 seem-0.012
|-0.012
 generation-0.012
Warning-0.011
Token cream
Token ID8566
Feature activation+3.06e-02

Pos logit contributions
ades+0.010
 sho+0.007
uddle+0.007
 brand+0.007
brush+0.007
Neg logit contributions
 You-0.031
|-0.030
keeping-0.029
 seem-0.028
 generation-0.027
Token for
Token ID329
Feature activation-9.27e-03

Pos logit contributions
ades+0.055
uddle+0.048
 sho+0.048
brush+0.047
aid+0.047
Neg logit contributions
 You-0.045
|-0.043
 seem-0.040
Warning-0.038
 generation-0.038
Token dessert
Token ID23084
Feature activation+2.65e-02

Pos logit contributions
 You+0.014
|+0.014
 seem+0.013
 generation+0.012
Warning+0.012
Neg logit contributions
ades-0.016
uddle-0.015
brush-0.014
 sho-0.014
aid-0.014
Token made
Token ID925
Feature activation-2.73e-02

Pos logit contributions
ades+0.005
 sho+0.005
brush+0.005
ade+0.005
uddle+0.005
Neg logit contributions
 You-0.005
|-0.005
keeping-0.004
Warning-0.004
 seem-0.004
Token him
Token ID683
Feature activation-7.23e-03

Pos logit contributions
 You+0.044
|+0.043
 seem+0.040
Warning+0.038
 generation+0.038
Neg logit contributions
ades-0.045
uddle-0.039
 sho-0.039
brush-0.039
aid-0.039
Token happy
Token ID3772
Feature activation-9.53e-03

Pos logit contributions
 You+0.013
|+0.013
Warning+0.012
 generation+0.012
keeping+0.011
Neg logit contributions
ades-0.010
 sho-0.009
brush-0.009
uddle-0.009
 brand-0.009
Token.
Token ID13
Feature activation-3.78e-03

Pos logit contributions
|+0.020
 You+0.019
Warning+0.019
 generation+0.018
 seem+0.018
Neg logit contributions
ades-0.011
 sho-0.009
 friends-0.009
uddle-0.009
brush-0.009
Token<|endoftext|>
Token ID50256
Feature activation-2.96e-02

Pos logit contributions
rated+0.006
|+0.006
 seem+0.006
Warning+0.006
 generation+0.006
Neg logit contributions
ades-0.005
ip-0.004
 friends-0.004
uddle-0.004
 sho-0.004
TokenOnce
Token ID7454
Feature activation+4.36e-02

Pos logit contributions
rated+0.057
 generation+0.056
ses+0.055
sing+0.055
 spine+0.054
Neg logit contributions
ades-0.037
in-0.031
uddle-0.030
ade-0.030
 handful-0.028
Token upon
Token ID2402
Feature activation-4.68e-02

Pos logit contributions
ades+0.080
brush+0.071
 sho+0.071
aid+0.071
uddle+0.070
Neg logit contributions
 You-0.062
|-0.061
 seem-0.056
Warning-0.052
 Today-0.052
Token a
Token ID257
Feature activation+6.74e-03

Pos logit contributions
rated+0.117
ses+0.115
Warning+0.112
 spine+0.112
 You+0.112
Neg logit contributions
ades-0.030
 else-0.028
 helmet-0.025
 brand-0.025
 friends-0.025
Token time
Token ID640
Feature activation+2.07e-02

Pos logit contributions
ades+0.010
uddle+0.009
 sho+0.009
brush+0.008
ade+0.008
Neg logit contributions
 You-0.012
|-0.011
keeping-0.011
Warning-0.011
 seem-0.011
Token there
Token ID612
Feature activation+2.22e-02

Pos logit contributions
ades+0.016
uddle+0.012
 sho+0.012
brush+0.011
 penn+0.011
Neg logit contributions
 You-0.050
|-0.050
Warning-0.047
 generation-0.047
 seem-0.047
Token was
Token ID373
Feature activation-3.17e-02

Pos logit contributions
ades+0.034
uddle+0.030
 sho+0.030
brush+0.029
ade+0.029
Neg logit contributions
 You-0.036
|-0.036
 generation-0.034
Warning-0.033
keeping-0.033
Token was
Token ID373
Feature activation-1.51e-03

Pos logit contributions
ades+0.012
uddle+0.011
brush+0.011
 sho+0.010
aid+0.010
Neg logit contributions
 You-0.013
|-0.013
 seem-0.012
Warning-0.012
 generation-0.012
Token so
Token ID523
Feature activation+4.96e-03

Pos logit contributions
 You+0.003
|+0.002
 seem+0.002
 generation+0.002
 Today+0.002
Neg logit contributions
ades-0.003
uddle-0.002
brush-0.002
aid-0.002
ade-0.002
Token excited
Token ID6568
Feature activation-8.41e-03

Pos logit contributions
ades+0.009
uddle+0.008
brush+0.008
 sho+0.007
 scissors+0.007
Neg logit contributions
 You-0.008
|-0.007
 seem-0.007
 Today-0.006
You-0.006
Token.
Token ID13
Feature activation-2.85e-03

Pos logit contributions
|+0.016
 You+0.015
Warning+0.015
 generation+0.015
 seem+0.015
Neg logit contributions
ades-0.011
uddle-0.009
 sho-0.009
brush-0.009
ade-0.009
Token He
Token ID679
Feature activation+1.10e-02

Pos logit contributions
|+0.003
 You+0.003
 seem+0.003
Warning+0.003
 generation+0.003
Neg logit contributions
ades-0.006
 sho-0.005
brush-0.005
uddle-0.005
 brand-0.005
Token looked
Token ID3114
Feature activation+4.54e-02

Pos logit contributions
ades+0.020
aid+0.017
uddle+0.017
brush+0.017
 penn+0.016
Neg logit contributions
 You-0.016
|-0.016
 seem-0.014
 generation-0.013
 Welcome-0.013
Token around
Token ID1088
Feature activation-2.79e-03

Pos logit contributions
ades+0.097
aid+0.091
brush+0.090
omo+0.085
uddle+0.082
Neg logit contributions
 You-0.053
|-0.044
rated-0.041
 seem-0.040
 Today-0.039
Token,
Token ID11
Feature activation-7.72e-03

Pos logit contributions
 You+0.005
|+0.004
 seem+0.004
 generation+0.004
 Today+0.004
Neg logit contributions
ades-0.005
aid-0.004
uddle-0.004
brush-0.004
ade-0.004
Token but
Token ID475
Feature activation+1.05e-02

Pos logit contributions
 You+0.012
|+0.011
 seem+0.010
 generation+0.010
Warning+0.009
Neg logit contributions
ades-0.014
uddle-0.012
aid-0.012
brush-0.012
ade-0.012
Token he
Token ID339
Feature activation+1.06e-02

Pos logit contributions
ades+0.020
uddle+0.017
aid+0.017
brush+0.017
ade+0.016
Neg logit contributions
 You-0.016
|-0.015
 seem-0.014
 generation-0.013
Warning-0.013
Token didn
Token ID1422
Feature activation-1.29e-02

Pos logit contributions
ades+0.022
aid+0.019
brush+0.019
 penn+0.019
uddle+0.019
Neg logit contributions
 You-0.013
|-0.012
 seem-0.012
 generation-0.010
elope-0.010
Token!"
Token ID2474
Feature activation+1.11e-02

Pos logit contributions
ades+0.007
uddle+0.006
 sho+0.006
aid+0.006
brush+0.006
Neg logit contributions
 You-0.008
|-0.008
 seem-0.007
 generation-0.007
Warning-0.007
Token
Token ID198
Feature activation+9.43e-03

Pos logit contributions
ades+0.019
 sho+0.017
uddle+0.017
brush+0.017
 penn+0.017
Neg logit contributions
|-0.017
rated-0.015
 You-0.015
Warning-0.015
 generation-0.015
Token
Token ID198
Feature activation+9.63e-03

Pos logit contributions
ades+0.019
 sho+0.018
brush+0.017
uddle+0.017
 penn+0.017
Neg logit contributions
 You-0.012
|-0.011
 seem-0.010
Warning-0.010
 generation-0.009
TokenHer
Token ID9360
Feature activation+2.39e-02

Pos logit contributions
ades+0.014
uddle+0.014
 sho+0.013
 brand+0.013
brush+0.013
Neg logit contributions
 You-0.017
|-0.016
Warning-0.015
 generation-0.015
You-0.015
Token dad
Token ID9955
Feature activation+2.50e-02

Pos logit contributions
ades+0.029
aid+0.025
brush+0.024
 handful+0.023
igs+0.023
Neg logit contributions
 You-0.048
 seem-0.044
 Today-0.043
ses-0.041
 Time-0.040
Token looked
Token ID3114
Feature activation+4.46e-02

Pos logit contributions
ades+0.039
uddle+0.035
 sunscreen+0.035
 penn+0.035
igs+0.034
Neg logit contributions
 You-0.041
|-0.038
 seem-0.038
 Time-0.034
Warning-0.034
Token at
Token ID379
Feature activation+4.09e-03

Pos logit contributions
ades+0.082
brush+0.074
aid+0.073
ade+0.070
igs+0.068
Neg logit contributions
 You-0.070
 Today-0.057
|-0.056
rated-0.053
 seem-0.053
Token her
Token ID607
Feature activation-7.35e-03

Pos logit contributions
ades+0.006
 sho+0.005
brush+0.005
aid+0.005
uddle+0.005
Neg logit contributions
 You-0.007
|-0.007
 seem-0.007
Warning-0.007
 generation-0.007
Token.
Token ID13
Feature activation+3.62e-03

Pos logit contributions
 You+0.014
|+0.014
 seem+0.013
Warning+0.012
 generation+0.012
Neg logit contributions
ades-0.010
 sho-0.009
uddle-0.009
brush-0.009
aid-0.008
Token He
Token ID679
Feature activation+1.75e-02

Pos logit contributions
ades+0.008
brush+0.007
 sho+0.007
uddle+0.007
aid+0.007
Neg logit contributions
|-0.004
 You-0.004
 seem-0.003
 generation-0.003
Warning-0.003
Token laughed
Token ID13818
Feature activation+1.77e-03

Pos logit contributions
ades+0.031
 penn+0.028
uddle+0.028
brush+0.028
 sunscreen+0.028
Neg logit contributions
 You-0.024
|-0.023
 seem-0.022
You-0.021
Warning-0.020
Token.
Token ID13
Feature activation-4.20e-03

Pos logit contributions
|+0.048
rated+0.047
Warning+0.045
 You+0.045
 generation+0.043
Neg logit contributions
ades-0.027
 sunscreen-0.024
 friends-0.022
 penn-0.022
 cream-0.022
Token
Token ID220
Feature activation-7.14e-04

Pos logit contributions
rated+0.006
|+0.005
 seem+0.005
Warning+0.005
 generation+0.005
Neg logit contributions
ades-0.007
brush-0.006
 sho-0.006
uddle-0.006
aid-0.006
Token
Token ID198
Feature activation-2.30e-04

Pos logit contributions
|+0.001
 You+0.001
Warning+0.001
 generation+0.001
 seem+0.001
Neg logit contributions
ades-0.001
uddle-0.001
brush-0.001
ade-0.001
aid-0.001
Token
Token ID198
Feature activation-3.59e-03

Pos logit contributions
 You+0.000
|+0.000
 seem+0.000
 generation+0.000
Warning+0.000
Neg logit contributions
ades-0.000
aid-0.000
uddle-0.000
brush-0.000
 sho-0.000
TokenAs
Token ID1722
Feature activation-1.96e-02

Pos logit contributions
 You+0.007
|+0.007
 seem+0.007
Warning+0.006
keeping+0.006
Neg logit contributions
ades-0.004
aid-0.004
brush-0.004
 sho-0.004
uddle-0.004
Token they
Token ID484
Feature activation+2.79e-02

Pos logit contributions
 You+0.034
|+0.034
 seem+0.032
Warning+0.031
 generation+0.030
Neg logit contributions
ades-0.029
 sho-0.025
brush-0.025
uddle-0.025
aid-0.024
Token walked
Token ID6807
Feature activation+3.49e-02

Pos logit contributions
ades+0.052
uddle+0.046
brush+0.045
 sho+0.045
 brand+0.044
Neg logit contributions
|-0.043
 seem-0.038
 You-0.037
 generation-0.034
Warning-0.034
Token through
Token ID832
Feature activation+5.92e-04

Pos logit contributions
ades+0.053
 sho+0.046
 butter+0.045
 apr+0.044
uddle+0.044
Neg logit contributions
|-0.061
 You-0.058
 Today-0.057
 seem-0.056
 moment-0.052
Token the
Token ID262
Feature activation+1.90e-03

Pos logit contributions
ades+0.001
uddle+0.001
brush+0.001
 sho+0.001
aid+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
 generation-0.001
Warning-0.001
Token zoo
Token ID26626
Feature activation+2.75e-02

Pos logit contributions
ades+0.003
 sho+0.003
brush+0.003
uddle+0.003
aid+0.003
Neg logit contributions
|-0.003
 You-0.003
 seem-0.002
 generation-0.002
Warning-0.002
Token,
Token ID11
Feature activation-1.48e-02

Pos logit contributions
ades+0.032
aid+0.030
 sho+0.028
brush+0.028
uddle+0.026
Neg logit contributions
 You-0.055
 seem-0.054
|-0.054
 generation-0.049
 Today-0.049
Token mom
Token ID1995
Feature activation-5.01e-03

Pos logit contributions
ades+0.010
aid+0.008
brush+0.008
usted+0.008
is+0.008
Neg logit contributions
 You-0.016
 Today-0.015
|-0.015
 seem-0.015
 generation-0.015
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
 You+0.007
|+0.007
 seem+0.007
Warning+0.006
 generation+0.006
Neg logit contributions
ades-0.009
uddle-0.008
 sho-0.008
aid-0.008
brush-0.008
Token She
Token ID1375
Feature activation+9.28e-03

Pos logit contributions
ades+0.055
 sho+0.050
aid+0.049
uddle+0.049
brush+0.049
Neg logit contributions
 You-0.021
|-0.019
 seem-0.017
Warning-0.015
 Today-0.015
Token was
Token ID373
Feature activation-7.07e-04

Pos logit contributions
ades+0.016
uddle+0.014
brush+0.014
 sho+0.014
aid+0.014
Neg logit contributions
 You-0.014
|-0.014
 seem-0.013
Warning-0.012
 generation-0.012
Token making
Token ID1642
Feature activation+1.95e-03

Pos logit contributions
 You+0.001
|+0.001
 seem+0.001
Warning+0.001
 generation+0.001
Neg logit contributions
ades-0.001
 sho-0.001
brush-0.001
uddle-0.001
aid-0.001
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
ades+0.004
uddle+0.004
brush+0.003
 sho+0.003
ig+0.003
Neg logit contributions
 You-0.002
 seem-0.002
 temptation-0.002
You-0.002
 generation-0.002
Token y
Token ID331
Feature activation+3.13e-02

Pos logit contributions
ades+0.016
aid+0.016
usted+0.015
atted+0.015
brush+0.014
Neg logit contributions
 You-0.021
 generation-0.020
 Today-0.019
 temptation-0.019
|-0.019
Tokenummy
Token ID13513
Feature activation-8.89e-03

Pos logit contributions
ades+0.046
 sho+0.041
brush+0.041
aid+0.041
 scissors+0.039
Neg logit contributions
 You-0.053
|-0.052
 seem-0.049
 Today-0.047
 generation-0.047
Token snack
Token ID26906
Feature activation+1.41e-02

Pos logit contributions
 You+0.019
|+0.019
 seem+0.018
Warning+0.017
 generation+0.017
Neg logit contributions
ades-0.010
 sho-0.008
uddle-0.008
brush-0.008
aid-0.008
Token in
Token ID287
Feature activation-2.30e-02

Pos logit contributions
ades+0.023
uddle+0.020
 sho+0.020
brush+0.020
aid+0.020
Neg logit contributions
 You-0.023
|-0.022
 seem-0.021
Warning-0.020
 generation-0.020
Token the
Token ID262
Feature activation-1.79e-02

Pos logit contributions
 You+0.041
|+0.041
 seem+0.038
Warning+0.037
 generation+0.037
Neg logit contributions
ades-0.033
 sho-0.029
brush-0.028
uddle-0.028
aid-0.028
Token serious
Token ID2726
Feature activation+9.05e-04

Pos logit contributions
 You+0.030
 seem+0.027
|+0.027
 generation+0.026
You+0.025
Neg logit contributions
ades-0.039
uddle-0.037
brush-0.033
ade-0.032
aid-0.032
Token,
Token ID11
Feature activation-3.15e-03

Pos logit contributions
ades+0.001
brush+0.001
 sho+0.001
uddle+0.001
aid+0.001
Neg logit contributions
|-0.002
 You-0.002
Warning-0.002
 generation-0.002
 seem-0.002
Token but
Token ID475
Feature activation-4.27e-03

Pos logit contributions
 You+0.005
|+0.005
 seem+0.005
 generation+0.004
Warning+0.004
Neg logit contributions
ades-0.005
uddle-0.005
 sho-0.005
brush-0.004
aid-0.004
Token that
Token ID326
Feature activation-3.73e-02

Pos logit contributions
 You+0.007
|+0.006
 seem+0.006
 generation+0.006
Warning+0.006
Neg logit contributions
ades-0.008
uddle-0.007
brush-0.007
 sho-0.006
ade-0.006
Token they
Token ID484
Feature activation+1.13e-02

Pos logit contributions
|+0.065
 You+0.064
Warning+0.059
 generation+0.058
 seem+0.058
Neg logit contributions
ades-0.054
 sho-0.049
brush-0.047
 penn-0.046
uddle-0.046
Token would
Token ID561
Feature activation+1.63e-03

Pos logit contributions
ades+0.022
uddle+0.020
 sho+0.019
brush+0.019
 penn+0.019
Neg logit contributions
|-0.015
 You-0.015
 seem-0.015
 generation-0.013
Warning-0.013
Token be
Token ID307
Feature activation+5.77e-03

Pos logit contributions
ades+0.003
uddle+0.003
 sho+0.003
brush+0.003
aid+0.003
Neg logit contributions
 You-0.002
|-0.002
 seem-0.002
 generation-0.002
Warning-0.002
Token okay
Token ID8788
Feature activation-6.89e-03

Pos logit contributions
ades+0.011
uddle+0.011
ade+0.010
is+0.009
 tant+0.009
Neg logit contributions
 You-0.008
You-0.007
 been-0.007
 late-0.006
|-0.006
Token.
Token ID13
Feature activation-5.84e-04

Pos logit contributions
|+0.011
 You+0.011
Warning+0.010
 generation+0.010
 seem+0.010
Neg logit contributions
ades-0.011
brush-0.010
 sho-0.010
uddle-0.010
aid-0.009
Token Tom
Token ID4186
Feature activation+2.04e-02

Pos logit contributions
|+0.001
rated+0.001
 seem+0.001
 You+0.001
 generation+0.001
Neg logit contributions
ades-0.001
uddle-0.001
brush-0.001
 sho-0.001
 moms-0.001
Token felt
Token ID2936
Feature activation+2.23e-02

Pos logit contributions
ades+0.038
uddle+0.032
 penn+0.032
 brand+0.032
 sunscreen+0.032
Neg logit contributions
 You-0.028
|-0.028
 seem-0.026
elope-0.024
You-0.023
Token are
Token ID389
Feature activation+5.25e-03

Pos logit contributions
ades+0.027
uddle+0.025
aid+0.024
ade+0.023
ip+0.023
Neg logit contributions
 seem-0.017
 You-0.016
|-0.015
Warning-0.012
 generation-0.012
Token proud
Token ID6613
Feature activation+3.60e-03

Pos logit contributions
ades+0.011
uddle+0.011
ade+0.010
 tant+0.010
aid+0.010
Neg logit contributions
 You-0.007
You-0.005
 having-0.005
 seem-0.005
 being-0.005
Token of
Token ID286
Feature activation-9.53e-03

Pos logit contributions
ades+0.005
uddle+0.005
 sho+0.005
brush+0.005
 sunscreen+0.004
Neg logit contributions
 You-0.007
 seem-0.006
You-0.005
 Today-0.005
Warning-0.005
Token her
Token ID607
Feature activation-3.86e-03

Pos logit contributions
|+0.019
rated+0.019
 You+0.018
Warning+0.018
 seem+0.018
Neg logit contributions
ades-0.012
 penn-0.010
aid-0.010
 scissors-0.009
 friends-0.009
Token.
Token ID13
Feature activation+5.23e-04

Pos logit contributions
|+0.007
 You+0.006
Warning+0.006
 generation+0.006
keeping+0.006
Neg logit contributions
ades-0.006
 sho-0.005
brush-0.005
uddle-0.005
 scissors-0.005
Token They
Token ID1119
Feature activation+9.57e-03

Pos logit contributions
ades+0.001
 sho+0.001
uddle+0.001
brush+0.001
aid+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.000
Warning-0.000
 generation-0.000
Token tell
Token ID1560
Feature activation+1.59e-04

Pos logit contributions
ades+0.018
uddle+0.016
brush+0.016
 penn+0.015
 sho+0.015
Neg logit contributions
 You-0.013
|-0.013
 seem-0.013
Warning-0.011
 generation-0.011
Token her
Token ID607
Feature activation+1.40e-02

Pos logit contributions
ades+0.000
uddle+0.000
brush+0.000
 sho+0.000
ade+0.000
Neg logit contributions
 You-0.000
 seem-0.000
|-0.000
 generation-0.000
Warning-0.000
Token she
Token ID673
Feature activation-1.51e-03

Pos logit contributions
ades+0.018
uddle+0.017
 sho+0.016
is+0.015
brush+0.015
Neg logit contributions
 You-0.026
 seem-0.026
|-0.024
 generation-0.022
 Today-0.022
Token is
Token ID318
Feature activation-4.77e-04

Pos logit contributions
 You+0.002
|+0.002
 seem+0.002
Warning+0.002
 generation+0.002
Neg logit contributions
ades-0.003
 sho-0.002
uddle-0.002
brush-0.002
aid-0.002
Token a
Token ID257
Feature activation+2.58e-02

Pos logit contributions
 You+0.001
 seem+0.001
You+0.001
|+0.001
 generation+0.001
Neg logit contributions
ades-0.001
uddle-0.001
 tant-0.001
ade-0.001
brush-0.001
Token.
Token ID13
Feature activation+4.40e-03

Pos logit contributions
ades+0.060
uddle+0.056
 sho+0.054
aid+0.053
 brand+0.052
Neg logit contributions
 You-0.043
 seem-0.039
|-0.039
keeping-0.037
Warning-0.036
Token They
Token ID1119
Feature activation+2.97e-02

Pos logit contributions
ades+0.010
 sho+0.009
brush+0.009
 penn+0.009
uddle+0.009
Neg logit contributions
|-0.004
 You-0.004
 seem-0.003
Warning-0.003
rated-0.003
Token also
Token ID635
Feature activation+2.61e-02

Pos logit contributions
ades+0.063
uddle+0.059
ade+0.054
 brand+0.053
brush+0.053
Neg logit contributions
 seem-0.035
|-0.029
 You-0.029
Warning-0.028
You-0.027
Token saw
Token ID2497
Feature activation+1.24e-02

Pos logit contributions
ades+0.048
uddle+0.045
brush+0.040
ade+0.039
aid+0.039
Neg logit contributions
 seem-0.036
|-0.035
Warning-0.033
 You-0.033
 generation-0.032
Token some
Token ID617
Feature activation+2.31e-02

Pos logit contributions
ades+0.023
uddle+0.022
ade+0.020
brush+0.020
uing+0.020
Neg logit contributions
 You-0.016
 seem-0.015
 generation-0.015
|-0.014
keeping-0.013
Token new
Token ID649
Feature activation+2.88e-03

Pos logit contributions
ades+0.029
uddle+0.026
ingu+0.025
ilantro+0.025
ip+0.024
Neg logit contributions
 seem-0.040
 You-0.040
Warning-0.035
You-0.035
|-0.035
Token or
Token ID393
Feature activation+4.54e-04

Pos logit contributions
ades+0.004
uddle+0.003
aid+0.003
brush+0.003
 sho+0.003
Neg logit contributions
 You-0.005
|-0.005
 seem-0.005
keeping-0.005
 generation-0.005
Tokennam
Token ID7402
Feature activation+1.61e-02

Pos logit contributions
ades+0.001
uddle+0.001
ade+0.001
aid+0.001
ip+0.001
Neg logit contributions
 seem-0.001
 You-0.001
|-0.001
 generation-0.001
Warning-0.001
Tokenents
Token ID658
Feature activation+9.14e-03

Pos logit contributions
ades+0.014
uddle+0.012
aid+0.012
 sunscreen+0.012
 cream+0.012
Neg logit contributions
|-0.036
 You-0.035
 seem-0.034
keeping-0.034
 generation-0.034
Token that
Token ID326
Feature activation-2.13e-02

Pos logit contributions
ades+0.013
 sho+0.011
uddle+0.011
brush+0.011
aid+0.010
Neg logit contributions
 You-0.017
|-0.017
 seem-0.016
Warning-0.015
 generation-0.015
Token Santa
Token ID8909
Feature activation-2.07e-02

Pos logit contributions
 You+0.034
|+0.033
 seem+0.032
 generation+0.030
Warning+0.030
Neg logit contributions
ades-0.035
uddle-0.031
 sho-0.030
aid-0.030
brush-0.030
Token Mia
Token ID32189
Feature activation+1.64e-02

Pos logit contributions
|+0.000
 You+0.000
 generation+0.000
keeping+0.000
Warning+0.000
Neg logit contributions
ades-0.000
uddle-0.000
brush-0.000
 sho-0.000
aid-0.000
Token saw
Token ID2497
Feature activation+1.21e-02

Pos logit contributions
ades+0.030
uddle+0.026
 brand+0.026
brush+0.026
aid+0.025
Neg logit contributions
 You-0.023
|-0.023
 seem-0.022
Warning-0.020
elope-0.020
Token that
Token ID326
Feature activation-1.18e-02

Pos logit contributions
ades+0.019
uddle+0.017
brush+0.017
 sho+0.017
 penn+0.016
Neg logit contributions
 You-0.020
 seem-0.019
|-0.018
Warning-0.017
You-0.017
Token the
Token ID262
Feature activation-8.91e-03

Pos logit contributions
|+0.024
 You+0.023
Warning+0.022
 generation+0.021
keeping+0.021
Neg logit contributions
ades-0.014
 sho-0.013
brush-0.012
 scissors-0.012
 penn-0.012
Token tiger
Token ID26241
Feature activation-2.21e-02

Pos logit contributions
 You+0.015
|+0.015
 seem+0.014
Warning+0.014
 generation+0.014
Neg logit contributions
ades-0.014
 sho-0.012
uddle-0.012
brush-0.012
aid-0.011
Token was
Token ID373
Feature activation-1.47e-02

Pos logit contributions
 You+0.033
|+0.033
 seem+0.030
Warning+0.029
 generation+0.029
Neg logit contributions
ades-0.038
 sho-0.034
uddle-0.034
brush-0.034
aid-0.033
Token not
Token ID407
Feature activation-2.87e-02

Pos logit contributions
 You+0.021
|+0.019
 seem+0.018
You+0.016
keeping+0.016
Neg logit contributions
ades-0.028
uddle-0.025
brush-0.024
aid-0.024
ade-0.024
Token mean
Token ID1612
Feature activation+4.48e-03

Pos logit contributions
 You+0.050
|+0.049
 seem+0.045
Warning+0.045
 generation+0.044
Neg logit contributions
ades-0.043
 sho-0.038
brush-0.037
uddle-0.037
aid-0.037
Token,
Token ID11
Feature activation-9.18e-03

Pos logit contributions
ades+0.006
 sho+0.006
uddle+0.006
brush+0.006
 penn+0.006
Neg logit contributions
|-0.008
 You-0.008
 generation-0.007
Warning-0.007
 seem-0.007
Token just
Token ID655
Feature activation-2.89e-02

Pos logit contributions
|+0.016
 You+0.016
Warning+0.015
 seem+0.015
 generation+0.015
Neg logit contributions
ades-0.013
 sho-0.012
brush-0.012
uddle-0.011
aid-0.011
Token hungry
Token ID14720
Feature activation-3.80e-03

Pos logit contributions
|+0.054
 You+0.050
Warning+0.050
 generation+0.047
keeping+0.046
Neg logit contributions
ades-0.038
 sho-0.036
 cream-0.035
 scissors-0.034
brush-0.034
Token it
Token ID340
Feature activation-1.44e-02

Pos logit contributions
 You+0.027
|+0.026
 seem+0.024
 generation+0.024
Warning+0.024
Neg logit contributions
ades-0.024
uddle-0.021
brush-0.021
 sho-0.020
aid-0.020
Token used
Token ID973
Feature activation+6.53e-03

Pos logit contributions
|+0.023
 You+0.023
 seem+0.021
Warning+0.020
 generation+0.020
Neg logit contributions
ades-0.024
uddle-0.020
brush-0.020
aid-0.020
 sho-0.020
Token its
Token ID663
Feature activation+4.73e-03

Pos logit contributions
ades+0.010
uddle+0.010
brush+0.009
 sho+0.009
aid+0.009
Neg logit contributions
 You-0.011
|-0.010
 seem-0.010
 generation-0.009
keeping-0.009
Token strong
Token ID1913
Feature activation+1.45e-02

Pos logit contributions
ades+0.007
uddle+0.006
brush+0.006
aid+0.006
 sho+0.006
Neg logit contributions
 You-0.008
|-0.008
 seem-0.008
 generation-0.008
Warning-0.007
Token claws
Token ID28421
Feature activation+1.04e-02

Pos logit contributions
ades+0.021
uddle+0.018
 sho+0.018
brush+0.018
aid+0.018
Neg logit contributions
 You-0.026
|-0.025
 seem-0.024
 generation-0.023
Warning-0.023
Token to
Token ID284
Feature activation-1.35e-02

Pos logit contributions
ades+0.017
uddle+0.015
aid+0.015
 sho+0.014
 penn+0.014
Neg logit contributions
 You-0.016
 seem-0.016
|-0.015
keeping-0.014
Warning-0.014
Token pick
Token ID2298
Feature activation+7.59e-04

Pos logit contributions
 You+0.021
|+0.021
 seem+0.020
 generation+0.019
Warning+0.019
Neg logit contributions
ades-0.023
uddle-0.020
 sho-0.019
brush-0.019
aid-0.019
Token up
Token ID510
Feature activation+3.20e-03

Pos logit contributions
ades+0.001
uddle+0.001
 sho+0.001
brush+0.001
aid+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
Warning-0.001
 generation-0.001
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
ades+0.005
uddle+0.005
brush+0.005
aid+0.005
 sho+0.005
Neg logit contributions
 You-0.005
|-0.005
 seem-0.005
 generation-0.004
Warning-0.004
Token garbage
Token ID15413
Feature activation+4.65e-03

Pos logit contributions
 You+0.026
|+0.025
 seem+0.024
Warning+0.023
 generation+0.023
Neg logit contributions
ades-0.019
 sho-0.016
uddle-0.016
brush-0.016
aid-0.015
Token and
Token ID290
Feature activation-5.76e-03

Pos logit contributions
ades+0.008
 sho+0.006
uddle+0.006
aid+0.006
brush+0.006
Neg logit contributions
 You-0.008
|-0.007
 seem-0.007
Warning-0.007
 generation-0.007
Token be
Token ID307
Feature activation-2.69e-02

Pos logit contributions
 You+0.041
|+0.040
 seem+0.037
Warning+0.036
 generation+0.036
Neg logit contributions
ades-0.040
 sho-0.035
brush-0.035
uddle-0.034
aid-0.034
Token useful
Token ID4465
Feature activation+1.79e-02

Pos logit contributions
 You+0.045
|+0.045
 seem+0.042
 generation+0.040
keeping+0.039
Neg logit contributions
ades-0.043
uddle-0.038
aid-0.037
brush-0.037
 sho-0.037
Token again
Token ID757
Feature activation-7.90e-03

Pos logit contributions
ades+0.028
uddle+0.024
 sho+0.024
aid+0.024
brush+0.024
Neg logit contributions
 You-0.031
|-0.029
 seem-0.028
 generation-0.027
keeping-0.027
Token!"
Token ID2474
Feature activation-9.59e-04

Pos logit contributions
 You+0.012
|+0.012
 seem+0.011
 generation+0.011
Warning+0.011
Neg logit contributions
ades-0.013
uddle-0.012
 sho-0.012
aid-0.012
brush-0.012
Token
Token ID220
Feature activation-1.13e-02

Pos logit contributions
rated+0.002
|+0.002
 seem+0.002
Warning+0.002
 generation+0.002
Neg logit contributions
ades-0.001
uddle-0.001
aid-0.001
ip-0.001
brush-0.001
Token
Token ID198
Feature activation+4.79e-04

Pos logit contributions
|+0.016
 You+0.016
 seem+0.015
 generation+0.015
Warning+0.015
Neg logit contributions
ades-0.020
uddle-0.018
 sho-0.017
brush-0.017
aid-0.017
Token
Token ID198
Feature activation+4.79e-05

Pos logit contributions
ades+0.001
 sho+0.001
uddle+0.001
brush+0.001
aid+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
Warning-0.001
 generation-0.001
TokenAnd
Token ID1870
Feature activation-3.29e-03

Pos logit contributions
ades+0.000
 sho+0.000
uddle+0.000
brush+0.000
 penn+0.000
Neg logit contributions
|-0.000
 You-0.000
Warning-0.000
 seem-0.000
 generation-0.000
Token<|endoftext|>
Token ID50256
Feature activation-3.84e-02

Pos logit contributions
 You+0.006
|+0.006
 seem+0.006
Warning+0.005
 generation+0.005
Neg logit contributions
ades-0.005
uddle-0.004
 sho-0.004
brush-0.004
aid-0.004
TokenOnce
Token ID7454
Feature activation+4.33e-02

Pos logit contributions
rated+0.071
ses+0.071
 generation+0.070
 seem+0.068
sing+0.067
Neg logit contributions
ades-0.051
in-0.044
uddle-0.043
ade-0.040
 handful-0.039
Token upon
Token ID2402
Feature activation-4.89e-02

Pos logit contributions
ades+0.080
brush+0.072
 sho+0.071
aid+0.071
uddle+0.070
Neg logit contributions
 You-0.061
|-0.061
 seem-0.056
 Today-0.052
Warning-0.051
Token He
Token ID679
Feature activation+1.62e-02

Pos logit contributions
|+0.010
 You+0.010
 generation+0.009
Warning+0.009
 seem+0.009
Neg logit contributions
ades-0.016
 sho-0.015
uddle-0.014
brush-0.014
 handful-0.014
Token thought
Token ID1807
Feature activation+2.43e-02

Pos logit contributions
ades+0.031
brush+0.028
uddle+0.027
aid+0.027
 sunscreen+0.026
Neg logit contributions
|-0.021
 You-0.021
 seem-0.019
Warning-0.018
 generation-0.018
Token it
Token ID340
Feature activation-1.35e-02

Pos logit contributions
ades+0.039
uddle+0.035
aid+0.035
brush+0.033
 sho+0.033
Neg logit contributions
 You-0.040
 seem-0.038
|-0.037
 generation-0.034
Warning-0.034
Token might
Token ID1244
Feature activation-3.44e-02

Pos logit contributions
|+0.024
 You+0.024
keeping+0.022
 generation+0.022
Warning+0.022
Neg logit contributions
ades-0.020
 sho-0.017
uddle-0.017
brush-0.017
ade-0.017
Token help
Token ID1037
Feature activation-3.29e-02

Pos logit contributions
 You+0.063
|+0.062
keeping+0.060
 generation+0.057
 Come+0.057
Neg logit contributions
ades-0.040
ade-0.040
 sho-0.039
brush-0.039
 scissors-0.038
Token him
Token ID683
Feature activation-8.99e-03

Pos logit contributions
 You+0.055
|+0.055
Warning+0.050
 seem+0.049
 generation+0.049
Neg logit contributions
ades-0.051
 sho-0.045
brush-0.044
uddle-0.044
aid-0.043
Token.
Token ID13
Feature activation-6.15e-03

Pos logit contributions
 You+0.014
|+0.014
 seem+0.013
 generation+0.012
Warning+0.012
Neg logit contributions
ades-0.015
uddle-0.013
 sho-0.013
brush-0.013
aid-0.013
Token He
Token ID679
Feature activation+1.40e-02

Pos logit contributions
|+0.009
 You+0.009
 seem+0.008
 generation+0.008
Warning+0.008
Neg logit contributions
ades-0.010
uddle-0.009
 sho-0.009
brush-0.009
aid-0.009
Token wished
Token ID16555
Feature activation-5.44e-03

Pos logit contributions
ades+0.027
uddle+0.023
brush+0.023
aid+0.022
 brand+0.022
Neg logit contributions
|-0.019
 You-0.018
 seem-0.017
 generation-0.017
Warning-0.016
Token and
Token ID290
Feature activation+2.21e-04

Pos logit contributions
|+0.010
 You+0.010
Warning+0.009
 generation+0.009
 seem+0.009
Neg logit contributions
ades-0.007
 sho-0.006
 penn-0.006
brush-0.006
 brand-0.006
Token wished
Token ID16555
Feature activation-1.12e-02

Pos logit contributions
ades+0.000
 sho+0.000
 penn+0.000
 scissors+0.000
 friends+0.000
Neg logit contributions
|-0.000
 You-0.000
Warning-0.000
 seem-0.000
 Today-0.000
Token.
Token ID13
Feature activation+7.71e-03

Pos logit contributions
|+0.007
 You+0.007
Warning+0.006
 seem+0.006
 generation+0.006
Neg logit contributions
ades-0.006
uddle-0.005
 sho-0.005
brush-0.005
 penn-0.005
Token Sally
Token ID25737
Feature activation-8.10e-04

Pos logit contributions
ades+0.011
uddle+0.011
 sho+0.010
ade+0.010
 sunscreen+0.010
Neg logit contributions
|-0.012
 You-0.011
Warning-0.011
 seem-0.011
rated-0.011
Token saw
Token ID2497
Feature activation+1.89e-02

Pos logit contributions
|+0.001
 You+0.001
 seem+0.001
 generation+0.001
Warning+0.001
Neg logit contributions
ades-0.001
aid-0.001
brush-0.001
uddle-0.001
 sho-0.001
Token her
Token ID607
Feature activation+1.24e-02

Pos logit contributions
ades+0.035
uddle+0.031
brush+0.031
aj+0.030
pron+0.029
Neg logit contributions
 You-0.027
 seem-0.025
You-0.024
 generation-0.023
|-0.023
Token friend
Token ID1545
Feature activation+2.84e-03

Pos logit contributions
ades+0.014
aid+0.012
brush+0.011
ade+0.010
usted+0.010
Neg logit contributions
 You-0.026
|-0.025
 seem-0.025
 generation-0.023
keeping-0.023
Token Tim
Token ID5045
Feature activation-2.75e-02

Pos logit contributions
ades+0.004
aid+0.003
brush+0.003
uddle+0.003
 sho+0.003
Neg logit contributions
 You-0.005
|-0.005
 seem-0.005
 generation-0.005
Warning-0.005
Token.
Token ID13
Feature activation+9.81e-03

Pos logit contributions
|+0.054
 You+0.054
Warning+0.053
 generation+0.051
 seem+0.050
Neg logit contributions
ades-0.031
 sho-0.028
 scissors-0.027
uddle-0.026
brush-0.026
Token Tim
Token ID5045
Feature activation+9.38e-04

Pos logit contributions
ades+0.021
uddle+0.019
 sho+0.018
brush+0.018
 brand+0.018
Neg logit contributions
|-0.011
 You-0.010
 seem-0.010
 generation-0.010
Warning-0.010
Token said
Token ID531
Feature activation+2.26e-02

Pos logit contributions
ades+0.001
brush+0.001
uddle+0.001
aid+0.001
 sho+0.001
Neg logit contributions
 You-0.002
|-0.002
 seem-0.001
Warning-0.001
 generation-0.001
Token,
Token ID11
Feature activation+6.10e-04

Pos logit contributions
ades+0.035
uddle+0.033
udd+0.031
aj+0.031
 apr+0.030
Neg logit contributions
 You-0.042
 seem-0.038
 Welcome-0.035
 Listen-0.034
 That-0.033
Token "
Token ID366
Feature activation-9.23e-03

Pos logit contributions
ades+0.001
 sho+0.001
brush+0.001
uddle+0.001
aid+0.001
Neg logit contributions
|-0.001
 You-0.001
Warning-0.001
 seem-0.001
 generation-0.001
Token about
Token ID546
Feature activation-1.41e-02

Pos logit contributions
|+0.017
 You+0.016
Warning+0.015
 generation+0.015
 seem+0.015
Neg logit contributions
ades-0.014
brush-0.013
 sho-0.012
ade-0.012
 cream-0.012
Token not
Token ID407
Feature activation-2.64e-02

Pos logit contributions
|+0.027
 You+0.026
 seem+0.025
Warning+0.025
rated+0.024
Neg logit contributions
ades-0.019
 sho-0.016
 penn-0.016
 scissors-0.016
brush-0.016
Token chewing
Token ID36615
Feature activation+8.61e-03

Pos logit contributions
|+0.039
 You+0.037
Warning+0.034
 generation+0.033
 Today+0.032
Neg logit contributions
ades-0.046
 sho-0.043
aid-0.042
 penn-0.042
brush-0.042
Token things
Token ID1243
Feature activation-8.08e-04

Pos logit contributions
ades+0.013
 sho+0.011
uddle+0.011
brush+0.011
 penn+0.010
Neg logit contributions
|-0.015
 You-0.015
 seem-0.014
Warning-0.014
 generation-0.014
Token that
Token ID326
Feature activation-3.33e-02

Pos logit contributions
 You+0.001
 seem+0.001
|+0.001
Warning+0.001
 generation+0.001
Neg logit contributions
ades-0.001
uddle-0.001
 sho-0.001
aid-0.001
brush-0.001
Token weren
Token ID6304
Feature activation-5.08e-02

Pos logit contributions
 You+0.052
|+0.052
Warning+0.048
 generation+0.047
keeping+0.046
Neg logit contributions
ades-0.053
 sho-0.048
brush-0.048
uddle-0.047
aid-0.046
Token't
Token ID470
Feature activation-4.14e-02

Pos logit contributions
|+0.069
keeping+0.068
 generation+0.067
 You+0.067
elope+0.064
Neg logit contributions
ades-0.093
brush-0.084
ade-0.082
is-0.080
 sho-0.079
Token food
Token ID2057
Feature activation-1.05e-02

Pos logit contributions
|+0.074
 You+0.073
Warning+0.067
 seem+0.066
 generation+0.066
Neg logit contributions
ades-0.060
 sho-0.053
brush-0.052
aid-0.051
 penn-0.051
Token.
Token ID13
Feature activation-4.55e-03

Pos logit contributions
 You+0.017
|+0.016
 seem+0.015
Warning+0.014
 generation+0.014
Neg logit contributions
ades-0.017
 sho-0.015
uddle-0.015
brush-0.015
aid-0.015
Token
Token ID198
Feature activation+5.48e-03

Pos logit contributions
|+0.007
 You+0.007
 generation+0.007
Warning+0.007
 seem+0.007
Neg logit contributions
ades-0.007
uddle-0.007
 sho-0.006
brush-0.006
 brand-0.006
Token
Token ID198
Feature activation-3.32e-03

Pos logit contributions
ades+0.011
 sho+0.010
brush+0.010
uddle+0.010
aid+0.009
Neg logit contributions
 You-0.007
|-0.007
 seem-0.006
Warning-0.006
 generation-0.006
Token it
Token ID340
Feature activation-2.40e-02

Pos logit contributions
ades+0.006
 sho+0.006
uddle+0.006
brush+0.006
 penn+0.006
Neg logit contributions
|-0.005
 You-0.005
Warning-0.004
 seem-0.004
 generation-0.004
Token went
Token ID1816
Feature activation-1.66e-02

Pos logit contributions
 You+0.041
|+0.040
Warning+0.038
keeping+0.038
 generation+0.037
Neg logit contributions
ades-0.035
 sho-0.031
brush-0.031
uddle-0.031
ade-0.031
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
|+0.035
 You+0.034
Warning+0.033
ses+0.033
rated+0.032
Neg logit contributions
ades-0.017
brush-0.014
uddle-0.014
 sho-0.014
ade-0.013
Token<|endoftext|>
Token ID50256
Feature activation-3.20e-02

Pos logit contributions
rated+0.024
ses+0.022
Warning+0.022
 spine+0.022
keeping+0.022
Neg logit contributions
aid-0.010
brush-0.009
 friends-0.009
ades-0.009
 brand-0.009
TokenOnce
Token ID7454
Feature activation+4.13e-02

Pos logit contributions
rated+0.060
 generation+0.059
ses+0.058
 seem+0.058
 spine+0.057
Neg logit contributions
ades-0.041
in-0.034
uddle-0.034
ade-0.033
 handful-0.032
Token upon
Token ID2402
Feature activation-4.99e-02

Pos logit contributions
ades+0.075
 sho+0.067
brush+0.067
aid+0.067
uddle+0.066
Neg logit contributions
 You-0.059
|-0.058
 seem-0.053
 Today-0.049
Warning-0.049
Token a
Token ID257
Feature activation+1.72e-03

Pos logit contributions
ses+0.125
rated+0.123
 spine+0.121
Warning+0.120
 You+0.120
Neg logit contributions
 else-0.031
ades-0.030
 friends-0.025
 brand-0.024
 butter-0.023
Token time
Token ID640
Feature activation+1.68e-02

Pos logit contributions
ades+0.002
uddle+0.002
 sho+0.002
ade+0.002
brush+0.002
Neg logit contributions
 You-0.003
keeping-0.003
Warning-0.003
|-0.003
 seem-0.003
Token there
Token ID612
Feature activation+2.22e-02

Pos logit contributions
ades+0.012
uddle+0.008
 sho+0.008
brush+0.007
 penn+0.007
Neg logit contributions
|-0.042
 You-0.042
 generation-0.040
Warning-0.040
keeping-0.040
Token was
Token ID373
Feature activation-3.62e-02

Pos logit contributions
ades+0.035
uddle+0.031
 sho+0.030
ade+0.030
brush+0.030
Neg logit contributions
 You-0.036
|-0.035
 generation-0.033
Warning-0.033
keeping-0.033
Token a
Token ID257
Feature activation+3.26e-03

Pos logit contributions
rated+0.082
keeping+0.079
|+0.079
Warning+0.078
 You+0.078
Neg logit contributions
 sunscreen-0.032
 butter-0.029
ades-0.029
 scissors-0.028
 friends-0.028
Token robot
Token ID9379
Feature activation+7.62e-04

Pos logit contributions
ades+0.030
aid+0.024
uddle+0.024
brush+0.023
ade+0.023
Neg logit contributions
 You-0.035
|-0.034
 generation-0.032
 seem-0.032
 Today-0.031
Token friend
Token ID1545
Feature activation+1.88e-02

Pos logit contributions
ades+0.001
uddle+0.001
aid+0.001
 sho+0.001
brush+0.001
Neg logit contributions
 You-0.001
|-0.001
 seem-0.001
 generation-0.001
keeping-0.001
Token.
Token ID13
Feature activation+6.51e-03

Pos logit contributions
ades+0.039
uddle+0.035
 penn+0.033
aid+0.033
igs+0.033
Neg logit contributions
 You-0.025
|-0.023
 seem-0.021
You-0.019
keeping-0.019
Token<|endoftext|>
Token ID50256
Feature activation-2.67e-02

Pos logit contributions
ades+0.009
uddle+0.008
 moms+0.008
aid+0.008
ade+0.008
Neg logit contributions
rated-0.011
 generation-0.011
 seem-0.011
Warning-0.011
|-0.010
TokenOnce
Token ID7454
Feature activation+4.48e-02

Pos logit contributions
 generation+0.051
rated+0.051
 You+0.050
ses+0.049
 seem+0.048
Neg logit contributions
ades-0.033
ade-0.026
uddle-0.025
 friends-0.024
in-0.024
Token upon
Token ID2402
Feature activation-5.38e-02

Pos logit contributions
ades+0.084
brush+0.077
aid+0.077
 sho+0.076
 penn+0.073
Neg logit contributions
 You-0.061
|-0.060
 seem-0.056
 Today-0.053
Warning-0.049
Token a
Token ID257
Feature activation+3.26e-03

Pos logit contributions
rated+0.131
keeping+0.129
Warning+0.127
 spine+0.127
 generation+0.126
Neg logit contributions
ades-0.037
 else-0.034
 cream-0.033
 friends-0.032
 big-0.030
Token time
Token ID640
Feature activation+1.66e-02

Pos logit contributions
ades+0.005
uddle+0.004
 sho+0.004
ade+0.004
brush+0.004
Neg logit contributions
 You-0.006
|-0.006
keeping-0.005
Warning-0.005
 seem-0.005
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
ades+0.014
 sho+0.011
uddle+0.011
brush+0.011
aid+0.010
Neg logit contributions
 You-0.039
|-0.039
 seem-0.037
Warning-0.036
 generation-0.036
Token there
Token ID612
Feature activation+2.17e-02

Pos logit contributions
ades+0.020
 sho+0.017
uddle+0.016
brush+0.016
aid+0.015
Neg logit contributions
|-0.039
 You-0.039
 generation-0.037
keeping-0.036
 seem-0.036
Token was
Token ID373
Feature activation-3.28e-02

Pos logit contributions
ades+0.033
uddle+0.029
 sho+0.029
ade+0.028
brush+0.027
Neg logit contributions
 You-0.036
|-0.035
 generation-0.034
Warning-0.034
keeping-0.034
Token it
Token ID340
Feature activation-2.45e-04

Pos logit contributions
ades+0.018
uddle+0.016
aid+0.015
brush+0.015
 sho+0.015
Neg logit contributions
 You-0.017
|-0.015
 seem-0.015
Warning-0.014
 generation-0.014
Token safe
Token ID3338
Feature activation+1.65e-03

Pos logit contributions
 You+0.000
|+0.000
 seem+0.000
Warning+0.000
 generation+0.000
Neg logit contributions
ades-0.000
uddle-0.000
aid-0.000
brush-0.000
 sho-0.000
Token to
Token ID284
Feature activation-2.79e-02

Pos logit contributions
ades+0.002
 sho+0.002
uddle+0.002
aid+0.002
brush+0.002
Neg logit contributions
 You-0.003
|-0.003
 seem-0.003
 generation-0.003
keeping-0.003
Token go
Token ID467
Feature activation-1.53e-02

Pos logit contributions
 You+0.045
|+0.043
Warning+0.040
 seem+0.040
 generation+0.039
Neg logit contributions
ades-0.045
 sho-0.040
brush-0.040
uddle-0.039
aid-0.039
Token outside
Token ID2354
Feature activation-5.61e-03

Pos logit contributions
 You+0.030
|+0.030
 seem+0.028
Warning+0.028
 generation+0.027
Neg logit contributions
ades-0.019
uddle-0.016
 sho-0.016
brush-0.016
aid-0.016
Token when
Token ID618
Feature activation-4.64e-02

Pos logit contributions
 You+0.010
|+0.010
 seem+0.009
Warning+0.009
 generation+0.009
Neg logit contributions
ades-0.008
 sho-0.007
uddle-0.007
brush-0.007
aid-0.007
Token it
Token ID340
Feature activation-4.05e-02

Pos logit contributions
|+0.075
Warning+0.067
 You+0.066
keeping+0.065
 seem+0.064
Neg logit contributions
ades-0.072
 friends-0.066
 sho-0.066
 sunscreen-0.066
 scissors-0.066
Token is
Token ID318
Feature activation-2.71e-02

Pos logit contributions
keeping+0.053
 generation+0.051
Warning+0.049
|+0.047
 You+0.046
Neg logit contributions
uddle-0.076
ades-0.075
ade-0.071
 sho-0.070
brush-0.069
Token raining
Token ID43079
Feature activation-3.98e-02

Pos logit contributions
 You+0.033
|+0.033
 seem+0.030
 generation+0.028
 Today+0.027
Neg logit contributions
ades-0.056
uddle-0.050
aid-0.049
brush-0.049
 sho-0.048
Token?"
Token ID1701
Feature activation-4.19e-03

Pos logit contributions
 You+0.088
|+0.085
Warning+0.085
 generation+0.085
 seem+0.083
Neg logit contributions
ades-0.039
uddle-0.031
ade-0.030
brush-0.030
 cream-0.028
Token Lily
Token ID20037
Feature activation+1.99e-02

Pos logit contributions
|+0.007
 generation+0.007
 You+0.007
Warning+0.007
rated+0.007
Neg logit contributions
ades-0.006
uddle-0.005
brush-0.005
aid-0.005
 sho-0.005
Token playing
Token ID2712
Feature activation+9.79e-03

Pos logit contributions
|+0.030
 You+0.028
rated+0.027
Warning+0.027
 seem+0.027
Neg logit contributions
ades-0.020
 sho-0.018
 scissors-0.018
 brand-0.017
 sunscreen-0.017
Token too
Token ID1165
Feature activation-1.66e-02

Pos logit contributions
ades+0.014
 sho+0.012
uddle+0.012
brush+0.012
aid+0.011
Neg logit contributions
|-0.018
 You-0.018
 seem-0.017
Warning-0.016
 generation-0.016
Token much
Token ID881
Feature activation-6.92e-03

Pos logit contributions
|+0.026
 You+0.026
 seem+0.024
Warning+0.024
 generation+0.023
Neg logit contributions
ades-0.028
uddle-0.024
 sho-0.024
brush-0.024
ade-0.024
Token in
Token ID287
Feature activation-2.88e-02

Pos logit contributions
|+0.012
 You+0.011
Warning+0.010
 generation+0.010
 seem+0.010
Neg logit contributions
ades-0.011
brush-0.010
 sho-0.010
uddle-0.010
aid-0.010
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
|+0.061
 You+0.061
 seem+0.058
rated+0.057
Warning+0.056
Neg logit contributions
ades-0.030
 sho-0.026
 scissors-0.025
 penn-0.025
brush-0.024
Token dark
Token ID3223
Feature activation-4.14e-02

Pos logit contributions
|+0.043
 You+0.043
 seem+0.040
Warning+0.039
 generation+0.039
Neg logit contributions
ades-0.031
uddle-0.028
 sho-0.028
brush-0.027
ade-0.026
Token.
Token ID13
Feature activation+2.49e-04

Pos logit contributions
 You+0.070
|+0.067
 seem+0.066
Warning+0.063
 generation+0.061
Neg logit contributions
ades-0.059
brush-0.056
ade-0.054
uddle-0.053
 sho-0.052
Token Bun
Token ID28515
Feature activation-1.95e-02

Pos logit contributions
ades+0.000
 sho+0.000
uddle+0.000
brush+0.000
aid+0.000
Neg logit contributions
 You-0.000
|-0.000
 seem-0.000
Warning-0.000
 generation-0.000
Token Bun
Token ID28515
Feature activation-1.68e-02

Pos logit contributions
 You+0.035
|+0.033
Warning+0.032
 seem+0.032
 generation+0.031
Neg logit contributions
ades-0.028
 sho-0.026
uddle-0.026
aid-0.025
brush-0.024
Token promised
Token ID8072
Feature activation-1.35e-02

Pos logit contributions
 You+0.030
|+0.028
Warning+0.028
 generation+0.028
 seem+0.027
Neg logit contributions
ades-0.024
 sho-0.022
uddle-0.022
ade-0.021
aid-0.021
Token to
Token ID284
Feature activation+9.47e-03

Pos logit contributions
|+0.027
 You+0.027
 seem+0.025
Warning+0.025
 generation+0.025
Neg logit contributions
ades-0.017
 sho-0.014
brush-0.014
uddle-0.014
aid-0.014