TOP ACTIVATIONS
MAX = 8.123

and a walk ie - talk ie . He said ,
into the walk ie - talk ie . "
on the walk ie - talk ie and said , "
." He unh ooks Anna from the zip line
and a walk ie - talk ie to talk to each
hay to create little n ooks and cr ann ies ,
into the walk ie - talk ie . She had seen
holding a walk ie - talk ie . He was talking
and a walk ie - talk ie . "
so excited and when the mer maid saw him she gave
hear my whistle . Wh ist ling is not fun with
swimming around her . The mer maid smiled and said ,
get into all the n ooks and cr ann ies .
water . The mer maid smiled at Melody .
- up , " ins ist on standing still ! I
castle so much . The mer maid said , " I
that day on , Cr ust y tried his best to
made . The mer maid learnt that taking risks
sailed for days , exploring unch arted waters until he came
the garden shop . The gard ener gave her some special

MIN ACTIVATIONS
MIN = -6.992

are my lovely girl ." <|endoftext|> L ily and Tom are
fighting is not nice . <|endoftext|> L ila and Tom are
Happy birthday , me ." <|endoftext|> L ily liked to play
beautiful basket of flowers ." <|endoftext|> L ily and Max love
you too , Lily ." <|endoftext|> Anna and Ben are twins
new friend , too ." <|endoftext|> Anna and Ben were playing
Anna and Elsa too ." <|endoftext|> L ily and Ben were
read a book together ." <|endoftext|> Tom and Lily were best
want more squash ." <|endoftext|> <|endoftext|> <|endoftext|> L ily and Sam
lonely . The end . <|endoftext|> L ily and Tom were
sweet and wonderful twins ." <|endoftext|> L ily likes to write
we love our garage ." <|endoftext|> L ily was an adventurous
hockey . Good friends ." <|endoftext|> S ara and Ben were
are Mom and Dad . <|endoftext|> L ily and Ben were
but not too much . <|endoftext|> Anna and Ben were playing
is good , always ." <|endoftext|> S ara and Ben are
her little girl , " Would you like a puppy ?".
more squash ." <|endoftext|> <|endoftext|> <|endoftext|> L ily and Sam went
is useful and fun ." <|endoftext|> One day , a man
, too , Lily ." <|endoftext|> Tom and Mia are friends

INTERVAL 4.344 - 8.123
CONTAINS 0.294%

playing with him . Then an idea came to him .
upon a time , a nos y wolf was walking in
got very alert and called out to her mom my .
It is hard to get out . You are in big
The yacht sailed away with all the fun things the little

INTERVAL 0.566 - 4.344
CONTAINS 36.127%

the clouds . Sometimes , he would see a bird fly
light . She eagerly helped light the candle , and everyone
stepped back and said nothing . The little girl
tells his mom about his broken math book . His mom
to know how much they weigh before we make apple pie

INTERVAL -3.213 - 0.566
CONTAINS 61.676%

found a pretty flower . The bear decided to give the
the other trees . They felt happy and proud . They
wanted to swing higher and higher . He let go of
time for bed . But when Alex lay down , he
a nice bath and a nice bed time story . They

INTERVAL -6.992 - -3.213
CONTAINS 1.903%

friends . <|endoftext|> L ily and Ben were playing by the
's right . Now , don 't forget ."
her toy . From that day on , Z ippy made
have sweet dreams . Good night , sweet ie . I
make it smell nice . <|endoftext|> Once upon a time ,
Token and
Token ID290
Feature activation-2.40e+00

Pos logit contributions
ars+0.000
oster+0.000
Elsa+0.000
ers+0.000
Finally+0.000
Neg logit contributions
 Sus-0.000
 Suz-0.000
 Joe-0.000
 Mike-0.000
 microsc-0.000
Token a
Token ID257
Feature activation+1.85e+00

Pos logit contributions
ars+0.056
ers+0.051
oster+0.050
Elsa+0.046
ze+0.043
Neg logit contributions
 Suz-0.066
 Sus-0.066
arter-0.062
 microsc-0.060
 Su-0.059
Token walk
Token ID2513
Feature activation+1.13e-02

Pos logit contributions
 Suz+0.049
 Sus+0.049
 Joe+0.044
 Su+0.044
 microsc+0.044
Neg logit contributions
ars-0.045
oster-0.042
ers-0.040
Elsa-0.038
ze-0.036
Tokenie
Token ID494
Feature activation+6.46e-01

Pos logit contributions
 Sus+0.000
 Suz+0.000
 Joe+0.000
 Su+0.000
 Mike+0.000
Neg logit contributions
ars-0.000
oster-0.000
Elsa-0.000
ers-0.000
Finally-0.000
Token-
Token ID12
Feature activation+6.58e-01

Pos logit contributions
 Suz+0.014
 Sus+0.014
 Joe+0.014
 Su+0.013
 Jo+0.013
Neg logit contributions
Elsa-0.017
oster-0.016
ars-0.016
Finally-0.015
ers-0.014
Tokentalk
Token ID16620
Feature activation+8.12e+00

Pos logit contributions
 Sus+0.017
 Suz+0.016
 Su+0.015
 Joe+0.015
 Mike+0.015
Neg logit contributions
ars-0.014
Elsa-0.014
bes-0.014
oster-0.014
ze-0.013
Tokenie
Token ID494
Feature activation+1.12e+00

Pos logit contributions
 Suz+0.209
 Sus+0.206
 microsc+0.205
arter+0.200
 Nice+0.192
Neg logit contributions
ers-0.186
ars-0.185
oster-0.170
ze-0.150
Elsa-0.146
Token.
Token ID13
Feature activation+6.38e-01

Pos logit contributions
 Suz+0.023
 Sus+0.023
 Joe+0.023
 M+0.022
 Jo+0.021
Neg logit contributions
ars-0.031
oster-0.030
Elsa-0.030
Finally-0.029
Now-0.028
Token He
Token ID679
Feature activation-3.53e-01

Pos logit contributions
 Suz+0.014
 Sus+0.013
 Joe+0.012
 microsc+0.012
 Su+0.012
Neg logit contributions
ars-0.019
oster-0.018
ers-0.017
Elsa-0.017
Finally-0.016
Token said
Token ID531
Feature activation-1.81e+00

Pos logit contributions
ars+0.009
oster+0.009
ers+0.008
Elsa+0.008
ze+0.008
Neg logit contributions
 Suz-0.009
 Sus-0.009
 Joe-0.008
 Su-0.008
 microsc-0.008
Token,
Token ID11
Feature activation+4.01e-01

Pos logit contributions
ars+0.048
oster+0.044
ers+0.042
Elsa+0.041
Finally+0.039
Neg logit contributions
 Suz-0.045
 Sus-0.045
 Joe-0.041
 Su-0.040
 microsc-0.040
Token into
Token ID656
Feature activation+2.68e+00

Pos logit contributions
ars+0.052
oster+0.048
ers+0.046
Elsa+0.044
Finally+0.043
Neg logit contributions
 Suz-0.056
 Sus-0.056
 Joe-0.051
 Su-0.050
 Jill-0.048
Token the
Token ID262
Feature activation-5.50e-01

Pos logit contributions
 Suz+0.080
 Sus+0.078
arter+0.074
 microsc+0.072
 Su+0.072
Neg logit contributions
ars-0.052
ers-0.048
oster-0.048
Elsa-0.045
ze-0.041
Token walk
Token ID2513
Feature activation-4.52e-02

Pos logit contributions
ars+0.014
oster+0.013
ers+0.013
Elsa+0.012
Finally+0.011
Neg logit contributions
 Sus-0.014
 Suz-0.014
 microsc-0.013
 Joe-0.012
 Su-0.012
Tokenie
Token ID494
Feature activation+7.03e-01

Pos logit contributions
ars+0.001
oster+0.001
Elsa+0.001
Finally+0.001
ers+0.001
Neg logit contributions
 Sus-0.001
 Suz-0.001
 Joe-0.001
 Su-0.001
 Mike-0.001
Token-
Token ID12
Feature activation+5.48e-01

Pos logit contributions
 Suz+0.018
 Sus+0.018
 Joe+0.016
 M+0.016
 Su+0.015
Neg logit contributions
ars-0.019
oster-0.018
Finally-0.017
Elsa-0.017
Now-0.017
Tokentalk
Token ID16620
Feature activation+7.97e+00

Pos logit contributions
 Sus+0.014
 Suz+0.014
 Su+0.013
 Joe+0.012
 Mike+0.012
Neg logit contributions
ars-0.013
 End-0.012
bes-0.012
Elsa-0.011
oster-0.011
Tokenie
Token ID494
Feature activation+1.17e+00

Pos logit contributions
 Suz+0.208
 Sus+0.206
 microsc+0.204
arter+0.193
 Nice+0.191
Neg logit contributions
ers-0.168
ars-0.165
oster-0.157
ze-0.143
'-0.138
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
 Sus+0.026
 Suz+0.026
 Joe+0.024
 M+0.024
 Su+0.022
Neg logit contributions
ars-0.035
oster-0.033
Finally-0.032
Now-0.031
Eventually-0.031
Token\n
Token ID198
Feature activation+1.72e-01

Pos logit contributions
 Suz+0.008
 Sus+0.008
 Joe+0.007
 Su+0.007
 microsc+0.007
Neg logit contributions
ars-0.005
oster-0.005
ers-0.004
Elsa-0.004
ze-0.004
Token\n
Token ID198
Feature activation+3.08e-01

Pos logit contributions
 Suz+0.005
 Sus+0.005
 Joe+0.004
 Su+0.004
 M+0.004
Neg logit contributions
ars-0.004
oster-0.004
ers-0.004
jected-0.003
Elsa-0.003
Token"
Token ID1
Feature activation-3.19e+00

Pos logit contributions
 Suz+0.008
 Sus+0.008
 Jill+0.008
 Su+0.008
 Joe+0.008
Neg logit contributions
ars-0.006
oster-0.005
ers-0.005
ze-0.005
Elsa-0.005
Token on
Token ID319
Feature activation+1.08e+00

Pos logit contributions
 Suz+0.016
 Sus+0.016
 Joe+0.014
 Su+0.014
 M+0.014
Neg logit contributions
ars-0.013
oster-0.013
Elsa-0.012
Finally-0.011
ers-0.011
Token the
Token ID262
Feature activation-5.72e-01

Pos logit contributions
 Sus+0.025
 Suz+0.025
 M+0.022
 Su+0.022
 Joe+0.022
Neg logit contributions
ars-0.033
oster-0.030
jected-0.027
ers-0.026
Finally-0.026
Token walk
Token ID2513
Feature activation-8.38e-02

Pos logit contributions
ars+0.015
oster+0.014
ers+0.013
Elsa+0.013
Finally+0.012
Neg logit contributions
 Suz-0.014
 Sus-0.014
 microsc-0.013
 Joe-0.012
 Su-0.012
Tokenie
Token ID494
Feature activation+5.12e-01

Pos logit contributions
ars+0.002
oster+0.002
Elsa+0.002
Finally+0.002
Eventually+0.002
Neg logit contributions
 Suz-0.002
 Sus-0.002
 Joe-0.002
 Su-0.002
 Mike-0.002
Token-
Token ID12
Feature activation+2.20e-01

Pos logit contributions
 Suz+0.013
 Sus+0.012
 Joe+0.012
 M+0.011
 Su+0.011
Neg logit contributions
ars-0.013
Elsa-0.012
oster-0.012
Finally-0.012
Eventually-0.011
Tokentalk
Token ID16620
Feature activation+7.87e+00

Pos logit contributions
 Suz+0.005
 Sus+0.005
 Su+0.005
 Joe+0.005
 Mike+0.005
Neg logit contributions
ars-0.005
 End-0.005
Elsa-0.005
bes-0.004
oster-0.004
Tokenie
Token ID494
Feature activation+9.60e-01

Pos logit contributions
 Suz+0.201
 Sus+0.200
 microsc+0.192
arter+0.181
 Joe+0.180
Neg logit contributions
ers-0.181
ars-0.181
oster-0.163
ze-0.148
'-0.146
Token and
Token ID290
Feature activation-2.77e+00

Pos logit contributions
 Suz+0.022
 Sus+0.022
 Joe+0.021
 M+0.021
 Su+0.020
Neg logit contributions
ars-0.026
oster-0.025
Finally-0.024
Elsa-0.024
Eventually-0.024
Token said
Token ID531
Feature activation-2.14e+00

Pos logit contributions
ars+0.067
oster+0.061
ers+0.061
Elsa+0.056
ze+0.053
Neg logit contributions
 Suz-0.074
 Sus-0.073
arter-0.067
 microsc-0.066
 Su-0.066
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
ars+0.053
oster+0.049
ers+0.047
Elsa+0.044
Finally+0.042
Neg logit contributions
 Suz-0.057
 Sus-0.056
 Joe-0.051
 Su-0.051
 microsc-0.050
Token "
Token ID366
Feature activation-3.03e+00

Pos logit contributions
 Sus+0.000
 Suz+0.000
 Joe+0.000
 M+0.000
 Nice+0.000
Neg logit contributions
ars-0.001
oster-0.000
ers-0.000
Elsa-0.000
jected-0.000
Token."
Token ID526
Feature activation-1.50e+00

Pos logit contributions
ars+0.012
oster+0.011
ers+0.011
Elsa+0.010
ze+0.009
Neg logit contributions
 Suz-0.015
 Sus-0.015
 microsc-0.014
 Joe-0.014
 Su-0.014
Token\n
Token ID198
Feature activation-5.48e-01

Pos logit contributions
ars+0.038
oster+0.035
ers+0.034
ze+0.031
jected+0.031
Neg logit contributions
 Sus-0.041
 Suz-0.040
 Joe-0.038
 M-0.037
 Jill-0.036
Token\n
Token ID198
Feature activation-3.13e-01

Pos logit contributions
ars+0.016
ers+0.015
oster+0.015
ze+0.014
jected+0.014
Neg logit contributions
 Suz-0.011
 Sus-0.011
 Joe-0.010
 M-0.010
 Mike-0.010
TokenHe
Token ID1544
Feature activation-1.25e+00

Pos logit contributions
ars+0.006
ze+0.006
ers+0.006
oster+0.006
age+0.006
Neg logit contributions
 Sus-0.007
 Suz-0.007
 Jill-0.007
 M-0.007
 Jo-0.007
Token unh
Token ID14274
Feature activation+1.14e+00

Pos logit contributions
ars+0.034
oster+0.031
Elsa+0.029
Finally+0.029
ers+0.029
Neg logit contributions
 Suz-0.028
 Sus-0.027
 Joe-0.025
 M-0.025
 microsc-0.025
Tokenooks
Token ID31085
Feature activation+7.87e+00

Pos logit contributions
 Suz+0.031
 Sus+0.030
 Joe+0.028
 Su+0.027
arter+0.027
Neg logit contributions
ars-0.027
oster-0.025
ers-0.024
Elsa-0.023
ze-0.021
Token Anna
Token ID11735
Feature activation-3.26e-01

Pos logit contributions
 Suz+0.232
 Sus+0.230
 Joe+0.210
 microsc+0.209
 Su+0.209
Neg logit contributions
ars-0.170
oster-0.154
ers-0.150
Elsa-0.139
ze-0.129
Token from
Token ID422
Feature activation+2.64e+00

Pos logit contributions
oster+0.008
Finally+0.007
ars+0.007
Elsa+0.007
 End+0.007
Neg logit contributions
 M-0.007
 Joe-0.007
 Suz-0.007
 Sus-0.007
 microsc-0.007
Token the
Token ID262
Feature activation-5.91e-01

Pos logit contributions
 Sus+0.078
 Suz+0.078
arter+0.074
 Nice+0.073
 microsc+0.072
Neg logit contributions
ars-0.051
ers-0.049
oster-0.046
Elsa-0.043
 End-0.037
Token zip
Token ID19974
Feature activation+1.55e-01

Pos logit contributions
ars+0.014
oster+0.013
ers+0.012
Elsa+0.011
jected+0.011
Neg logit contributions
 Suz-0.016
 Sus-0.016
 microsc-0.015
 Su-0.015
 Joe-0.015
Token line
Token ID1627
Feature activation-1.35e+00

Pos logit contributions
 Suz+0.004
 Sus+0.004
 Joe+0.003
 Su+0.003
 Mike+0.003
Neg logit contributions
ars-0.004
oster-0.004
Elsa-0.004
Finally-0.004
Eventually-0.003
Token and
Token ID290
Feature activation-2.08e+00

Pos logit contributions
 Suz+0.011
 Sus+0.011
 Joe+0.010
 microsc+0.010
 Su+0.010
Neg logit contributions
ars-0.009
oster-0.008
Elsa-0.008
ers-0.008
Finally-0.007
Token a
Token ID257
Feature activation+1.62e+00

Pos logit contributions
ars+0.043
ers+0.038
oster+0.038
Elsa+0.033
ze+0.032
Neg logit contributions
 Suz-0.063
 Sus-0.062
arter-0.058
 Su-0.057
 Nice-0.056
Token walk
Token ID2513
Feature activation-3.42e-01

Pos logit contributions
 Suz+0.044
 Sus+0.043
 Su+0.040
 Joe+0.039
arter+0.038
Neg logit contributions
ars-0.039
oster-0.036
ers-0.034
Elsa-0.033
ze-0.031
Tokenie
Token ID494
Feature activation+8.07e-01

Pos logit contributions
ars+0.008
oster+0.008
Elsa+0.007
ers+0.007
Finally+0.007
Neg logit contributions
 Suz-0.009
 Sus-0.009
 Joe-0.008
 Su-0.008
 microsc-0.008
Token-
Token ID12
Feature activation+3.76e-01

Pos logit contributions
 Suz+0.019
 Sus+0.019
 Joe+0.018
 microsc+0.018
 Su+0.017
Neg logit contributions
ars-0.020
oster-0.019
Elsa-0.019
Finally-0.018
ers-0.018
Tokentalk
Token ID16620
Feature activation+7.75e+00

Pos logit contributions
 Sus+0.010
 Suz+0.010
 Su+0.009
 Joe+0.009
 microsc+0.009
Neg logit contributions
ars-0.008
Elsa-0.008
 End-0.008
oster-0.008
bes-0.007
Tokenie
Token ID494
Feature activation+1.17e+00

Pos logit contributions
 Suz+0.193
 Sus+0.188
 microsc+0.185
arter+0.179
 Joe+0.174
Neg logit contributions
ars-0.188
ers-0.185
oster-0.176
ze-0.157
Elsa-0.153
Token to
Token ID284
Feature activation-2.66e-01

Pos logit contributions
 Suz+0.029
 Sus+0.029
 Joe+0.027
 microsc+0.026
 Su+0.026
Neg logit contributions
ars-0.029
oster-0.027
Elsa-0.027
Finally-0.026
ers-0.025
Token talk
Token ID1561
Feature activation+2.40e+00

Pos logit contributions
ars+0.006
oster+0.005
ers+0.005
Elsa+0.005
ze+0.005
Neg logit contributions
 Suz-0.007
 Sus-0.007
arter-0.007
 microsc-0.007
 Su-0.007
Token to
Token ID284
Feature activation+9.00e-01

Pos logit contributions
 Suz+0.059
 Sus+0.059
 Joe+0.053
 Su+0.053
 microsc+0.053
Neg logit contributions
ars-0.063
oster-0.059
ers-0.056
Elsa-0.054
Finally-0.051
Token each
Token ID1123
Feature activation+2.04e-01

Pos logit contributions
 Suz+0.024
 Sus+0.024
 Joe+0.022
 microsc+0.022
 Su+0.022
Neg logit contributions
ars-0.021
oster-0.020
ers-0.019
Elsa-0.018
ze-0.017
Token hay
Token ID27678
Feature activation-2.90e+00

Pos logit contributions
ars+0.021
oster+0.019
ers+0.018
Elsa+0.018
jected+0.017
Neg logit contributions
 Suz-0.018
 Sus-0.017
 microsc-0.016
 Joe-0.016
 Su-0.015
Token to
Token ID284
Feature activation+2.08e-01

Pos logit contributions
ars+0.076
oster+0.070
Elsa+0.067
ers+0.064
Finally+0.062
Neg logit contributions
 Suz-0.074
 Sus-0.072
 Joe-0.068
 M-0.064
 Su-0.064
Token create
Token ID2251
Feature activation-3.64e-01

Pos logit contributions
 Suz+0.006
 Sus+0.006
 Joe+0.005
 Su+0.005
 microsc+0.005
Neg logit contributions
ars-0.005
oster-0.005
ers-0.005
Elsa-0.004
jected-0.004
Token little
Token ID1310
Feature activation-1.19e+00

Pos logit contributions
ars+0.009
oster+0.009
jected+0.008
Elsa+0.008
Finally+0.008
Neg logit contributions
 Suz-0.010
 Sus-0.010
 Joe-0.009
 Su-0.008
 M-0.008
Token n
Token ID299
Feature activation+2.17e+00

Pos logit contributions
ars+0.029
oster+0.026
ers+0.025
Elsa+0.024
Finally+0.022
Neg logit contributions
 Suz-0.032
 Sus-0.032
 Joe-0.029
 Su-0.029
 microsc-0.029
Tokenooks
Token ID31085
Feature activation+7.73e+00

Pos logit contributions
 Suz+0.056
 Sus+0.056
 microsc+0.050
 Su+0.050
 Joe+0.050
Neg logit contributions
ars-0.055
oster-0.051
ers-0.050
Elsa-0.046
ze-0.044
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
 Suz+0.237
 Sus+0.235
 Joe+0.216
 Su+0.215
 microsc+0.214
Neg logit contributions
ars-0.160
oster-0.145
ers-0.136
Elsa-0.129
Finally-0.121
Token cr
Token ID1067
Feature activation+4.12e+00

Pos logit contributions
ars+0.032
oster+0.029
ers+0.029
Elsa+0.027
Finally+0.026
Neg logit contributions
 Suz-0.031
 Sus-0.031
arter-0.028
 Su-0.027
 Joe-0.027
Tokenann
Token ID1236
Feature activation+4.54e+00

Pos logit contributions
 Suz+0.101
 Sus+0.100
 Joe+0.090
 Su+0.089
 microsc+0.089
Neg logit contributions
ars-0.109
oster-0.102
ers-0.098
Elsa-0.094
ze-0.088
Tokenies
Token ID444
Feature activation+1.38e+00

Pos logit contributions
 Suz+0.087
 Sus+0.084
 Joe+0.075
 microsc+0.073
 Su+0.073
Neg logit contributions
ars-0.143
oster-0.136
Elsa-0.131
ers-0.128
Finally-0.124
Token,
Token ID11
Feature activation+5.11e-01

Pos logit contributions
 Suz+0.035
 Sus+0.035
 Joe+0.032
 microsc+0.031
 Su+0.031
Neg logit contributions
ars-0.035
oster-0.034
Elsa-0.031
ers-0.031
Finally-0.030
Token into
Token ID656
Feature activation+2.55e+00

Pos logit contributions
ars+0.057
oster+0.053
ers+0.050
Elsa+0.048
Finally+0.047
Neg logit contributions
 Sus-0.054
 Suz-0.054
 Joe-0.049
 Su-0.048
 M-0.046
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
 Suz+0.078
 Sus+0.077
arter+0.071
 microsc+0.071
 Joe+0.071
Neg logit contributions
ars-0.051
oster-0.046
ers-0.045
Elsa-0.042
ze-0.038
Token walk
Token ID2513
Feature activation-2.82e-01

Pos logit contributions
ars+0.003
oster+0.003
ers+0.002
Elsa+0.002
Finally+0.002
Neg logit contributions
 Suz-0.003
 Sus-0.003
 microsc-0.002
 Joe-0.002
 Su-0.002
Tokenie
Token ID494
Feature activation+5.05e-01

Pos logit contributions
ars+0.007
oster+0.007
Elsa+0.006
Finally+0.006
Eventually+0.006
Neg logit contributions
 Sus-0.007
 Suz-0.007
 Su-0.007
 Joe-0.007
 Mike-0.006
Token-
Token ID12
Feature activation+3.21e-01

Pos logit contributions
 Suz+0.012
 Sus+0.012
 Joe+0.011
 M+0.010
 Su+0.010
Neg logit contributions
ars-0.014
oster-0.014
Finally-0.013
Now-0.013
Elsa-0.013
Tokentalk
Token ID16620
Feature activation+7.69e+00

Pos logit contributions
 Suz+0.008
 Sus+0.008
 Su+0.007
 Joe+0.007
 Jan+0.007
Neg logit contributions
ars-0.007
 End-0.007
Elsa-0.007
oster-0.007
bes-0.007
Tokenie
Token ID494
Feature activation+8.56e-01

Pos logit contributions
 Suz+0.198
 Sus+0.197
 microsc+0.193
arter+0.181
 Joe+0.180
Neg logit contributions
ers-0.171
ars-0.170
oster-0.163
ze-0.147
Elsa-0.141
Token.
Token ID13
Feature activation+3.57e-01

Pos logit contributions
 Suz+0.018
 Sus+0.018
 Joe+0.017
 M+0.016
 Su+0.016
Neg logit contributions
ars-0.026
oster-0.025
Finally-0.024
Now-0.024
Eventually-0.023
Token She
Token ID1375
Feature activation+9.57e-02

Pos logit contributions
 Suz+0.009
 Sus+0.009
 microsc+0.008
arter+0.008
 Su+0.008
Neg logit contributions
ars-0.009
oster-0.008
ers-0.008
Elsa-0.007
Finally-0.007
Token had
Token ID550
Feature activation+1.07e+00

Pos logit contributions
 Suz+0.002
 Sus+0.002
 Joe+0.002
 Su+0.002
 microsc+0.002
Neg logit contributions
ars-0.003
oster-0.002
ers-0.002
Elsa-0.002
Finally-0.002
Token seen
Token ID1775
Feature activation+1.39e+00

Pos logit contributions
 Suz+0.032
 Sus+0.031
 Joe+0.029
 Su+0.029
 microsc+0.029
Neg logit contributions
ars-0.023
oster-0.021
ers-0.020
Elsa-0.019
Finally-0.017
Token holding
Token ID4769
Feature activation+1.49e-01

Pos logit contributions
ars+0.004
oster+0.004
ers+0.004
Elsa+0.004
ze+0.004
Neg logit contributions
 Suz-0.004
 Sus-0.004
 microsc-0.004
 Joe-0.004
 Su-0.004
Token a
Token ID257
Feature activation+2.55e+00

Pos logit contributions
 Sus+0.004
 Suz+0.004
 Joe+0.004
 Su+0.004
 microsc+0.004
Neg logit contributions
ars-0.004
oster-0.004
ers-0.003
Elsa-0.003
Finally-0.003
Token walk
Token ID2513
Feature activation-2.80e-01

Pos logit contributions
 Suz+0.075
 Sus+0.073
 Joe+0.067
 Su+0.067
 microsc+0.066
Neg logit contributions
ars-0.057
oster-0.051
ers-0.049
Elsa-0.046
ze-0.043
Tokenie
Token ID494
Feature activation+8.22e-01

Pos logit contributions
ars+0.007
oster+0.006
Elsa+0.006
Finally+0.006
ers+0.006
Neg logit contributions
 Suz-0.007
 Sus-0.007
 Joe-0.007
 Su-0.007
 microsc-0.006
Token-
Token ID12
Feature activation+6.34e-01

Pos logit contributions
 Suz+0.020
 Sus+0.020
 Joe+0.019
 microsc+0.018
 Su+0.018
Neg logit contributions
ars-0.021
oster-0.021
Elsa-0.020
Finally-0.020
Eventually-0.019
Tokentalk
Token ID16620
Feature activation+7.67e+00

Pos logit contributions
 Sus+0.016
 Suz+0.016
 Su+0.015
 Joe+0.015
 Jill+0.014
Neg logit contributions
ars-0.014
oster-0.014
bes-0.014
Elsa-0.013
 End-0.013
Tokenie
Token ID494
Feature activation+1.34e+00

Pos logit contributions
 Suz+0.196
 Sus+0.193
 microsc+0.181
arter+0.176
 Joe+0.172
Neg logit contributions
ars-0.185
ers-0.178
oster-0.166
ze-0.150
'-0.146
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
 Suz+0.030
 Sus+0.030
 Joe+0.028
 M+0.027
 microsc+0.027
Neg logit contributions
ars-0.038
oster-0.036
Finally-0.034
Elsa-0.034
Eventually-0.033
Token He
Token ID679
Feature activation+4.58e-01

Pos logit contributions
 Suz+0.032
 Sus+0.032
arter+0.031
 microsc+0.030
yt+0.029
Neg logit contributions
ars-0.029
ers-0.027
oster-0.026
Elsa-0.024
Finally-0.023
Token was
Token ID373
Feature activation-2.00e-01

Pos logit contributions
 Suz+0.012
 Sus+0.012
arter+0.011
 microsc+0.011
 Su+0.011
Neg logit contributions
ars-0.011
oster-0.010
ers-0.010
Elsa-0.009
ze-0.009
Token talking
Token ID3375
Feature activation+1.59e+00

Pos logit contributions
ars+0.005
oster+0.005
ers+0.005
Elsa+0.004
ze+0.004
Neg logit contributions
 Suz-0.005
 Sus-0.005
 microsc-0.005
 Joe-0.005
 Su-0.005
Token and
Token ID290
Feature activation-2.30e+00

Pos logit contributions
 Suz+0.014
 Sus+0.014
 Joe+0.013
 Su+0.013
 microsc+0.013
Neg logit contributions
ars-0.010
oster-0.009
ers-0.009
Elsa-0.009
Finally-0.008
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
ars+0.051
ers+0.046
oster+0.045
Elsa+0.041
ze+0.039
Neg logit contributions
 Suz-0.066
 Sus-0.065
arter-0.061
 Su-0.059
 microsc-0.059
Token walk
Token ID2513
Feature activation-3.25e-01

Pos logit contributions
 Suz+0.040
 Sus+0.039
 Joe+0.036
 Su+0.036
 microsc+0.035
Neg logit contributions
ars-0.038
oster-0.035
ers-0.034
Elsa-0.032
ze-0.030
Tokenie
Token ID494
Feature activation+6.39e-01

Pos logit contributions
ars+0.008
oster+0.007
Elsa+0.007
ers+0.007
Finally+0.006
Neg logit contributions
 Suz-0.009
 Sus-0.009
 Joe-0.008
 Su-0.008
 microsc-0.008
Token-
Token ID12
Feature activation+4.17e-01

Pos logit contributions
 Suz+0.015
 Sus+0.015
 Joe+0.014
 microsc+0.014
 Su+0.013
Neg logit contributions
ars-0.016
oster-0.016
Elsa-0.015
Finally-0.014
ers-0.014
Tokentalk
Token ID16620
Feature activation+7.62e+00

Pos logit contributions
 Suz+0.011
 Sus+0.011
 Su+0.010
 Joe+0.010
 Jan+0.010
Neg logit contributions
 End-0.009
ars-0.009
Elsa-0.008
oster-0.008
bes-0.008
Tokenie
Token ID494
Feature activation+1.02e+00

Pos logit contributions
 Suz+0.194
 microsc+0.188
 Sus+0.188
arter+0.182
 Joe+0.174
Neg logit contributions
ars-0.178
ers-0.178
oster-0.167
ze-0.151
Elsa-0.144
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
 Suz+0.025
 Sus+0.025
 Joe+0.024
 microsc+0.023
 M+0.023
Neg logit contributions
ars-0.026
oster-0.024
Elsa-0.023
Finally-0.023
ers-0.022
Token\n
Token ID198
Feature activation+6.29e-01

Pos logit contributions
 Suz+0.033
 Sus+0.033
arter+0.030
 microsc+0.030
 Joe+0.030
Neg logit contributions
ars-0.024
oster-0.021
ers-0.021
Elsa-0.020
Finally-0.018
Token\n
Token ID198
Feature activation+7.30e-01

Pos logit contributions
 Sus+0.018
 Suz+0.018
 Su+0.016
 Joe+0.016
 Mike+0.016
Neg logit contributions
ars-0.014
oster-0.013
ers-0.012
Elsa-0.012
ze-0.011
Token"
Token ID1
Feature activation-2.65e+00

Pos logit contributions
 Sus+0.021
 Suz+0.020
 Su+0.020
 Joe+0.018
 Mike+0.018
Neg logit contributions
ars-0.015
oster-0.013
ers-0.013
Elsa-0.012
ze-0.012
Token so
Token ID523
Feature activation-6.08e-01

Pos logit contributions
ars+0.009
oster+0.008
ers+0.008
Elsa+0.007
Finally+0.007
Neg logit contributions
 Sus-0.009
 Suz-0.009
 Joe-0.008
 Su-0.008
 Joey-0.008
Token excited
Token ID6568
Feature activation-1.31e+00

Pos logit contributions
ars+0.014
oster+0.013
ers+0.012
Elsa+0.011
ze+0.011
Neg logit contributions
 Sus-0.017
 Suz-0.017
 Joe-0.016
 Su-0.016
 microsc-0.015
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
ars+0.029
ers+0.026
oster+0.026
Elsa+0.024
ze+0.023
Neg logit contributions
 Suz-0.037
 Sus-0.036
 microsc-0.034
arter-0.033
 Su-0.033
Token when
Token ID618
Feature activation+8.23e-01

Pos logit contributions
ars+0.034
oster+0.032
ers+0.031
Elsa+0.029
ze+0.028
Neg logit contributions
 Suz-0.033
 Sus-0.032
arter-0.030
 microsc-0.030
 Su-0.029
Token the
Token ID262
Feature activation-3.05e-01

Pos logit contributions
arter+0.021
 Suz+0.020
 Sus+0.019
 microsc+0.019
 Su+0.018
Neg logit contributions
ars-0.017
ers-0.016
oster-0.016
Elsa-0.015
Finally-0.015
Token mer
Token ID4017
Feature activation+7.61e+00

Pos logit contributions
ars+0.008
oster+0.007
ers+0.007
Elsa+0.006
Finally+0.006
Neg logit contributions
 Sus-0.008
 Suz-0.008
 Joe-0.007
 microsc-0.007
 Su-0.007
Tokenmaid
Token ID23151
Feature activation+3.02e+00

Pos logit contributions
 Suz+0.173
 Sus+0.166
 Su+0.158
 microsc+0.154
 Joe+0.153
Neg logit contributions
ars-0.214
oster-0.204
ers-0.199
Elsa-0.184
ze-0.181
Token saw
Token ID2497
Feature activation+1.53e+00

Pos logit contributions
 Suz+0.077
 Sus+0.075
arter+0.068
 Su+0.068
 microsc+0.068
Neg logit contributions
ars-0.077
oster-0.070
ers-0.069
Elsa-0.065
ze-0.061
Token him
Token ID683
Feature activation+4.20e-01

Pos logit contributions
 Suz+0.039
 Sus+0.038
 microsc+0.035
arter+0.035
 Su+0.034
Neg logit contributions
ars-0.039
oster-0.035
ers-0.035
Elsa-0.032
ze-0.031
Token she
Token ID673
Feature activation+4.29e-02

Pos logit contributions
 Suz+0.011
 Sus+0.011
 Joe+0.010
 Su+0.010
 microsc+0.010
Neg logit contributions
ars-0.010
oster-0.010
ers-0.009
Elsa-0.009
Finally-0.008
Token gave
Token ID2921
Feature activation+1.85e-01

Pos logit contributions
 Suz+0.001
 Sus+0.001
 microsc+0.001
 Su+0.001
 Joe+0.001
Neg logit contributions
ars-0.001
oster-0.001
ers-0.001
Elsa-0.001
ze-0.001
Token hear
Token ID3285
Feature activation+2.11e+00

Pos logit contributions
 Suz+0.044
 Sus+0.044
 Joe+0.041
arter+0.040
 Su+0.039
Neg logit contributions
ars-0.039
ers-0.037
oster-0.036
Elsa-0.034
Finally-0.032
Token my
Token ID616
Feature activation-5.15e-01

Pos logit contributions
 Suz+0.059
 Sus+0.059
 Joe+0.054
 Su+0.053
 microsc+0.053
Neg logit contributions
ars-0.049
oster-0.045
ers-0.043
Elsa-0.040
Finally-0.038
Token whistle
Token ID16121
Feature activation-1.00e+00

Pos logit contributions
ars+0.013
oster+0.012
ers+0.012
Elsa+0.011
Finally+0.011
Neg logit contributions
 Suz-0.013
 Sus-0.013
 Joe-0.012
 Su-0.012
 M-0.012
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
ars+0.026
oster+0.024
Elsa+0.024
ers+0.023
Finally+0.023
Neg logit contributions
 Suz-0.023
 Sus-0.022
 Joe-0.021
arter-0.020
 M-0.020
Token Wh
Token ID854
Feature activation+1.67e+00

Pos logit contributions
 Suz+0.038
 Sus+0.037
 microsc+0.034
 Joe+0.034
 Su+0.033
Neg logit contributions
ars-0.034
oster-0.031
ers-0.030
Elsa-0.029
ze-0.027
Tokenist
Token ID396
Feature activation+7.59e+00

Pos logit contributions
 Suz+0.043
 Sus+0.042
arter+0.039
 Joe+0.039
 Su+0.038
Neg logit contributions
ars-0.041
oster-0.040
Elsa-0.037
Finally-0.037
Eventually-0.035
Tokenling
Token ID1359
Feature activation+2.40e+00

Pos logit contributions
 Suz+0.217
 Sus+0.215
 Joe+0.196
 microsc+0.196
 Su+0.195
Neg logit contributions
ars-0.171
oster-0.155
ers-0.151
Elsa-0.139
ze-0.132
Token is
Token ID318
Feature activation+1.76e-01

Pos logit contributions
 Suz+0.060
 Sus+0.058
 Joe+0.055
 M+0.054
 Su+0.053
Neg logit contributions
ars-0.056
oster-0.054
Elsa-0.052
Finally-0.051
ers-0.049
Token not
Token ID407
Feature activation+4.29e+00

Pos logit contributions
 Suz+0.005
 Sus+0.005
 microsc+0.004
 Su+0.004
 Joe+0.004
Neg logit contributions
ars-0.004
oster-0.004
ers-0.004
Elsa-0.004
ze-0.003
Token fun
Token ID1257
Feature activation-9.28e-01

Pos logit contributions
 Suz+0.113
 Sus+0.112
 Joe+0.101
 microsc+0.101
 Su+0.101
Neg logit contributions
ars-0.106
oster-0.098
ers-0.095
Elsa-0.089
ze-0.084
Token with
Token ID351
Feature activation+1.62e+00

Pos logit contributions
ars+0.021
oster+0.019
ers+0.019
Elsa+0.017
ze+0.017
Neg logit contributions
 Suz-0.027
 Sus-0.026
 microsc-0.024
 Su-0.024
 Joe-0.024
Token swimming
Token ID14899
Feature activation-1.12e+00

Pos logit contributions
 Suz+0.026
 Sus+0.025
 Joe+0.023
 Su+0.023
 microsc+0.023
Neg logit contributions
ars-0.031
oster-0.029
ers-0.028
Elsa-0.027
ze-0.026
Token around
Token ID1088
Feature activation+1.11e-01

Pos logit contributions
ars+0.023
oster+0.021
ers+0.021
Elsa+0.018
ze+0.017
Neg logit contributions
 Suz-0.034
 Sus-0.034
 microsc-0.031
arter-0.031
 Su-0.031
Token her
Token ID607
Feature activation+7.47e-01

Pos logit contributions
 Suz+0.003
 Sus+0.003
 Joe+0.003
 Su+0.003
 microsc+0.003
Neg logit contributions
ars-0.003
oster-0.002
ers-0.002
Elsa-0.002
Finally-0.002
Token.
Token ID13
Feature activation+4.94e-01

Pos logit contributions
 Suz+0.020
 Sus+0.020
 Joe+0.019
 microsc+0.018
 Su+0.018
Neg logit contributions
ars-0.018
oster-0.016
ers-0.016
Elsa-0.015
Finally-0.014
Token The
Token ID383
Feature activation-1.19e+00

Pos logit contributions
 Suz+0.013
arter+0.013
 Sus+0.013
 microsc+0.013
 Su+0.012
Neg logit contributions
ars-0.010
oster-0.009
ers-0.009
Elsa-0.009
Finally-0.009
Token mer
Token ID4017
Feature activation+7.57e+00

Pos logit contributions
ars+0.040
oster+0.037
ers+0.036
Elsa+0.035
Finally+0.034
Neg logit contributions
 Suz-0.021
 Sus-0.021
 Joe-0.018
 microsc-0.018
 Su-0.017
Tokenmaid
Token ID23151
Feature activation+3.12e+00

Pos logit contributions
 Suz+0.175
 Sus+0.171
 Su+0.168
 microsc+0.163
 Joe+0.161
Neg logit contributions
ars-0.201
oster-0.196
ers-0.190
ze-0.174
Elsa-0.168
Token smiled
Token ID13541
Feature activation+6.95e-01

Pos logit contributions
 Suz+0.085
 Sus+0.084
arter+0.084
 microsc+0.080
 Su+0.080
Neg logit contributions
ars-0.069
oster-0.063
ers-0.061
Elsa-0.055
ze-0.054
Token and
Token ID290
Feature activation-2.91e+00

Pos logit contributions
 Suz+0.022
 Sus+0.022
 microsc+0.020
 Su+0.020
 Joe+0.020
Neg logit contributions
ars-0.013
oster-0.012
ers-0.011
Elsa-0.010
ze-0.010
Token said
Token ID531
Feature activation-1.69e+00

Pos logit contributions
ars+0.058
oster+0.052
ers+0.051
Elsa+0.046
ze+0.044
Neg logit contributions
arter-0.084
 microsc-0.081
 Suz-0.080
 Sus-0.080
 Nice-0.075
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
ars+0.045
oster+0.041
ers+0.040
Elsa+0.038
Finally+0.036
Neg logit contributions
 Sus-0.043
 Suz-0.043
 Joe-0.039
 Su-0.038
 microsc-0.037
Token get
Token ID651
Feature activation+9.15e-01

Pos logit contributions
 Suz+0.021
 Sus+0.021
 microsc+0.020
arter+0.019
 Nice+0.019
Neg logit contributions
ars-0.018
oster-0.016
ers-0.016
Elsa-0.015
ze-0.015
Token into
Token ID656
Feature activation+2.51e+00

Pos logit contributions
 Sus+0.024
 Suz+0.024
 Joe+0.022
 Su+0.022
 microsc+0.022
Neg logit contributions
ars-0.023
oster-0.021
ers-0.020
Elsa-0.019
Finally-0.018
Token all
Token ID477
Feature activation+5.34e+00

Pos logit contributions
 Suz+0.069
 Sus+0.068
arter+0.063
 Su+0.062
 Joe+0.062
Neg logit contributions
ars-0.058
ers-0.052
oster-0.052
Elsa-0.048
ze-0.046
Token the
Token ID262
Feature activation-6.70e-01

Pos logit contributions
 Suz+0.146
arter+0.144
 Sus+0.142
 Nice+0.140
 microsc+0.139
Neg logit contributions
ars-0.113
ers-0.110
oster-0.098
Elsa-0.091
Now-0.088
Token n
Token ID299
Feature activation+3.54e+00

Pos logit contributions
ars+0.017
oster+0.016
ers+0.016
Elsa+0.015
ze+0.014
Neg logit contributions
 Suz-0.017
 Sus-0.017
 Joe-0.015
 Su-0.015
 microsc-0.015
Tokenooks
Token ID31085
Feature activation+7.56e+00

Pos logit contributions
 Suz+0.082
 Sus+0.082
 Joe+0.074
 Su+0.073
 microsc+0.073
Neg logit contributions
ars-0.099
oster-0.092
ers-0.090
Elsa-0.084
Finally-0.080
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
 Suz+0.245
 Sus+0.242
 microsc+0.223
 Su+0.222
 Joe+0.222
Neg logit contributions
ars-0.140
oster-0.124
ers-0.123
Elsa-0.109
ze-0.101
Token cr
Token ID1067
Feature activation+4.11e+00

Pos logit contributions
ars+0.026
oster+0.024
ers+0.023
Elsa+0.022
Finally+0.021
Neg logit contributions
 Suz-0.027
 Sus-0.027
arter-0.025
 Su-0.024
 Joe-0.024
Tokenann
Token ID1236
Feature activation+4.48e+00

Pos logit contributions
 Suz+0.101
 Sus+0.100
 Joe+0.089
 microsc+0.089
 Su+0.089
Neg logit contributions
ars-0.109
oster-0.102
ers-0.098
Elsa-0.094
ze-0.088
Tokenies
Token ID444
Feature activation+1.59e+00

Pos logit contributions
 Suz+0.086
 Sus+0.085
 Joe+0.074
 Su+0.073
 microsc+0.072
Neg logit contributions
ars-0.140
oster-0.136
Elsa-0.130
ers-0.126
Finally-0.123
Token.
Token ID13
Feature activation+5.14e-01

Pos logit contributions
 Suz+0.041
 Sus+0.040
 Joe+0.037
 Su+0.036
 microsc+0.036
Neg logit contributions
ars-0.041
oster-0.039
ers-0.036
Elsa-0.036
Finally-0.034
Token water
Token ID1660
Feature activation-9.72e-01

Pos logit contributions
 Suz+0.002
 Sus+0.002
arter+0.002
 Su+0.002
 Joe+0.002
Neg logit contributions
ars-0.002
ers-0.002
oster-0.001
Elsa-0.001
ze-0.001
Token.
Token ID13
Feature activation+4.87e-01

Pos logit contributions
ars+0.023
oster+0.021
ers+0.020
Elsa+0.020
Finally+0.018
Neg logit contributions
 Suz-0.026
 Sus-0.026
 Joe-0.024
 Su-0.024
 microsc-0.024
Token\n
Token ID198
Feature activation+6.75e-01

Pos logit contributions
 Sus+0.013
 Suz+0.012
 Jill+0.012
 J+0.012
 Joe+0.012
Neg logit contributions
oster-0.013
ars-0.013
ers-0.012
ze-0.011
jected-0.010
Token\n
Token ID198
Feature activation+5.28e-01

Pos logit contributions
 Sus+0.015
 Suz+0.015
 Joe+0.014
 J+0.014
 Jill+0.014
Neg logit contributions
oster-0.018
ars-0.018
ers-0.018
ze-0.017
jected-0.016
TokenThe
Token ID464
Feature activation-1.56e+00

Pos logit contributions
 J+0.014
 Jill+0.014
 Jan+0.014
 Sus+0.014
 microsc+0.013
Neg logit contributions
oster-0.010
ers-0.010
ars-0.009
oscope-0.009
bes-0.009
Token mer
Token ID4017
Feature activation+7.46e+00

Pos logit contributions
ars+0.045
oster+0.042
ers+0.040
Elsa+0.038
Finally+0.037
Neg logit contributions
 Sus-0.035
 Suz-0.035
 microsc-0.031
 Joe-0.031
 Joey-0.030
Tokenmaid
Token ID23151
Feature activation+2.56e+00

Pos logit contributions
 Suz+0.172
 Sus+0.165
 Su+0.158
 Joe+0.149
 microsc+0.146
Neg logit contributions
ars-0.215
oster-0.202
ers-0.198
Elsa-0.181
ze-0.177
Token smiled
Token ID13541
Feature activation+7.20e-01

Pos logit contributions
 Suz+0.066
 Sus+0.066
arter+0.060
 microsc+0.060
 Su+0.060
Neg logit contributions
ars-0.064
oster-0.059
ers-0.057
Elsa-0.053
ze-0.050
Token at
Token ID379
Feature activation+7.17e-01

Pos logit contributions
 Suz+0.022
 Sus+0.022
 microsc+0.020
 Su+0.020
 Joe+0.020
Neg logit contributions
ars-0.015
oster-0.013
ers-0.013
Elsa-0.012
ze-0.011
Token Melody
Token ID38906
Feature activation+1.90e+00

Pos logit contributions
 Sus+0.017
 Suz+0.016
 Joe+0.015
 microsc+0.015
 J+0.015
Neg logit contributions
ars-0.021
oster-0.020
ers-0.018
Elsa-0.018
ze-0.017
Token.
Token ID13
Feature activation+5.95e-01

Pos logit contributions
 Sus+0.047
 Suz+0.045
 Joe+0.044
 M+0.042
 Jo+0.040
Neg logit contributions
ars-0.050
oster-0.049
Elsa-0.044
Finally-0.044
ers-0.043
Token-
Token ID12
Feature activation-9.29e-02

Pos logit contributions
 Sus+0.046
 Suz+0.046
 Su+0.041
 Joe+0.041
 microsc+0.041
Neg logit contributions
ars-0.038
oster-0.034
Elsa-0.031
ers-0.030
Finally-0.030
Tokenup
Token ID929
Feature activation+1.87e+00

Pos logit contributions
ars+0.002
Elsa+0.002
 End+0.002
bes+0.002
oster+0.002
Neg logit contributions
 Sus-0.003
 Suz-0.002
 Su-0.002
 Joe-0.002
 J-0.002
Token,
Token ID11
Feature activation+2.45e-01

Pos logit contributions
 Sus+0.046
 Suz+0.045
 Joe+0.041
 Su+0.040
 microsc+0.040
Neg logit contributions
ars-0.050
oster-0.047
Elsa-0.044
Finally-0.044
Eventually-0.041
Token "
Token ID366
Feature activation-2.13e+00

Pos logit contributions
 Suz+0.006
 Sus+0.006
 Joe+0.005
 Su+0.005
 microsc+0.005
Neg logit contributions
ars-0.007
oster-0.006
ers-0.006
Elsa-0.006
Finally-0.006
Tokenins
Token ID1040
Feature activation+5.46e-01

Pos logit contributions
ars+0.047
ers+0.043
oster+0.042
bes+0.037
Elsa+0.037
Neg logit contributions
 Sus-0.061
 Suz-0.060
 Su-0.055
 Mike-0.054
 Joe-0.054
Tokenist
Token ID396
Feature activation+7.45e+00

Pos logit contributions
 Suz+0.013
 Sus+0.013
 Joe+0.012
 Su+0.012
 microsc+0.011
Neg logit contributions
ars-0.014
oster-0.013
ers-0.013
Elsa-0.013
Finally-0.012
Token on
Token ID319
Feature activation+3.98e-03

Pos logit contributions
 Suz+0.182
 Sus+0.181
 Joe+0.162
 microsc+0.161
 Su+0.161
Neg logit contributions
ars-0.198
oster-0.183
ers-0.179
Elsa-0.169
ze-0.159
Token standing
Token ID5055
Feature activation-1.15e+00

Pos logit contributions
 Suz+0.000
 Sus+0.000
 Joe+0.000
 Su+0.000
 microsc+0.000
Neg logit contributions
ars-0.000
oster-0.000
ers-0.000
Elsa-0.000
Finally-0.000
Token still
Token ID991
Feature activation-1.05e-01

Pos logit contributions
ars+0.028
oster+0.026
Elsa+0.024
ers+0.023
Finally+0.022
Neg logit contributions
 Sus-0.031
 Suz-0.031
 Joe-0.029
 Su-0.028
 microsc-0.028
Token!
Token ID0
Feature activation+8.03e-01

Pos logit contributions
ars+0.003
oster+0.003
ers+0.002
Elsa+0.002
Finally+0.002
Neg logit contributions
 Sus-0.003
 Suz-0.003
 Joe-0.002
 Su-0.002
 microsc-0.002
Token I
Token ID314
Feature activation+3.86e-02

Pos logit contributions
 Suz+0.019
 Sus+0.019
 Joe+0.017
 Su+0.017
 microsc+0.017
Neg logit contributions
ars-0.022
oster-0.020
ers-0.020
Elsa-0.019
ze-0.018
Token castle
Token ID16669
Feature activation-1.13e+00

Pos logit contributions
 Suz+0.015
 Sus+0.014
 Su+0.013
 Joe+0.013
 microsc+0.013
Neg logit contributions
ars-0.014
oster-0.013
ers-0.013
Elsa-0.012
ze-0.012
Token so
Token ID523
Feature activation-9.33e-01

Pos logit contributions
ars+0.027
oster+0.025
ers+0.024
Elsa+0.023
Finally+0.021
Neg logit contributions
 Suz-0.031
 Sus-0.030
 Joe-0.027
 Su-0.027
 microsc-0.027
Token much
Token ID881
Feature activation-4.46e-01

Pos logit contributions
ars+0.021
oster+0.020
ers+0.018
Elsa+0.017
ze+0.017
Neg logit contributions
 Suz-0.024
 Sus-0.024
 microsc-0.022
 Su-0.022
arter-0.022
Token.
Token ID13
Feature activation+4.91e-01

Pos logit contributions
ars+0.009
ers+0.008
oster+0.008
Elsa+0.007
ze+0.007
Neg logit contributions
 Suz-0.014
 Sus-0.013
 Su-0.012
 microsc-0.012
arter-0.012
Token The
Token ID383
Feature activation-1.39e+00

Pos logit contributions
 Suz+0.012
 Sus+0.012
 microsc+0.011
 Su+0.011
arter+0.011
Neg logit contributions
ars-0.013
oster-0.012
ers-0.011
Elsa-0.011
Finally-0.010
Token mer
Token ID4017
Feature activation+7.41e+00

Pos logit contributions
ars+0.044
oster+0.041
ers+0.040
Elsa+0.039
ze+0.037
Neg logit contributions
 Suz-0.027
 Sus-0.026
 Su-0.023
 Joe-0.023
arter-0.023
Tokenmaid
Token ID23151
Feature activation+2.77e+00

Pos logit contributions
 Suz+0.173
 Sus+0.167
 Su+0.163
 Joe+0.154
 microsc+0.153
Neg logit contributions
ars-0.206
oster-0.195
ers-0.193
ze-0.174
Elsa-0.173
Token said
Token ID531
Feature activation-1.87e+00

Pos logit contributions
 Suz+0.075
 Sus+0.074
arter+0.069
 Su+0.068
 microsc+0.068
Neg logit contributions
ars-0.065
oster-0.060
ers-0.058
Elsa-0.053
ze-0.052
Token,
Token ID11
Feature activation+4.97e-01

Pos logit contributions
ars+0.051
oster+0.046
ers+0.043
Elsa+0.043
Finally+0.041
Neg logit contributions
 Sus-0.049
 Suz-0.048
 Joe-0.045
 Su-0.043
 Joey-0.042
Token "
Token ID366
Feature activation-2.51e+00

Pos logit contributions
 Sus+0.010
 Suz+0.010
 Joe+0.009
 M+0.009
 Su+0.009
Neg logit contributions
ars-0.016
oster-0.015
ers-0.015
Elsa-0.014
bes-0.013
TokenI
Token ID40
Feature activation-3.66e-01

Pos logit contributions
ars+0.049
oster+0.043
ers+0.042
Elsa+0.037
bes+0.035
Neg logit contributions
 Suz-0.080
 Sus-0.080
 Joe-0.074
 Su-0.073
 microsc-0.072
Token that
Token ID326
Feature activation-8.31e-01

Pos logit contributions
ars+0.032
oster+0.029
ers+0.026
Elsa+0.026
ze+0.025
Neg logit contributions
 Sus-0.049
 Suz-0.049
 Joe-0.047
 Joey-0.045
 Su-0.044
Token day
Token ID1110
Feature activation-4.28e+00

Pos logit contributions
ars+0.023
oster+0.022
ers+0.021
Elsa+0.020
Finally+0.019
Neg logit contributions
 Suz-0.019
 Sus-0.019
 Joe-0.018
 microsc-0.017
 Su-0.017
Token on
Token ID319
Feature activation-5.46e-01

Pos logit contributions
ars+0.091
oster+0.087
ers+0.083
Elsa+0.078
bes+0.074
Neg logit contributions
 Suz-0.122
 Sus-0.121
 Joe-0.114
 Su-0.113
 J-0.109
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
ars+0.012
oster+0.011
ers+0.011
Elsa+0.010
ze+0.009
Neg logit contributions
 Suz-0.016
 Sus-0.015
 microsc-0.014
arter-0.014
 Su-0.014
Token Cr
Token ID3864
Feature activation+4.78e+00

Pos logit contributions
 Suz+0.021
 Sus+0.020
arter+0.019
 microsc+0.018
 Su+0.017
Neg logit contributions
ars-0.036
oster-0.033
ers-0.033
Elsa-0.031
Finally-0.030
Tokenust
Token ID436
Feature activation+7.35e+00

Pos logit contributions
 Suz+0.080
 Sus+0.079
 Su+0.066
 Joe+0.066
 microsc+0.066
Neg logit contributions
ars-0.166
oster-0.156
ers-0.153
Elsa-0.145
Finally-0.140
Tokeny
Token ID88
Feature activation+5.87e-01

Pos logit contributions
 Sus+0.183
 Suz+0.183
 microsc+0.175
 Su+0.170
 M+0.161
Neg logit contributions
ars-0.192
oster-0.179
ers-0.175
age-0.155
jected-0.152
Token tried
Token ID3088
Feature activation+4.52e-01

Pos logit contributions
 Sus+0.013
 Suz+0.013
 Joe+0.012
 M+0.012
 J+0.012
Neg logit contributions
ars-0.017
oster-0.015
ers-0.014
Elsa-0.014
Finally-0.014
Token his
Token ID465
Feature activation-3.58e-01

Pos logit contributions
 Suz+0.011
 Sus+0.011
arter+0.010
 Su+0.010
 microsc+0.010
Neg logit contributions
ars-0.011
ers-0.011
oster-0.010
Elsa-0.010
ze-0.010
Token best
Token ID1266
Feature activation+1.19e+00

Pos logit contributions
ars+0.009
oster+0.008
ers+0.008
Elsa+0.008
Finally+0.007
Neg logit contributions
 Suz-0.009
 Sus-0.009
 microsc-0.008
 Joe-0.008
arter-0.008
Token to
Token ID284
Feature activation-3.45e-01

Pos logit contributions
 Suz+0.033
 Sus+0.033
 microsc+0.031
arter+0.030
 Su+0.030
Neg logit contributions
ars-0.027
ers-0.024
oster-0.024
ze-0.023
age-0.022
Token made
Token ID925
Feature activation-1.16e+00

Pos logit contributions
ars+0.043
oster+0.040
ers+0.039
Elsa+0.037
ze+0.035
Neg logit contributions
 Suz-0.034
 Sus-0.034
 Joe-0.030
 Su-0.030
 microsc-0.029
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
ars+0.034
oster+0.033
jected+0.029
Elsa+0.029
Finally+0.028
Neg logit contributions
 Sus-0.029
 Suz-0.028
 Joe-0.027
 Su-0.026
 M-0.025
Token
Token ID220
Feature activation+7.99e-01

Pos logit contributions
ars+0.006
oster+0.006
ers+0.005
Elsa+0.005
ze+0.005
Neg logit contributions
 Suz-0.008
 Sus-0.008
 Joe-0.007
 Su-0.007
 microsc-0.007
Token\n
Token ID198
Feature activation-3.89e-01

Pos logit contributions
 Suz+0.021
 Sus+0.021
 Joe+0.019
 Su+0.019
 Mike+0.018
Neg logit contributions
ars-0.020
oster-0.020
ers-0.018
jected-0.018
ze-0.017
TokenThe
Token ID464
Feature activation-2.27e+00

Pos logit contributions
ars+0.009
ers+0.008
oster+0.008
jected+0.007
ze+0.007
Neg logit contributions
 Suz-0.011
 Sus-0.011
 Su-0.010
 Joe-0.010
 M-0.010
Token mer
Token ID4017
Feature activation+7.31e+00

Pos logit contributions
ars+0.051
ers+0.045
oster+0.044
bes+0.042
lor+0.041
Neg logit contributions
 Suz-0.064
 Sus-0.063
 Joe-0.058
 M-0.057
 Joey-0.056
Tokenmaid
Token ID23151
Feature activation+2.55e+00

Pos logit contributions
 Suz+0.161
 Sus+0.157
 Su+0.144
 microsc+0.143
 Joe+0.141
Neg logit contributions
ars-0.212
oster-0.202
ers-0.196
Elsa-0.183
ze-0.179
Token learnt
Token ID26338
Feature activation-4.50e-01

Pos logit contributions
 Suz+0.066
 Sus+0.065
 Joe+0.059
 Su+0.058
 microsc+0.057
Neg logit contributions
ars-0.065
oster-0.060
ers-0.058
Elsa-0.055
Finally-0.052
Token that
Token ID326
Feature activation-4.76e-01

Pos logit contributions
ars+0.011
oster+0.011
ers+0.010
Elsa+0.009
jected+0.009
Neg logit contributions
 Sus-0.012
 Suz-0.012
 Joe-0.011
 Su-0.011
 microsc-0.011
Token taking
Token ID2263
Feature activation-1.14e+00

Pos logit contributions
ars+0.012
oster+0.011
ers+0.011
Elsa+0.010
ze+0.009
Neg logit contributions
 Suz-0.012
 Sus-0.012
 Joe-0.011
 Su-0.011
 microsc-0.011
Token risks
Token ID7476
Feature activation+1.08e+00

Pos logit contributions
ars+0.028
oster+0.027
Elsa+0.024
ers+0.023
Finally+0.023
Neg logit contributions
 Sus-0.030
 Suz-0.030
 Joe-0.027
 Su-0.026
 microsc-0.026
Token sailed
Token ID33835
Feature activation-7.26e-01

Pos logit contributions
 Sus+0.022
 Suz+0.022
 Joe+0.021
 Su+0.020
 Mike+0.019
Neg logit contributions
ars-0.024
oster-0.022
Elsa-0.022
ers-0.021
Finally-0.020
Token for
Token ID329
Feature activation-1.89e+00

Pos logit contributions
ars+0.020
oster+0.019
ers+0.018
Elsa+0.018
ze+0.016
Neg logit contributions
 Suz-0.017
 Sus-0.017
 Joe-0.015
 Su-0.015
 microsc-0.014
Token days
Token ID1528
Feature activation+1.23e-01

Pos logit contributions
ars+0.042
oster+0.039
ers+0.037
Elsa+0.035
Finally+0.032
Neg logit contributions
 Suz-0.054
 Sus-0.054
 Joe-0.049
 Su-0.049
 microsc-0.049
Token,
Token ID11
Feature activation+9.80e-01

Pos logit contributions
 Sus+0.003
 Suz+0.003
 Joe+0.003
 Mike+0.003
 Su+0.003
Neg logit contributions
ars-0.003
oster-0.003
Elsa-0.003
ers-0.003
ze-0.003
Token exploring
Token ID13504
Feature activation+7.51e-01

Pos logit contributions
 Suz+0.023
 Sus+0.023
 Joe+0.021
 Su+0.021
 microsc+0.020
Neg logit contributions
ars-0.027
oster-0.026
ers-0.025
Elsa-0.024
Finally-0.022
Token unch
Token ID16258
Feature activation+7.31e+00

Pos logit contributions
 Suz+0.022
 Sus+0.022
arter+0.020
 Su+0.020
 microsc+0.020
Neg logit contributions
ars-0.016
ers-0.015
oster-0.014
Elsa-0.013
ze-0.012
Tokenarted
Token ID19112
Feature activation-3.68e-01

Pos logit contributions
 Suz+0.173
 Sus+0.171
 microsc+0.156
 M+0.151
 Nice+0.151
Neg logit contributions
ars-0.186
ers-0.175
oster-0.173
jected-0.156
Elsa-0.155
Token waters
Token ID10150
Feature activation+2.14e+00

Pos logit contributions
ars+0.009
oster+0.009
Elsa+0.008
ers+0.008
Finally+0.008
Neg logit contributions
 Sus-0.009
 Suz-0.009
 Joe-0.009
 Su-0.009
 microsc-0.008
Token until
Token ID1566
Feature activation-4.70e-01

Pos logit contributions
 Suz+0.060
 Sus+0.060
 Joe+0.055
 Su+0.054
 microsc+0.054
Neg logit contributions
ars-0.049
oster-0.045
ers-0.043
Elsa-0.042
Finally-0.038
Token he
Token ID339
Feature activation+6.12e-02

Pos logit contributions
ars+0.012
ers+0.011
oster+0.011
Elsa+0.009
Finally+0.009
Neg logit contributions
 Suz-0.012
 Sus-0.011
arter-0.011
 microsc-0.011
 Nice-0.010
Token came
Token ID1625
Feature activation+3.12e+00

Pos logit contributions
 Suz+0.002
 Sus+0.002
 Joe+0.001
 Su+0.001
 microsc+0.001
Neg logit contributions
ars-0.002
oster-0.001
ers-0.001
Elsa-0.001
Finally-0.001
Token the
Token ID262
Feature activation-3.24e-01

Pos logit contributions
ars+0.007
oster+0.006
ers+0.006
Elsa+0.006
ze+0.005
Neg logit contributions
 Suz-0.008
 Sus-0.008
 microsc-0.007
 Joe-0.007
 Su-0.007
Token garden
Token ID11376
Feature activation+7.08e-01

Pos logit contributions
ars+0.010
oster+0.009
ers+0.009
bes+0.009
Elsa+0.008
Neg logit contributions
 Suz-0.007
 Sus-0.006
 microsc-0.006
 Jill-0.006
 M-0.006
Token shop
Token ID6128
Feature activation+1.22e+00

Pos logit contributions
 Suz+0.017
 Sus+0.017
 Joe+0.016
 Su+0.015
 microsc+0.015
Neg logit contributions
ars-0.020
oster-0.018
Elsa-0.017
ers-0.017
Finally-0.016
Token.
Token ID13
Feature activation+8.03e-01

Pos logit contributions
 Sus+0.032
 Suz+0.031
 Joe+0.029
 Su+0.028
 Mike+0.027
Neg logit contributions
ars-0.034
oster-0.032
Elsa-0.029
ers-0.029
Finally-0.028
Token The
Token ID383
Feature activation-1.20e+00

Pos logit contributions
 Sus+0.020
 Suz+0.020
 Joe+0.018
 Su+0.018
 M+0.017
Neg logit contributions
ars-0.022
oster-0.021
ers-0.020
Elsa-0.018
ze-0.018
Token gard
Token ID16985
Feature activation+7.28e+00

Pos logit contributions
ars+0.039
oster+0.036
ers+0.035
lor+0.034
Elsa+0.034
Neg logit contributions
 Suz-0.023
 Sus-0.022
 M-0.020
 Jill-0.019
 microsc-0.019
Tokenener
Token ID877
Feature activation+2.81e+00

Pos logit contributions
 Suz+0.195
 Sus+0.190
 Joe+0.167
 Su+0.167
arter+0.164
Neg logit contributions
ars-0.195
oster-0.177
ers-0.164
Elsa-0.161
Finally-0.160
Token gave
Token ID2921
Feature activation+1.13e-01

Pos logit contributions
 Suz+0.059
 Sus+0.058
 Joe+0.052
 Su+0.050
 M+0.047
Neg logit contributions
ars-0.087
oster-0.080
Elsa-0.079
ers-0.075
Finally-0.073
Token her
Token ID607
Feature activation-3.10e-01

Pos logit contributions
 Sus+0.002
 Suz+0.002
 M+0.002
 Joe+0.002
 Su+0.002
Neg logit contributions
ars-0.004
oster-0.004
bes-0.004
Finally-0.003
age-0.003
Token some
Token ID617
Feature activation+1.40e+00

Pos logit contributions
ars+0.009
oster+0.008
ers+0.007
Elsa+0.007
Finally+0.007
Neg logit contributions
 Suz-0.008
 Sus-0.008
 Joe-0.007
 M-0.007
 Su-0.007
Token special
Token ID2041
Feature activation+5.51e-01

Pos logit contributions
 Suz+0.035
 Sus+0.035
 Joe+0.031
 microsc+0.031
 Su+0.031
Neg logit contributions
ars-0.036
oster-0.034
ers-0.032
Elsa-0.032
jected-0.030
Token are
Token ID389
Feature activation-3.39e-01

Pos logit contributions
ars+0.064
oster+0.061
ers+0.058
Elsa+0.056
ze+0.053
Neg logit contributions
 Suz-0.049
 Sus-0.049
 Joe-0.043
 microsc-0.043
 Su-0.042
Token my
Token ID616
Feature activation-1.38e+00

Pos logit contributions
ars+0.009
oster+0.008
ers+0.008
Elsa+0.007
ze+0.007
Neg logit contributions
 Sus-0.009
 Suz-0.009
 Joe-0.008
 Su-0.008
 microsc-0.008
Token lovely
Token ID14081
Feature activation-5.41e-01

Pos logit contributions
ars+0.032
oster+0.029
ers+0.028
Elsa+0.027
ze+0.025
Neg logit contributions
 Suz-0.038
 Sus-0.038
 Su-0.034
 Joe-0.034
 microsc-0.034
Token girl
Token ID2576
Feature activation-1.75e+00

Pos logit contributions
ars+0.012
ers+0.012
oster+0.011
Elsa+0.010
ze+0.010
Neg logit contributions
 Suz-0.015
 Sus-0.014
 microsc-0.014
 Su-0.013
 Joey-0.013
Token."
Token ID526
Feature activation-2.00e+00

Pos logit contributions
ars+0.046
oster+0.044
Finally+0.040
ers+0.039
Elsa+0.039
Neg logit contributions
 Sus-0.043
 Suz-0.043
 Joe-0.039
 M-0.038
 Su-0.038
Token<|endoftext|>
Token ID50256
Feature activation-6.99e+00

Pos logit contributions
oster+0.051
ars+0.051
ers+0.047
ze+0.044
jected+0.042
Neg logit contributions
 Suz-0.053
 Sus-0.053
 M-0.047
 Su-0.047
 Joe-0.047
TokenL
Token ID43
Feature activation-1.33e+00

Pos logit contributions
ars+0.176
oster+0.173
ers+0.151
ze+0.151
Finally+0.149
Neg logit contributions
 Suz-0.173
 Sus-0.172
 Joe-0.160
 M-0.157
 microsc-0.157
Tokenily
Token ID813
Feature activation-8.36e-01

Pos logit contributions
ars+0.025
oster+0.022
ers+0.021
Elsa+0.020
ze+0.018
Neg logit contributions
 Suz-0.043
 Sus-0.043
 Joe-0.040
 Su-0.039
 microsc-0.039
Token and
Token ID290
Feature activation-3.58e+00

Pos logit contributions
ars+0.020
oster+0.019
ers+0.017
Elsa+0.017
Finally+0.017
Neg logit contributions
 Sus-0.023
 Suz-0.023
 Joe-0.021
 Su-0.021
 M-0.020
Token Tom
Token ID4186
Feature activation-3.60e-01

Pos logit contributions
ars+0.095
oster+0.086
ers+0.085
Elsa+0.083
ze+0.077
Neg logit contributions
 Suz-0.085
 Sus-0.084
 microsc-0.078
arter-0.078
 Su-0.074
Token are
Token ID389
Feature activation-1.37e+00

Pos logit contributions
ars+0.010
oster+0.009
ers+0.009
Elsa+0.008
ze+0.008
Neg logit contributions
 Suz-0.009
 Sus-0.009
 Joe-0.008
 Su-0.008
 microsc-0.008
Token fighting
Token ID4330
Feature activation+2.32e+00

Pos logit contributions
ars+0.060
oster+0.054
ers+0.052
Elsa+0.049
Finally+0.046
Neg logit contributions
 Suz-0.077
 Sus-0.076
 Su-0.070
 Joe-0.069
 microsc-0.069
Token is
Token ID318
Feature activation-1.33e-01

Pos logit contributions
 Suz+0.069
 Sus+0.068
 Su+0.062
 Joe+0.062
 microsc+0.062
Neg logit contributions
ars-0.051
oster-0.045
ers-0.044
Elsa-0.041
ze-0.038
Token not
Token ID407
Feature activation+3.87e+00

Pos logit contributions
ars+0.003
oster+0.003
ers+0.003
Elsa+0.003
ze+0.002
Neg logit contributions
 Suz-0.004
 Sus-0.004
 Joe-0.003
 microsc-0.003
 Su-0.003
Token nice
Token ID3621
Feature activation+3.29e-02

Pos logit contributions
 Suz+0.115
 Sus+0.109
 microsc+0.103
 Su+0.102
arter+0.100
Neg logit contributions
ars-0.081
ers-0.073
oster-0.070
Elsa-0.064
ze-0.064
Token.
Token ID13
Feature activation+6.56e-01

Pos logit contributions
 Suz+0.001
 Sus+0.001
 Su+0.001
 microsc+0.001
arter+0.001
Neg logit contributions
ars-0.001
ers-0.001
oster-0.000
ze-0.000
'-0.000
Token<|endoftext|>
Token ID50256
Feature activation-6.99e+00

Pos logit contributions
 Suz+0.017
 Sus+0.017
 microsc+0.016
 Joe+0.016
 Su+0.016
Neg logit contributions
ars-0.016
oster-0.015
ers-0.014
Elsa-0.013
Finally-0.013
TokenL
Token ID43
Feature activation-1.26e+00

Pos logit contributions
ars+0.186
oster+0.182
ze+0.162
ers+0.160
Finally+0.158
Neg logit contributions
 Sus-0.164
 Suz-0.162
 Joe-0.153
 M-0.149
 microsc-0.146
Tokenila
Token ID10102
Feature activation-9.65e-01

Pos logit contributions
ars+0.033
oster+0.031
ers+0.029
Elsa+0.028
ze+0.027
Neg logit contributions
 Suz-0.032
 Sus-0.032
 Su-0.029
 Joe-0.029
 microsc-0.028
Token and
Token ID290
Feature activation-3.37e+00

Pos logit contributions
ars+0.024
oster+0.023
Finally+0.021
Elsa+0.020
ers+0.020
Neg logit contributions
 Sus-0.025
 Suz-0.025
 Joe-0.024
 J-0.023
 Su-0.022
Token Tom
Token ID4186
Feature activation-1.20e-01

Pos logit contributions
ars+0.094
oster+0.088
ers+0.086
Elsa+0.080
Finally+0.077
Neg logit contributions
 Suz-0.080
 Sus-0.079
 Joe-0.072
 Su-0.071
 microsc-0.069
Token are
Token ID389
Feature activation-1.27e+00

Pos logit contributions
ars+0.003
oster+0.003
ers+0.003
Elsa+0.003
Finally+0.003
Neg logit contributions
 Suz-0.003
 Sus-0.003
 Joe-0.002
 Su-0.002
 microsc-0.002
Token Happy
Token ID14628
Feature activation-4.21e-01

Pos logit contributions
 Suz+0.005
 Sus+0.004
 microsc+0.004
arter+0.004
 Joe+0.004
Neg logit contributions
ars-0.005
oster-0.005
ers-0.005
Elsa-0.004
Finally-0.004
Token birthday
Token ID10955
Feature activation-8.83e-01

Pos logit contributions
ars+0.014
oster+0.014
ers+0.013
Elsa+0.012
jected+0.012
Neg logit contributions
 Sus-0.007
 Suz-0.007
 Su-0.006
arter-0.006
 Joe-0.006
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
ars+0.022
oster+0.020
ers+0.019
Elsa+0.018
Finally+0.018
Neg logit contributions
 Suz-0.023
 Sus-0.023
 Joe-0.021
 Su-0.021
 microsc-0.020
Token me
Token ID502
Feature activation-1.73e-01

Pos logit contributions
 Suz+0.004
 Sus+0.004
 microsc+0.004
arter+0.004
 Joe+0.004
Neg logit contributions
ars-0.006
oster-0.005
ers-0.005
Elsa-0.005
ze-0.005
Token."
Token ID526
Feature activation-2.05e+00

Pos logit contributions
ars+0.005
oster+0.005
Elsa+0.004
Finally+0.004
Eventually+0.004
Neg logit contributions
 Sus-0.004
 Suz-0.004
 Joe-0.003
 Jill-0.003
arter-0.003
Token<|endoftext|>
Token ID50256
Feature activation-6.99e+00

Pos logit contributions
ars+0.047
oster+0.043
ers+0.042
Elsa+0.040
Finally+0.037
Neg logit contributions
 Suz-0.057
 Sus-0.057
 Joe-0.052
 microsc-0.052
 Su-0.051
TokenL
Token ID43
Feature activation-1.38e+00

Pos logit contributions
ars+0.178
oster+0.177
ze+0.156
Finally+0.154
ers+0.154
Neg logit contributions
 Suz-0.166
 Sus-0.165
 Joe-0.154
 microsc-0.152
 M-0.152
Tokenily
Token ID813
Feature activation-5.42e-01

Pos logit contributions
ars+0.030
oster+0.027
ers+0.026
Elsa+0.024
ze+0.023
Neg logit contributions
 Suz-0.041
 Sus-0.040
 Joe-0.037
 Su-0.037
 microsc-0.037
Token liked
Token ID8288
Feature activation-3.13e+00

Pos logit contributions
ars+0.013
oster+0.012
ers+0.011
Elsa+0.010
Finally+0.010
Neg logit contributions
 Sus-0.015
 Suz-0.015
 Joe-0.014
 Su-0.014
 M-0.013
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
ars+0.064
ers+0.062
oster+0.058
Elsa+0.057
ze+0.053
Neg logit contributions
 Suz-0.084
 Sus-0.080
arter-0.080
 Nice-0.072
 Joe-0.072
Token play
Token ID711
Feature activation-3.70e+00

Pos logit contributions
ars+0.035
oster+0.032
ers+0.031
Elsa+0.030
ze+0.029
Neg logit contributions
 Suz-0.038
 Sus-0.038
arter-0.035
 microsc-0.035
 Su-0.034
Token beautiful
Token ID4950
Feature activation+1.25e+00

Pos logit contributions
 Suz+0.060
 Sus+0.060
 Joe+0.053
 microsc+0.053
 Su+0.053
Neg logit contributions
ars-0.060
oster-0.055
ers-0.054
Elsa-0.050
ze-0.048
Token basket
Token ID7988
Feature activation-9.48e-01

Pos logit contributions
 Sus+0.030
 Suz+0.030
 Su+0.026
 Joe+0.026
 microsc+0.025
Neg logit contributions
ars-0.036
oster-0.032
ers-0.031
Elsa-0.030
jected-0.029
Token of
Token ID286
Feature activation-2.67e-01

Pos logit contributions
ars+0.026
oster+0.024
ers+0.022
Elsa+0.022
Finally+0.022
Neg logit contributions
 Sus-0.022
 Suz-0.022
 Joe-0.020
 Su-0.020
arter-0.019
Token flowers
Token ID12734
Feature activation-9.34e-01

Pos logit contributions
ars+0.007
oster+0.006
ers+0.006
Elsa+0.006
ze+0.005
Neg logit contributions
 Suz-0.007
 Sus-0.007
arter-0.006
 Joe-0.006
 Su-0.006
Token."
Token ID526
Feature activation-1.90e+00

Pos logit contributions
ars+0.024
oster+0.022
Finally+0.020
ers+0.020
Elsa+0.020
Neg logit contributions
 Sus-0.023
 Suz-0.023
 Joe-0.021
 microsc-0.021
 Su-0.021
Token<|endoftext|>
Token ID50256
Feature activation-6.98e+00

Pos logit contributions
ars+0.046
oster+0.043
ers+0.041
Elsa+0.037
ze+0.037
Neg logit contributions
 Suz-0.053
 Sus-0.053
 Su-0.047
 Joe-0.047
 microsc-0.047
TokenL
Token ID43
Feature activation-1.32e+00

Pos logit contributions
ars+0.183
oster+0.182
ze+0.162
ers+0.159
Finally+0.158
Neg logit contributions
 Suz-0.163
 Sus-0.162
 Joe-0.152
 M-0.149
 microsc-0.148
Tokenily
Token ID813
Feature activation-6.30e-01

Pos logit contributions
ars+0.027
oster+0.025
ers+0.023
Elsa+0.022
ze+0.021
Neg logit contributions
 Suz-0.041
 Sus-0.041
 Su-0.037
 microsc-0.037
 Joe-0.037
Token and
Token ID290
Feature activation-3.34e+00

Pos logit contributions
ars+0.014
oster+0.013
ers+0.012
Elsa+0.012
Finally+0.012
Neg logit contributions
 Sus-0.018
 Suz-0.018
 Joe-0.017
 Su-0.016
 M-0.016
Token Max
Token ID5436
Feature activation-7.31e-01

Pos logit contributions
ars+0.082
oster+0.074
ers+0.073
Elsa+0.070
ze+0.065
Neg logit contributions
 Suz-0.088
 Sus-0.086
 microsc-0.080
arter-0.079
 Su-0.077
Token love
Token ID1842
Feature activation-2.51e+00

Pos logit contributions
ars+0.019
oster+0.018
ers+0.018
Elsa+0.017
ze+0.016
Neg logit contributions
 Suz-0.018
 Sus-0.018
 Joe-0.016
 microsc-0.016
 Su-0.016
Token you
Token ID345
Feature activation-1.83e+00

Pos logit contributions
ars+0.030
oster+0.026
ers+0.026
Elsa+0.023
ze+0.021
Neg logit contributions
 Suz-0.062
 Sus-0.061
 Su-0.057
 Joe-0.057
 microsc-0.056
Token too
Token ID1165
Feature activation-9.02e-01

Pos logit contributions
ars+0.047
oster+0.045
ers+0.042
Elsa+0.040
ze+0.038
Neg logit contributions
 Sus-0.046
 Suz-0.046
 Joe-0.042
 M-0.041
 microsc-0.040
Token,
Token ID11
Feature activation+5.06e-01

Pos logit contributions
ars+0.013
"+0.011
ers+0.010
Finally+0.009
'+0.009
Neg logit contributions
 Suz-0.032
 Sus-0.031
 microsc-0.030
 Su-0.029
arter-0.029
Token Lily
Token ID20037
Feature activation-9.16e-01

Pos logit contributions
 Suz+0.011
 Sus+0.011
 microsc+0.010
 Joe+0.010
arter+0.010
Neg logit contributions
ars-0.014
ers-0.013
oster-0.013
Elsa-0.013
Finally-0.012
Token."
Token ID526
Feature activation-2.02e+00

Pos logit contributions
ars+0.029
oster+0.028
Finally+0.026
Eventually+0.025
Elsa+0.025
Neg logit contributions
 Sus-0.018
 Suz-0.017
 M-0.016
 Joe-0.016
 Jill-0.015
Token<|endoftext|>
Token ID50256
Feature activation-6.97e+00

Pos logit contributions
ars+0.056
oster+0.056
ers+0.051
ze+0.049
jected+0.047
Neg logit contributions
 Suz-0.049
 Sus-0.049
 M-0.044
 Jill-0.043
 Su-0.043
TokenAnna
Token ID31160
Feature activation-9.54e-01

Pos logit contributions
ars+0.175
oster+0.172
ers+0.152
ze+0.150
Elsa+0.148
Neg logit contributions
 Suz-0.174
 Sus-0.173
 Joe-0.161
 microsc-0.158
 M-0.157
Token and
Token ID290
Feature activation-3.66e+00

Pos logit contributions
ars+0.023
oster+0.021
Finally+0.019
ers+0.019
Elsa+0.019
Neg logit contributions
 Sus-0.026
 Suz-0.026
 Joe-0.024
 Su-0.024
 M-0.023
Token Ben
Token ID3932
Feature activation-1.61e+00

Pos logit contributions
ars+0.094
oster+0.085
ers+0.083
Elsa+0.082
ze+0.076
Neg logit contributions
 Suz-0.090
 Sus-0.088
 microsc-0.082
arter-0.081
 Su-0.079
Token are
Token ID389
Feature activation-1.36e+00

Pos logit contributions
ars+0.043
oster+0.040
ers+0.038
Elsa+0.036
ze+0.034
Neg logit contributions
 Suz-0.040
 Sus-0.039
 Joe-0.035
 Su-0.035
 microsc-0.035
Token twins
Token ID20345
Feature activation-1.25e+00

Pos logit contributions
ars+0.042
oster+0.040
ers+0.039
Elsa+0.036
ze+0.035
Neg logit contributions
 Sus-0.028
 Suz-0.028
 Joe-0.025
 Su-0.025
 microsc-0.025
Token new
Token ID649
Feature activation-3.45e-01

Pos logit contributions
ars+0.044
oster+0.042
ers+0.041
Elsa+0.039
ze+0.038
Neg logit contributions
 Suz-0.033
 Sus-0.032
 Joe-0.029
 Su-0.029
arter-0.029
Token friend
Token ID1545
Feature activation-6.04e-01

Pos logit contributions
ars+0.008
ers+0.007
oster+0.007
Elsa+0.007
ze+0.007
Neg logit contributions
 Suz-0.009
arter-0.009
 Sus-0.009
 Su-0.008
 Joe-0.008
Token,
Token ID11
Feature activation+1.85e-01

Pos logit contributions
ars+0.017
oster+0.016
Finally+0.015
Eventually+0.015
Elsa+0.014
Neg logit contributions
 Sus-0.014
 Suz-0.013
 Joe-0.012
 M-0.012
 Jill-0.012
Token too
Token ID1165
Feature activation-6.34e-01

Pos logit contributions
 Suz+0.005
 Sus+0.005
arter+0.004
 microsc+0.004
 Joe+0.004
Neg logit contributions
ars-0.004
oster-0.004
Elsa-0.004
ers-0.004
ze-0.003
Token."
Token ID526
Feature activation-2.48e+00

Pos logit contributions
ars+0.012
oster+0.011
ers+0.011
Elsa+0.010
ze+0.009
Neg logit contributions
 Suz-0.020
 Sus-0.019
 microsc-0.018
 Su-0.018
 Joe-0.018
Token<|endoftext|>
Token ID50256
Feature activation-6.97e+00

Pos logit contributions
ars+0.048
oster+0.042
ers+0.041
Elsa+0.038
Finally+0.035
Neg logit contributions
 Suz-0.079
 Sus-0.078
 microsc-0.072
arter-0.072
 Joe-0.072
TokenAnna
Token ID31160
Feature activation-8.78e-01

Pos logit contributions
ars+0.179
oster+0.176
ers+0.154
ze+0.154
Elsa+0.152
Neg logit contributions
 Suz-0.170
 Sus-0.169
 Joe-0.158
 M-0.152
 microsc-0.151
Token and
Token ID290
Feature activation-3.77e+00

Pos logit contributions
ars+0.019
oster+0.018
ers+0.016
Elsa+0.016
Finally+0.016
Neg logit contributions
 Sus-0.026
 Suz-0.026
 Joe-0.024
 Su-0.023
 M-0.023
Token Ben
Token ID3932
Feature activation-1.41e+00

Pos logit contributions
ars+0.100
oster+0.091
ers+0.089
Elsa+0.086
ze+0.082
Neg logit contributions
 Suz-0.089
 Sus-0.088
 microsc-0.081
arter-0.080
 Su-0.077
Token were
Token ID547
Feature activation-9.53e-01

Pos logit contributions
ars+0.037
oster+0.034
ers+0.034
Elsa+0.032
ze+0.030
Neg logit contributions
 Suz-0.035
 Sus-0.034
 microsc-0.031
arter-0.031
 Su-0.031
Token playing
Token ID2712
Feature activation-4.75e+00

Pos logit contributions
ars+0.028
oster+0.027
ers+0.025
Elsa+0.024
Finally+0.023
Neg logit contributions
 Sus-0.021
 Suz-0.021
 Joe-0.019
 Su-0.018
 M-0.018
Token Anna
Token ID11735
Feature activation-1.91e+00

Pos logit contributions
ars+0.033
oster+0.030
ers+0.029
Elsa+0.026
Finally+0.026
Neg logit contributions
 Sus-0.036
 Suz-0.036
 microsc-0.032
 Joe-0.032
 Su-0.032
Token and
Token ID290
Feature activation-3.03e+00

Pos logit contributions
ars+0.056
oster+0.055
Finally+0.050
Eventually+0.048
Elsa+0.047
Neg logit contributions
 Sus-0.039
 Suz-0.037
 M-0.036
 Nice-0.033
 Jill-0.033
Token Elsa
Token ID19226
Feature activation-9.25e-01

Pos logit contributions
ars+0.076
oster+0.069
ers+0.068
Elsa+0.067
ze+0.061
Neg logit contributions
 Suz-0.078
 Sus-0.076
 Su-0.069
 Joe-0.069
 microsc-0.068
Token too
Token ID1165
Feature activation-7.09e-01

Pos logit contributions
ars+0.028
oster+0.028
Finally+0.025
Eventually+0.025
ers+0.024
Neg logit contributions
 Sus-0.017
 M-0.016
 Suz-0.015
 microsc-0.015
 Nice-0.014
Token."
Token ID526
Feature activation-2.33e+00

Pos logit contributions
ars+0.014
ers+0.012
oster+0.012
Elsa+0.011
ze+0.010
Neg logit contributions
 Suz-0.023
 Sus-0.022
 Su-0.021
 Joe-0.020
 microsc-0.020
Token<|endoftext|>
Token ID50256
Feature activation-6.94e+00

Pos logit contributions
ars+0.055
oster+0.052
ers+0.049
ze+0.043
Elsa+0.043
Neg logit contributions
 Sus-0.066
 Suz-0.066
 Joe-0.060
 microsc-0.059
 Su-0.059
TokenL
Token ID43
Feature activation-1.50e+00

Pos logit contributions
ars+0.178
oster+0.176
ze+0.155
ers+0.154
Finally+0.152
Neg logit contributions
 Suz-0.165
 Sus-0.165
 Joe-0.154
 M-0.152
 microsc-0.151
Tokenily
Token ID813
Feature activation-6.77e-01

Pos logit contributions
ars+0.031
oster+0.027
ers+0.027
Elsa+0.025
Finally+0.023
Neg logit contributions
 Suz-0.046
 Sus-0.046
 Joe-0.042
 microsc-0.042
 Su-0.042
Token and
Token ID290
Feature activation-3.42e+00

Pos logit contributions
ars+0.015
oster+0.014
ers+0.013
Finally+0.013
Elsa+0.012
Neg logit contributions
 Sus-0.019
 Suz-0.019
 Joe-0.018
 Su-0.017
 M-0.017
Token Ben
Token ID3932
Feature activation-1.33e+00

Pos logit contributions
ars+0.082
oster+0.073
ers+0.072
Elsa+0.072
ze+0.066
Neg logit contributions
 Suz-0.088
 Sus-0.086
 microsc-0.082
arter-0.082
 Su-0.077
Token were
Token ID547
Feature activation-9.99e-01

Pos logit contributions
ars+0.036
oster+0.034
ers+0.033
Elsa+0.031
Finally+0.030
Neg logit contributions
 Suz-0.031
 Sus-0.031
 Joe-0.028
 Su-0.028
 microsc-0.028
Token read
Token ID1100
Feature activation-1.06e-01

Pos logit contributions
ars+0.010
oster+0.009
ers+0.009
Elsa+0.008
jected+0.008
Neg logit contributions
 Suz-0.007
 Sus-0.007
 Joe-0.006
 Su-0.006
 M-0.006
Token a
Token ID257
Feature activation+1.33e+00

Pos logit contributions
ars+0.003
oster+0.002
ers+0.002
Elsa+0.002
jected+0.002
Neg logit contributions
 Suz-0.003
 Sus-0.003
 Joe-0.003
 microsc-0.003
 Su-0.003
Token book
Token ID1492
Feature activation-1.84e+00

Pos logit contributions
 Suz+0.038
 Sus+0.038
 microsc+0.036
 Su+0.035
 Joe+0.035
Neg logit contributions
ars-0.029
oster-0.027
ers-0.026
Elsa-0.023
jected-0.023
Token together
Token ID1978
Feature activation+4.36e-01

Pos logit contributions
ars+0.048
oster+0.046
Elsa+0.043
ers+0.041
ze+0.040
Neg logit contributions
 Sus-0.044
 Suz-0.043
 Jill-0.039
 Joe-0.038
 Su-0.038
Token."
Token ID526
Feature activation-2.17e+00

Pos logit contributions
 Sus+0.011
 Suz+0.010
 Joe+0.010
 M+0.009
 Su+0.009
Neg logit contributions
ars-0.012
oster-0.011
Elsa-0.010
ers-0.010
Finally-0.009
Token<|endoftext|>
Token ID50256
Feature activation-6.93e+00

Pos logit contributions
ars+0.053
oster+0.051
ers+0.048
ze+0.044
jected+0.043
Neg logit contributions
 Suz-0.060
 Sus-0.060
 Su-0.054
 Joe-0.053
 M-0.053
TokenTom
Token ID13787
Feature activation+1.24e+00

Pos logit contributions
ars+0.179
oster+0.178
ze+0.157
ers+0.156
Finally+0.154
Neg logit contributions
 Suz-0.164
 Sus-0.163
 Joe-0.153
 M-0.150
 microsc-0.149
Token and
Token ID290
Feature activation-2.80e+00

Pos logit contributions
 Suz+0.032
 Sus+0.031
 microsc+0.029
 Su+0.028
arter+0.028
Neg logit contributions
ars-0.031
ers-0.028
oster-0.028
Elsa-0.026
ze-0.024
Token Lily
Token ID20037
Feature activation-1.19e+00

Pos logit contributions
ars+0.068
oster+0.061
ers+0.060
Elsa+0.059
age+0.054
Neg logit contributions
arter-0.068
 Suz-0.067
 microsc-0.066
 Sus-0.066
 Nice-0.059
Token were
Token ID547
Feature activation-1.04e+00

Pos logit contributions
ars+0.032
oster+0.030
ers+0.029
Elsa+0.027
Finally+0.026
Neg logit contributions
 Suz-0.029
 Sus-0.029
 Joe-0.026
 Su-0.026
 microsc-0.025
Token best
Token ID1266
Feature activation-2.54e+00

Pos logit contributions
ars+0.034
oster+0.033
ers+0.031
Elsa+0.029
ze+0.029
Neg logit contributions
 Sus-0.020
 Suz-0.020
 Joe-0.017
 Su-0.017
 microsc-0.017
Token want
Token ID765
Feature activation-1.60e+00

Pos logit contributions
ars+0.054
oster+0.049
ers+0.049
Elsa+0.045
jected+0.045
Neg logit contributions
 Suz-0.044
 Sus-0.044
 Joe-0.040
 Su-0.039
arter-0.039
Token more
Token ID517
Feature activation+1.08e-01

Pos logit contributions
ars+0.039
ers+0.035
oster+0.035
Elsa+0.033
ze+0.031
Neg logit contributions
 Suz-0.042
 Sus-0.041
arter-0.039
 Su-0.038
 microsc-0.038
Token squash
Token ID34613
Feature activation-9.85e-01

Pos logit contributions
 Suz+0.003
 Sus+0.003
 Su+0.003
 microsc+0.003
 Joe+0.003
Neg logit contributions
ars-0.002
oster-0.002
ers-0.002
Elsa-0.002
ze-0.002
Token."
Token ID526
Feature activation-2.05e+00

Pos logit contributions
ars+0.027
oster+0.025
ers+0.023
Elsa+0.023
Finally+0.023
Neg logit contributions
 Suz-0.023
 Sus-0.023
 Joe-0.021
 microsc-0.020
 Su-0.020
Token<|endoftext|>
Token ID50256
Feature activation-6.66e+00

Pos logit contributions
ars+0.045
oster+0.043
ers+0.041
ze+0.036
Elsa+0.036
Neg logit contributions
 Suz-0.061
 Sus-0.061
 Joe-0.055
 Su-0.055
 microsc-0.054
Token<|endoftext|>
Token ID50256
Feature activation-6.92e+00

Pos logit contributions
oster+0.179
ars+0.179
ze+0.159
Finally+0.157
ers+0.155
Neg logit contributions
 Suz-0.148
 Sus-0.147
 Joe-0.138
 M-0.138
 microsc-0.134
Token<|endoftext|>
Token ID50256
Feature activation-6.88e+00

Pos logit contributions
ars+0.187
oster+0.183
ers+0.160
ze+0.159
Finally+0.158
Neg logit contributions
 Suz-0.161
 Sus-0.159
 Joe-0.150
 M-0.148
 Su-0.144
TokenL
Token ID43
Feature activation-1.32e+00

Pos logit contributions
ars+0.186
oster+0.181
ers+0.159
ze+0.156
Elsa+0.155
Neg logit contributions
 Suz-0.162
 Sus-0.159
 Joe-0.151
 M-0.148
 Su-0.144
Tokenily
Token ID813
Feature activation-8.37e-01

Pos logit contributions
ars+0.034
oster+0.032
ers+0.029
Elsa+0.028
jected+0.027
Neg logit contributions
 Suz-0.034
 Sus-0.034
 Joe-0.031
 Su-0.031
 microsc-0.030
Token and
Token ID290
Feature activation-3.69e+00

Pos logit contributions
ars+0.016
oster+0.014
ers+0.013
Elsa+0.012
Finally+0.011
Neg logit contributions
 Suz-0.027
 Sus-0.027
 Joe-0.025
 Su-0.025
 microsc-0.025
Token Sam
Token ID3409
Feature activation-1.67e-01

Pos logit contributions
ars+0.111
oster+0.108
ers+0.106
Elsa+0.095
bes+0.093
Neg logit contributions
 Suz-0.087
 Sus-0.085
 Joe-0.081
 Su-0.077
 Joey-0.075
Token lonely
Token ID21757
Feature activation-2.45e+00

Pos logit contributions
ars+0.001
oster+0.001
ers+0.001
Elsa+0.001
Finally+0.001
Neg logit contributions
 Sus-0.001
 Suz-0.001
arter-0.001
 Joe-0.001
 Su-0.001
Token.
Token ID13
Feature activation+7.30e-01

Pos logit contributions
ars+0.057
oster+0.052
ers+0.050
Elsa+0.047
Finally+0.044
Neg logit contributions
 Suz-0.069
 Sus-0.068
 Joe-0.062
 microsc-0.062
 Su-0.061
Token The
Token ID383
Feature activation-2.06e+00

Pos logit contributions
 Suz+0.025
 Sus+0.025
 microsc+0.023
 Su+0.023
arter+0.023
Neg logit contributions
ars-0.012
oster-0.010
Elsa-0.010
ers-0.009
Finally-0.008
Token end
Token ID886
Feature activation+3.89e-01

Pos logit contributions
ars+0.041
oster+0.038
ers+0.036
Elsa+0.033
ze+0.031
Neg logit contributions
 Suz-0.064
 Sus-0.063
 microsc-0.060
 Joe-0.059
arter-0.058
Token.
Token ID13
Feature activation+5.60e-01

Pos logit contributions
 Suz+0.012
 Sus+0.012
 Su+0.011
 Joe+0.011
 microsc+0.011
Neg logit contributions
ars-0.008
ers-0.007
oster-0.007
Elsa-0.006
ze-0.006
Token<|endoftext|>
Token ID50256
Feature activation-6.92e+00

Pos logit contributions
 Suz+0.019
 Sus+0.019
 microsc+0.017
 Su+0.017
 Joe+0.017
Neg logit contributions
ars-0.010
oster-0.008
ers-0.008
Elsa-0.008
Finally-0.007
TokenL
Token ID43
Feature activation-1.10e+00

Pos logit contributions
ars+0.188
oster+0.185
ze+0.169
Finally+0.166
Eventually+0.164
Neg logit contributions
 Sus-0.150
 Suz-0.149
 Joe-0.140
 M-0.139
 Mike-0.135
Tokenily
Token ID813
Feature activation-2.16e-01

Pos logit contributions
ars+0.026
oster+0.024
ers+0.023
Elsa+0.022
ze+0.021
Neg logit contributions
 Suz-0.030
 Sus-0.030
 Joe-0.027
 Su-0.027
 microsc-0.027
Token and
Token ID290
Feature activation-3.49e+00

Pos logit contributions
ars+0.005
oster+0.005
ers+0.005
Elsa+0.005
Finally+0.005
Neg logit contributions
 Sus-0.006
 Suz-0.006
 Joe-0.005
 Su-0.005
 microsc-0.005
Token Tom
Token ID4186
Feature activation-8.11e-02

Pos logit contributions
ars+0.097
oster+0.091
ers+0.088
Elsa+0.082
Finally+0.079
Neg logit contributions
 Suz-0.084
 Sus-0.083
 Joe-0.075
 Su-0.074
 microsc-0.072
Token were
Token ID547
Feature activation-8.30e-01

Pos logit contributions
ars+0.002
oster+0.002
ers+0.002
Elsa+0.002
ze+0.002
Neg logit contributions
 Suz-0.002
 Sus-0.002
 microsc-0.002
 Joe-0.002
 Su-0.002
Token sweet
Token ID6029
Feature activation+1.61e-01

Pos logit contributions
ars+0.027
oster+0.025
ers+0.024
Elsa+0.023
ze+0.021
Neg logit contributions
 Suz-0.033
 Sus-0.033
 Joe-0.030
 microsc-0.030
 Su-0.030
Token and
Token ID290
Feature activation-2.32e+00

Pos logit contributions
 Suz+0.005
 Sus+0.005
 microsc+0.005
 Joe+0.005
arter+0.005
Neg logit contributions
ars-0.003
ers-0.002
oster-0.002
Elsa-0.002
ze-0.002
Token wonderful
Token ID7932
Feature activation-4.83e-01

Pos logit contributions
ars+0.061
oster+0.057
ers+0.056
Finally+0.052
Eventually+0.050
Neg logit contributions
 Suz-0.057
 Sus-0.057
 microsc-0.051
 Joe-0.051
 Su-0.050
Token twins
Token ID20345
Feature activation-1.82e+00

Pos logit contributions
ars+0.013
oster+0.012
ers+0.011
Elsa+0.011
ze+0.010
Neg logit contributions
 Suz-0.012
 Sus-0.012
 Joe-0.011
 microsc-0.011
 Su-0.011
Token."
Token ID526
Feature activation-2.21e+00

Pos logit contributions
ars+0.052
oster+0.052
Finally+0.047
Elsa+0.046
Eventually+0.046
Neg logit contributions
 Sus-0.038
 Suz-0.035
 M-0.033
 Mike-0.033
 Joe-0.033
Token<|endoftext|>
Token ID50256
Feature activation-6.91e+00

Pos logit contributions
ars+0.038
ers+0.031
oster+0.031
Elsa+0.030
Finally+0.027
Neg logit contributions
 Suz-0.073
 Sus-0.073
arter-0.069
 microsc-0.067
 Joe-0.067
TokenL
Token ID43
Feature activation-1.61e+00

Pos logit contributions
ars+0.181
oster+0.180
ze+0.158
ers+0.157
Finally+0.155
Neg logit contributions
 Suz-0.161
 Sus-0.160
 Joe-0.149
 M-0.147
 microsc-0.146
Tokenily
Token ID813
Feature activation-8.26e-01

Pos logit contributions
ars+0.039
ers+0.036
oster+0.035
Elsa+0.032
Finally+0.030
Neg logit contributions
 Suz-0.043
 Sus-0.042
arter-0.039
 microsc-0.039
 Joe-0.038
Token likes
Token ID7832
Feature activation-3.19e+00

Pos logit contributions
ars+0.018
oster+0.017
ers+0.016
Elsa+0.015
Finally+0.015
Neg logit contributions
 Sus-0.024
 Suz-0.024
 Joe-0.022
 Su-0.022
 M-0.021
Token to
Token ID284
Feature activation-1.40e+00

Pos logit contributions
ars+0.071
ers+0.067
oster+0.065
Elsa+0.063
ze+0.059
Neg logit contributions
 Suz-0.082
 Sus-0.079
arter-0.077
 microsc-0.071
 Nice-0.071
Token write
Token ID3551
Feature activation-4.98e-01

Pos logit contributions
ars+0.037
oster+0.034
ers+0.033
Elsa+0.031
ze+0.030
Neg logit contributions
 Suz-0.034
 Sus-0.034
 microsc-0.030
 Su-0.030
arter-0.030
Token we
Token ID356
Feature activation-1.13e+00

Pos logit contributions
ars+0.022
oster+0.020
ers+0.020
Elsa+0.018
ze+0.017
Neg logit contributions
 Suz-0.018
 Sus-0.018
 Joe-0.016
 Su-0.016
 microsc-0.016
Token love
Token ID1842
Feature activation-1.07e+00

Pos logit contributions
ars+0.032
oster+0.031
ers+0.030
Elsa+0.028
bes+0.027
Neg logit contributions
 Sus-0.024
 Suz-0.024
 microsc-0.021
 Joe-0.021
 Su-0.021
Token our
Token ID674
Feature activation-1.56e+00

Pos logit contributions
ars+0.023
ers+0.022
Elsa+0.021
oster+0.020
ze+0.018
Neg logit contributions
 Suz-0.029
 Sus-0.028
arter-0.027
 Su-0.026
 Nice-0.026
Token garage
Token ID15591
Feature activation-1.39e+00

Pos logit contributions
ars+0.039
oster+0.036
ers+0.035
Elsa+0.033
ze+0.031
Neg logit contributions
 Suz-0.040
 Sus-0.039
 Joe-0.036
 Su-0.036
 microsc-0.035
Token."
Token ID526
Feature activation-2.32e+00

Pos logit contributions
ars+0.036
oster+0.034
Elsa+0.032
Finally+0.030
ers+0.030
Neg logit contributions
 Sus-0.035
 Suz-0.034
 Joe-0.030
 Jill-0.030
 Su-0.030
Token<|endoftext|>
Token ID50256
Feature activation-6.91e+00

Pos logit contributions
ars+0.045
oster+0.039
ers+0.038
Elsa+0.037
Finally+0.033
Neg logit contributions
 Suz-0.074
 Sus-0.073
 microsc-0.067
arter-0.067
 Su-0.067
TokenL
Token ID43
Feature activation-1.48e+00

Pos logit contributions
ars+0.179
oster+0.177
ze+0.158
ers+0.156
Finally+0.154
Neg logit contributions
 Suz-0.162
 Sus-0.161
 Joe-0.151
 microsc-0.148
 M-0.148
Tokenily
Token ID813
Feature activation-4.40e-01

Pos logit contributions
ars+0.032
oster+0.029
ers+0.028
Elsa+0.026
ze+0.025
Neg logit contributions
 Suz-0.044
 Sus-0.043
 Joe-0.040
 Su-0.040
 microsc-0.040
Token was
Token ID373
Feature activation-5.97e-01

Pos logit contributions
ars+0.010
oster+0.009
ers+0.008
Elsa+0.008
Finally+0.008
Neg logit contributions
 Sus-0.013
 Suz-0.013
 Joe-0.012
 Su-0.011
 microsc-0.011
Token an
Token ID281
Feature activation+2.93e+00

Pos logit contributions
ars+0.018
oster+0.017
ers+0.016
Elsa+0.015
Finally+0.014
Neg logit contributions
 Sus-0.013
 Suz-0.013
 Joe-0.012
 Su-0.012
 M-0.011
Token adventurous
Token ID39770
Feature activation-1.39e+00

Pos logit contributions
 Suz+0.075
 Sus+0.074
 microsc+0.067
 Joe+0.066
 Su+0.066
Neg logit contributions
ars-0.072
oster-0.067
ers-0.065
Elsa-0.060
jected-0.058
Token hockey
Token ID12217
Feature activation-5.11e+00

Pos logit contributions
ars+0.064
oster+0.061
ers+0.058
Elsa+0.056
Finally+0.054
Neg logit contributions
 Suz-0.055
 Sus-0.055
 Joe-0.049
arter-0.049
 Su-0.048
Token.
Token ID13
Feature activation+5.30e-01

Pos logit contributions
ars+0.129
Finally+0.127
oster+0.125
Eventually+0.123
Elsa+0.123
Neg logit contributions
 Suz-0.104
 Sus-0.104
arter-0.097
 Joe-0.097
 M-0.095
Token Good
Token ID4599
Feature activation-2.33e+00

Pos logit contributions
 Suz+0.015
 Sus+0.014
 microsc+0.014
yt+0.013
arter+0.013
Neg logit contributions
ars-0.011
oster-0.010
ers-0.010
Elsa-0.010
jected-0.009
Token friends
Token ID2460
Feature activation-2.97e-02

Pos logit contributions
ars+0.062
oster+0.058
ers+0.056
Elsa+0.054
Finally+0.052
Neg logit contributions
 Suz-0.057
 Sus-0.056
 Joe-0.051
arter-0.050
 Su-0.050
Token."
Token ID526
Feature activation-2.35e+00

Pos logit contributions
ars+0.001
oster+0.001
Finally+0.001
Elsa+0.001
Eventually+0.001
Neg logit contributions
 Sus-0.001
 Suz-0.001
 Joe-0.001
 Mike-0.001
 M-0.001
Token<|endoftext|>
Token ID50256
Feature activation-6.90e+00

Pos logit contributions
ars+0.062
oster+0.060
ers+0.056
ze+0.051
Elsa+0.051
Neg logit contributions
 Suz-0.061
 Sus-0.060
 Joe-0.054
 Su-0.054
 microsc-0.053
TokenS
Token ID50
Feature activation-9.42e-01

Pos logit contributions
ars+0.182
oster+0.182
ze+0.161
Finally+0.159
ers+0.159
Neg logit contributions
 Suz-0.159
 Sus-0.157
 Joe-0.147
 M-0.145
 microsc-0.145
Tokenara
Token ID3301
Feature activation-8.94e-01

Pos logit contributions
ars+0.023
oster+0.022
ers+0.020
ze+0.020
Elsa+0.019
Neg logit contributions
 Suz-0.025
 Sus-0.025
 Joe-0.024
 Su-0.023
 microsc-0.023
Token and
Token ID290
Feature activation-3.69e+00

Pos logit contributions
ars+0.020
oster+0.019
Finally+0.017
ers+0.017
Elsa+0.017
Neg logit contributions
 Suz-0.025
 Sus-0.025
 Joe-0.024
 Su-0.023
 M-0.023
Token Ben
Token ID3932
Feature activation-1.56e+00

Pos logit contributions
ars+0.093
oster+0.084
ers+0.082
Elsa+0.081
ze+0.075
Neg logit contributions
 Suz-0.090
 Sus-0.089
 microsc-0.083
arter-0.083
 Su-0.079
Token were
Token ID547
Feature activation-1.01e+00

Pos logit contributions
ars+0.042
oster+0.039
ers+0.038
Elsa+0.036
Finally+0.034
Neg logit contributions
 Suz-0.037
 Sus-0.037
 Joe-0.034
 Su-0.033
 microsc-0.033
Token are
Token ID389
Feature activation-6.53e-01

Pos logit contributions
ars+0.040
oster+0.037
ers+0.036
Elsa+0.033
ze+0.031
Neg logit contributions
 Suz-0.043
 Sus-0.042
 Joe-0.038
 Su-0.038
 microsc-0.038
Token Mom
Token ID11254
Feature activation-1.20e-02

Pos logit contributions
ars+0.016
oster+0.015
ers+0.015
Elsa+0.014
ze+0.013
Neg logit contributions
 Suz-0.016
 Sus-0.016
 microsc-0.015
 Su-0.015
arter-0.014
Token and
Token ID290
Feature activation-3.31e+00

Pos logit contributions
ars+0.000
oster+0.000
Finally+0.000
Eventually+0.000
Elsa+0.000
Neg logit contributions
 Sus-0.000
 Suz-0.000
 Su-0.000
 Joe-0.000
 M-0.000
Token Dad
Token ID17415
Feature activation+8.10e-02

Pos logit contributions
ars+0.086
Elsa+0.084
oster+0.080
ers+0.080
 End+0.076
Neg logit contributions
 Suz-0.073
 Sus-0.069
arter-0.067
 Nice-0.062
 microsc-0.062
Token.
Token ID13
Feature activation+6.99e-01

Pos logit contributions
 Sus+0.002
 Suz+0.002
 Joe+0.002
 M+0.001
 Mike+0.001
Neg logit contributions
ars-0.002
oster-0.002
Finally-0.002
Elsa-0.002
Eventually-0.002
Token<|endoftext|>
Token ID50256
Feature activation-6.90e+00

Pos logit contributions
 Suz+0.023
 Sus+0.022
 microsc+0.022
arter+0.021
yt+0.021
Neg logit contributions
ars-0.010
Elsa-0.010
oster-0.008
ers-0.008
There-0.008
TokenL
Token ID43
Feature activation-1.13e+00

Pos logit contributions
ars+0.191
oster+0.188
ze+0.169
Finally+0.166
Eventually+0.165
Neg logit contributions
 Sus-0.148
 Suz-0.147
 M-0.137
 Joe-0.136
 Mike-0.132
Tokenily
Token ID813
Feature activation-6.59e-01

Pos logit contributions
ars+0.022
ers+0.020
oster+0.020
Elsa+0.018
Finally+0.016
Neg logit contributions
 Suz-0.035
 Sus-0.035
 microsc-0.032
 Joe-0.032
arter-0.032
Token and
Token ID290
Feature activation-3.63e+00

Pos logit contributions
ars+0.016
oster+0.014
ers+0.014
Elsa+0.013
Finally+0.012
Neg logit contributions
 Suz-0.018
 Sus-0.018
 Joe-0.016
 Su-0.016
 microsc-0.016
Token Ben
Token ID3932
Feature activation-1.42e+00

Pos logit contributions
ars+0.095
oster+0.086
ers+0.084
Elsa+0.083
ze+0.077
Neg logit contributions
 Suz-0.087
 Sus-0.086
 microsc-0.081
arter-0.080
 Su-0.076
Token were
Token ID547
Feature activation-1.19e+00

Pos logit contributions
ars+0.037
oster+0.034
ers+0.034
Elsa+0.032
ze+0.030
Neg logit contributions
 Suz-0.035
 Sus-0.035
 microsc-0.031
 Joe-0.031
 Su-0.031
Token but
Token ID475
Feature activation+6.63e-01

Pos logit contributions
ars+0.005
oster+0.004
ers+0.004
Elsa+0.004
ze+0.004
Neg logit contributions
 Suz-0.004
 Sus-0.004
 Joe-0.004
 microsc-0.004
 Su-0.004
Token not
Token ID407
Feature activation+2.73e+00

Pos logit contributions
 Suz+0.020
 Sus+0.020
 Su+0.019
 microsc+0.018
 Joe+0.018
Neg logit contributions
ars-0.013
oster-0.012
ers-0.011
Elsa-0.011
Finally-0.010
Token too
Token ID1165
Feature activation-3.39e-01

Pos logit contributions
 Suz+0.076
 Sus+0.074
 Su+0.069
 microsc+0.067
arter+0.066
Neg logit contributions
ars-0.062
oster-0.054
ers-0.054
ze-0.051
Elsa-0.050
Token much
Token ID881
Feature activation+1.87e-01

Pos logit contributions
ars+0.008
oster+0.007
ers+0.007
Elsa+0.006
ze+0.006
Neg logit contributions
 Suz-0.010
 Sus-0.010
 Su-0.009
 Joe-0.008
 microsc-0.008
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
 Suz+0.005
 Sus+0.005
 Su+0.005
 Joe+0.005
 microsc+0.005
Neg logit contributions
ars-0.004
oster-0.004
ers-0.004
Elsa-0.003
ze-0.003
Token<|endoftext|>
Token ID50256
Feature activation-6.89e+00

Pos logit contributions
 Suz+0.015
 Sus+0.015
 Joe+0.013
 Su+0.013
 microsc+0.013
Neg logit contributions
ars-0.014
oster-0.013
ers-0.013
Elsa-0.012
Finally-0.011
TokenAnna
Token ID31160
Feature activation-5.99e-01

Pos logit contributions
ars+0.182
oster+0.176
ze+0.157
ers+0.156
Finally+0.154
Neg logit contributions
 Sus-0.162
 Suz-0.160
 Joe-0.151
 M-0.150
 microsc-0.144
Token and
Token ID290
Feature activation-3.58e+00

Pos logit contributions
ars+0.015
oster+0.014
Finally+0.013
ers+0.012
Eventually+0.012
Neg logit contributions
 Sus-0.016
 Suz-0.015
 Joe-0.014
 Su-0.014
 M-0.014
Token Ben
Token ID3932
Feature activation-1.41e+00

Pos logit contributions
ars+0.104
oster+0.098
ers+0.095
Elsa+0.089
Finally+0.086
Neg logit contributions
 Suz-0.082
 Sus-0.081
 Joe-0.073
 Su-0.072
 microsc-0.069
Token were
Token ID547
Feature activation-8.32e-01

Pos logit contributions
ars+0.039
oster+0.037
ers+0.035
Elsa+0.034
Finally+0.032
Neg logit contributions
 Sus-0.032
 Suz-0.032
 Joe-0.029
 Su-0.029
 microsc-0.028
Token playing
Token ID2712
Feature activation-4.58e+00

Pos logit contributions
ars+0.027
oster+0.026
ers+0.025
Elsa+0.023
ze+0.023
Neg logit contributions
 Sus-0.016
 Suz-0.016
 Joe-0.014
 Su-0.014
arter-0.013
Token is
Token ID318
Feature activation-1.28e-01

Pos logit contributions
 Sus+0.013
 Suz+0.012
 Joe+0.012
 M+0.011
 Johnny+0.011
Neg logit contributions
ars-0.017
oster-0.017
Finally-0.017
Elsa-0.016
Eventually-0.016
Token good
Token ID922
Feature activation-2.64e+00

Pos logit contributions
ars+0.004
oster+0.003
Finally+0.003
ers+0.003
Eventually+0.003
Neg logit contributions
 Sus-0.003
 Suz-0.003
 Su-0.003
arter-0.003
 M-0.002
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
ars+0.075
oster+0.071
Finally+0.065
ers+0.063
Elsa+0.063
Neg logit contributions
 Sus-0.063
 Suz-0.060
 Joe-0.054
 Su-0.054
 Mike-0.053
Token always
Token ID1464
Feature activation-3.73e-01

Pos logit contributions
ars+0.003
oster+0.003
ers+0.003
Elsa+0.003
ze+0.003
Neg logit contributions
 Suz-0.003
 Sus-0.003
 microsc-0.003
 Joe-0.003
arter-0.003
Token."
Token ID526
Feature activation-2.22e+00

Pos logit contributions
ars+0.010
oster+0.010
ers+0.009
Elsa+0.008
Finally+0.008
Neg logit contributions
 Sus-0.009
 Suz-0.009
 Joe-0.008
 Su-0.008
 M-0.008
Token<|endoftext|>
Token ID50256
Feature activation-6.89e+00

Pos logit contributions
ars+0.046
oster+0.042
ers+0.040
Elsa+0.037
ze+0.035
Neg logit contributions
 Suz-0.068
 Sus-0.068
 Joe-0.062
 Su-0.062
 microsc-0.061
TokenS
Token ID50
Feature activation-1.05e+00

Pos logit contributions
ars+0.183
oster+0.178
ze+0.158
ers+0.156
Finally+0.155
Neg logit contributions
 Sus-0.161
 Suz-0.161
 Joe-0.150
 M-0.148
 Su-0.144
Tokenara
Token ID3301
Feature activation-1.07e+00

Pos logit contributions
ars+0.026
oster+0.025
ze+0.023
ers+0.023
bes+0.021
Neg logit contributions
 Sus-0.028
 Suz-0.027
 Su-0.026
 M-0.025
 J-0.025
Token and
Token ID290
Feature activation-3.41e+00

Pos logit contributions
ars+0.025
oster+0.024
Finally+0.022
ers+0.021
Elsa+0.021
Neg logit contributions
 Sus-0.030
 Suz-0.030
 Joe-0.028
 Su-0.027
 M-0.027
Token Ben
Token ID3932
Feature activation-1.63e+00

Pos logit contributions
ars+0.087
oster+0.080
ers+0.078
Elsa+0.074
ze+0.069
Neg logit contributions
 Suz-0.086
 Sus-0.085
 microsc-0.078
arter-0.077
 Su-0.076
Token are
Token ID389
Feature activation-1.29e+00

Pos logit contributions
ars+0.044
oster+0.041
ers+0.039
Elsa+0.037
Finally+0.035
Neg logit contributions
 Suz-0.040
 Sus-0.039
 Joe-0.035
 Su-0.035
 microsc-0.035
Token her
Token ID607
Feature activation-3.24e-01

Pos logit contributions
ars+0.079
oster+0.075
ers+0.075
Elsa+0.073
ze+0.069
Neg logit contributions
 Suz-0.052
 microsc-0.050
 Sus-0.050
arter-0.047
 Nice-0.044
Token little
Token ID1310
Feature activation-1.64e+00

Pos logit contributions
ars+0.007
oster+0.007
ers+0.006
Elsa+0.006
ze+0.006
Neg logit contributions
 Suz-0.009
 Sus-0.009
 microsc-0.008
arter-0.008
 Joe-0.008
Token girl
Token ID2576
Feature activation-1.44e+00

Pos logit contributions
ars+0.038
oster+0.035
ers+0.034
Elsa+0.032
ze+0.030
Neg logit contributions
 Suz-0.046
 Sus-0.045
 Joe-0.041
 Su-0.041
 microsc-0.041
Token,
Token ID11
Feature activation+1.44e+00

Pos logit contributions
ars+0.031
oster+0.029
ers+0.028
Elsa+0.026
ze+0.024
Neg logit contributions
 Suz-0.040
 Sus-0.040
 microsc-0.037
arter-0.037
 Su-0.036
Token "
Token ID366
Feature activation-2.37e+00

Pos logit contributions
 Suz+0.029
 Sus+0.028
arter+0.028
 microsc+0.028
 Joe+0.025
Neg logit contributions
ars-0.041
oster-0.038
ers-0.038
Elsa-0.037
ze-0.034
TokenWould
Token ID17353
Feature activation-6.89e+00

Pos logit contributions
ars+0.058
oster+0.053
ers+0.050
bes+0.044
Elsa+0.044
Neg logit contributions
 Sus-0.066
 Suz-0.065
 Su-0.060
 Joe-0.057
 M-0.057
Token you
Token ID345
Feature activation-2.07e+00

Pos logit contributions
ars+0.155
oster+0.155
ers+0.138
bes+0.126
Elsa+0.123
Neg logit contributions
 Sus-0.197
 Suz-0.189
 Su-0.181
 Joe-0.180
 M-0.176
Token like
Token ID588
Feature activation-1.57e+00

Pos logit contributions
ars+0.065
oster+0.063
ers+0.061
Elsa+0.058
jected+0.056
Neg logit contributions
 Suz-0.039
 Sus-0.039
 Joe-0.034
 Su-0.033
 microsc-0.032
Token a
Token ID257
Feature activation+1.30e+00

Pos logit contributions
ars+0.039
ers+0.036
oster+0.035
Elsa+0.033
Finally+0.032
Neg logit contributions
 Suz-0.040
 Sus-0.039
arter-0.037
 microsc-0.035
 Su-0.035
Token puppy
Token ID26188
Feature activation-2.63e+00

Pos logit contributions
 Suz+0.034
 Sus+0.034
 Joe+0.031
 microsc+0.031
 Su+0.031
Neg logit contributions
ars-0.032
oster-0.029
ers-0.028
Elsa-0.027
ze-0.025
Token?".
Token ID43634
Feature activation-5.45e-01

Pos logit contributions
ars+0.078
oster+0.072
ers+0.071
Elsa+0.067
Finally+0.064
Neg logit contributions
 Suz-0.057
 Sus-0.056
 Joe-0.050
 Su-0.049
 microsc-0.049
Token more
Token ID517
Feature activation+1.08e-01

Pos logit contributions
ars+0.039
ers+0.035
oster+0.035
Elsa+0.033
ze+0.031
Neg logit contributions
 Suz-0.042
 Sus-0.041
arter-0.039
 Su-0.038
 microsc-0.038
Token squash
Token ID34613
Feature activation-9.85e-01

Pos logit contributions
 Suz+0.003
 Sus+0.003
 Su+0.003
 microsc+0.003
 Joe+0.003
Neg logit contributions
ars-0.002
oster-0.002
ers-0.002
Elsa-0.002
ze-0.002
Token."
Token ID526
Feature activation-2.05e+00

Pos logit contributions
ars+0.027
oster+0.025
ers+0.023
Elsa+0.023
Finally+0.023
Neg logit contributions
 Suz-0.023
 Sus-0.023
 Joe-0.021
 microsc-0.020
 Su-0.020
Token<|endoftext|>
Token ID50256
Feature activation-6.66e+00

Pos logit contributions
ars+0.045
oster+0.043
ers+0.041
ze+0.036
Elsa+0.036
Neg logit contributions
 Suz-0.061
 Sus-0.061
 Joe-0.055
 Su-0.055
 microsc-0.054
Token<|endoftext|>
Token ID50256
Feature activation-6.92e+00

Pos logit contributions
oster+0.179
ars+0.179
ze+0.159
Finally+0.157
ers+0.155
Neg logit contributions
 Suz-0.148
 Sus-0.147
 Joe-0.138
 M-0.138
 microsc-0.134
Token<|endoftext|>
Token ID50256
Feature activation-6.88e+00

Pos logit contributions
ars+0.187
oster+0.183
ers+0.160
ze+0.159
Finally+0.158
Neg logit contributions
 Suz-0.161
 Sus-0.159
 Joe-0.150
 M-0.148
 Su-0.144
TokenL
Token ID43
Feature activation-1.32e+00

Pos logit contributions
ars+0.186
oster+0.181
ers+0.159
ze+0.156
Elsa+0.155
Neg logit contributions
 Suz-0.162
 Sus-0.159
 Joe-0.151
 M-0.148
 Su-0.144
Tokenily
Token ID813
Feature activation-8.37e-01

Pos logit contributions
ars+0.034
oster+0.032
ers+0.029
Elsa+0.028
jected+0.027
Neg logit contributions
 Suz-0.034
 Sus-0.034
 Joe-0.031
 Su-0.031
 microsc-0.030
Token and
Token ID290
Feature activation-3.69e+00

Pos logit contributions
ars+0.016
oster+0.014
ers+0.013
Elsa+0.012
Finally+0.011
Neg logit contributions
 Suz-0.027
 Sus-0.027
 Joe-0.025
 Su-0.025
 microsc-0.025
Token Sam
Token ID3409
Feature activation-1.67e-01

Pos logit contributions
ars+0.111
oster+0.108
ers+0.106
Elsa+0.095
bes+0.093
Neg logit contributions
 Suz-0.087
 Sus-0.085
 Joe-0.081
 Su-0.077
 Joey-0.075
Token went
Token ID1816
Feature activation-1.99e+00

Pos logit contributions
ars+0.004
oster+0.004
ers+0.004
Elsa+0.003
Finally+0.003
Neg logit contributions
 Suz-0.004
 Sus-0.004
 Joe-0.004
 Su-0.004
 microsc-0.004
Token is
Token ID318
Feature activation-3.65e-01

Pos logit contributions
ars+0.005
oster+0.005
Elsa+0.004
over+0.004
ers+0.004
Neg logit contributions
 Suz-0.004
 Sus-0.004
 M-0.004
 Joe-0.004
 Nice-0.004
Token useful
Token ID4465
Feature activation-1.52e+00

Pos logit contributions
ars+0.009
oster+0.008
ers+0.008
Elsa+0.007
Finally+0.007
Neg logit contributions
 Suz-0.010
 Sus-0.010
 microsc-0.009
 Joe-0.009
 Su-0.009
Token and
Token ID290
Feature activation-2.74e+00

Pos logit contributions
ars+0.040
oster+0.038
Elsa+0.035
Finally+0.034
ers+0.034
Neg logit contributions
 Sus-0.037
 Suz-0.037
 Joe-0.033
 microsc-0.032
 Su-0.032
Token fun
Token ID1257
Feature activation-2.12e+00

Pos logit contributions
ars+0.068
oster+0.062
ers+0.061
Elsa+0.057
ze+0.053
Neg logit contributions
 Suz-0.072
 Sus-0.071
 Joe-0.064
 Su-0.064
 microsc-0.064
Token."
Token ID526
Feature activation-2.13e+00

Pos logit contributions
ars+0.058
oster+0.056
ers+0.050
Elsa+0.050
Finally+0.048
Neg logit contributions
 Sus-0.051
 Suz-0.050
 Joe-0.045
 microsc-0.044
 Su-0.044
Token<|endoftext|>
Token ID50256
Feature activation-6.88e+00

Pos logit contributions
ars+0.043
oster+0.039
ers+0.038
Elsa+0.035
Finally+0.032
Neg logit contributions
 Suz-0.066
 Sus-0.066
 Joe-0.060
 microsc-0.060
 Su-0.060
TokenOne
Token ID3198
Feature activation-4.19e-01

Pos logit contributions
ars+0.184
oster+0.183
ze+0.163
Finally+0.159
ers+0.158
Neg logit contributions
 Sus-0.156
 Suz-0.156
 Joe-0.146
 M-0.144
 microsc-0.142
Token day
Token ID1110
Feature activation-2.65e+00

Pos logit contributions
ars+0.010
oster+0.009
ers+0.009
Elsa+0.008
ze+0.008
Neg logit contributions
 Suz-0.011
 Sus-0.011
 microsc-0.010
 Su-0.010
 Joe-0.010
Token,
Token ID11
Feature activation+1.52e+00

Pos logit contributions
ars+0.056
ers+0.048
oster+0.048
Elsa+0.044
Finally+0.041
Neg logit contributions
 Suz-0.080
 Sus-0.079
 microsc-0.074
arter-0.073
 Su-0.073
Token a
Token ID257
Feature activation-3.48e-01

Pos logit contributions
 Suz+0.049
 Sus+0.048
 microsc+0.045
 Su+0.045
arter+0.045
Neg logit contributions
ars-0.029
oster-0.025
ers-0.024
Elsa-0.022
Finally-0.021
Token man
Token ID582
Feature activation+2.20e-01

Pos logit contributions
ars+0.008
ers+0.008
oster+0.008
Elsa+0.007
ze+0.007
Neg logit contributions
 Suz-0.009
 Sus-0.009
 Joe-0.008
 Su-0.008
 microsc-0.008
Token,
Token ID11
Feature activation+9.52e-02

Pos logit contributions
ars+0.047
oster+0.045
ers+0.042
Elsa+0.039
bes+0.038
Neg logit contributions
 Sus-0.039
 Suz-0.039
 M-0.036
 Joe-0.035
 microsc-0.034
Token too
Token ID1165
Feature activation-6.98e-01

Pos logit contributions
 Suz+0.002
 Sus+0.002
 Joe+0.002
 microsc+0.002
arter+0.002
Neg logit contributions
ars-0.002
ers-0.002
Elsa-0.002
oster-0.002
Finally-0.002
Token,
Token ID11
Feature activation+4.42e-01

Pos logit contributions
ars+0.013
ers+0.011
oster+0.011
Elsa+0.010
"+0.010
Neg logit contributions
 Suz-0.022
 Sus-0.022
 microsc-0.020
 Joe-0.020
 Su-0.020
Token Lily
Token ID20037
Feature activation-8.23e-01

Pos logit contributions
 Suz+0.009
 Sus+0.009
 Joe+0.008
 Su+0.008
 microsc+0.008
Neg logit contributions
ars-0.014
oster-0.013
ers-0.012
Elsa-0.012
ze-0.011
Token."
Token ID526
Feature activation-2.12e+00

Pos logit contributions
ars+0.026
oster+0.026
Finally+0.024
Eventually+0.024
Elsa+0.023
Neg logit contributions
 Sus-0.015
 Suz-0.015
 M-0.014
 Joe-0.013
 Jill-0.013
Token<|endoftext|>
Token ID50256
Feature activation-6.88e+00

Pos logit contributions
ars+0.055
oster+0.055
ers+0.050
ze+0.047
jected+0.046
Neg logit contributions
 Suz-0.055
 Sus-0.055
 M-0.049
 Jill-0.049
 Su-0.049
TokenTom
Token ID13787
Feature activation+7.36e-01

Pos logit contributions
ars+0.174
oster+0.172
ers+0.150
ze+0.149
Finally+0.146
Neg logit contributions
 Suz-0.170
 Sus-0.169
 Joe-0.157
 M-0.154
 microsc-0.154
Token and
Token ID290
Feature activation-2.97e+00

Pos logit contributions
 Suz+0.017
 Sus+0.017
 Joe+0.016
 Su+0.015
 microsc+0.015
Neg logit contributions
ars-0.020
oster-0.019
ers-0.018
Elsa-0.017
Finally-0.017
Token Mia
Token ID32189
Feature activation-5.30e-01

Pos logit contributions
ars+0.071
Elsa+0.065
ers+0.064
oster+0.064
'+0.058
Neg logit contributions
arter-0.071
 Suz-0.068
 microsc-0.068
 Sus-0.065
yt-0.061
Token are
Token ID389
Feature activation-1.38e+00

Pos logit contributions
ars+0.015
oster+0.014
ers+0.013
Elsa+0.013
Finally+0.012
Neg logit contributions
 Sus-0.012
 Suz-0.012
 Joe-0.011
 Su-0.011
 microsc-0.011
Token friends
Token ID2460
Feature activation-6.63e-01

Pos logit contributions
ars+0.041
oster+0.039
ers+0.037
Elsa+0.035
ze+0.034
Neg logit contributions
 Suz-0.030
 Sus-0.030
 Joe-0.026
 Su-0.026
 microsc-0.026
Token playing
Token ID2712
Feature activation-2.92e+00

Pos logit contributions
 Suz+0.016
 Sus+0.016
 microsc+0.014
 Su+0.014
 Joe+0.014
Neg logit contributions
ars-0.017
ers-0.016
oster-0.015
Elsa-0.014
ze-0.014
Token with
Token ID351
Feature activation+1.67e+00

Pos logit contributions
ars+0.073
oster+0.069
Elsa+0.064
ers+0.062
Finally+0.060
Neg logit contributions
 Sus-0.075
 Suz-0.074
 Joe-0.068
 microsc-0.067
 Su-0.066
Token him
Token ID683
Feature activation+4.40e-01

Pos logit contributions
 Suz+0.039
 Sus+0.038
arter+0.034
 Su+0.034
 microsc+0.034
Neg logit contributions
ars-0.046
ers-0.043
oster-0.042
Elsa-0.039
ze-0.038
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
 Suz+0.012
 Sus+0.012
 Su+0.011
 Joe+0.011
 microsc+0.011
Neg logit contributions
ars-0.010
oster-0.009
ers-0.009
Elsa-0.008
ze-0.008
Token Then
Token ID3244
Feature activation-9.87e-01

Pos logit contributions
 Suz+0.026
 Sus+0.025
 microsc+0.023
arter+0.023
 Su+0.023
Neg logit contributions
ars-0.030
oster-0.028
ers-0.027
Elsa-0.025
Finally-0.024
Token an
Token ID281
Feature activation+4.54e+00

Pos logit contributions
ars+0.022
oster+0.020
ers+0.020
Elsa+0.018
ze+0.017
Neg logit contributions
 Suz-0.028
 Sus-0.027
arter-0.026
 microsc-0.026
 Su-0.025
Token idea
Token ID2126
Feature activation+1.17e+00

Pos logit contributions
 Sus+0.129
 Suz+0.129
 microsc+0.119
 Su+0.114
 Joe+0.114
Neg logit contributions
ars-0.102
oster-0.094
ers-0.093
ze-0.081
jected-0.080
Token came
Token ID1625
Feature activation+3.03e+00

Pos logit contributions
 Suz+0.029
 Sus+0.028
 microsc+0.027
arter+0.027
 Su+0.025
Neg logit contributions
ars-0.030
ers-0.028
oster-0.027
jected-0.024
ze-0.024
Token to
Token ID284
Feature activation-7.56e-01

Pos logit contributions
 Suz+0.073
 Sus+0.072
 Joe+0.065
 Su+0.064
 microsc+0.064
Neg logit contributions
ars-0.084
oster-0.078
ers-0.075
Elsa-0.072
Finally-0.068
Token him
Token ID683
Feature activation+9.66e-01

Pos logit contributions
ars+0.018
oster+0.017
ers+0.017
Elsa+0.016
ze+0.015
Neg logit contributions
 Suz-0.020
 Sus-0.019
 microsc-0.018
arter-0.018
 Su-0.018
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 Sus+0.026
 Suz+0.026
 Joe+0.025
 microsc+0.023
 M+0.023
Neg logit contributions
ars-0.024
oster-0.023
ers-0.021
Elsa-0.021
Finally-0.020
Token upon
Token ID2402
Feature activation-3.45e+00

Pos logit contributions
 Suz+0.051
 Sus+0.051
 microsc+0.049
arter+0.047
 Su+0.045
Neg logit contributions
ars-0.075
oster-0.068
ers-0.067
 End-0.064
Elsa-0.064
Token a
Token ID257
Feature activation-5.44e-01

Pos logit contributions
ars+0.092
oster+0.088
ers+0.081
Elsa+0.077
jected+0.076
Neg logit contributions
 Sus-0.084
 Suz-0.083
 Joe-0.076
 Jill-0.074
 M-0.074
Token time
Token ID640
Feature activation-6.38e-01

Pos logit contributions
ars+0.014
ers+0.013
oster+0.013
Elsa+0.012
jected+0.011
Neg logit contributions
 Suz-0.013
 Sus-0.013
 Joe-0.012
 Su-0.012
arter-0.012
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
ars+0.011
ers+0.009
oster+0.009
"+0.008
'+0.008
Neg logit contributions
 Sus-0.020
 Suz-0.020
 microsc-0.019
arter-0.019
 Su-0.018
Token a
Token ID257
Feature activation-3.03e-01

Pos logit contributions
 Suz+0.048
 Sus+0.048
 microsc+0.044
 Su+0.043
 Joe+0.043
Neg logit contributions
ars-0.038
oster-0.035
ers-0.034
Elsa-0.031
Finally-0.030
Token nos
Token ID43630
Feature activation+4.45e+00

Pos logit contributions
ars+0.008
ers+0.008
oster+0.007
jected+0.007
bes+0.007
Neg logit contributions
 Sus-0.007
 Suz-0.007
 Joe-0.006
 Su-0.006
 M-0.006
Tokeny
Token ID88
Feature activation+6.54e-02

Pos logit contributions
 Suz+0.112
 Sus+0.112
 microsc+0.109
 Nice+0.101
arter+0.099
Neg logit contributions
ars-0.101
oster-0.100
ers-0.096
over-0.083
rot-0.077
Token wolf
Token ID17481
Feature activation+9.99e-01

Pos logit contributions
 Suz+0.002
 Sus+0.002
 microsc+0.002
 Joe+0.002
 Su+0.002
Neg logit contributions
ars-0.001
oster-0.001
ers-0.001
Elsa-0.001
ze-0.001
Token was
Token ID373
Feature activation-4.10e-01

Pos logit contributions
 Sus+0.026
 Suz+0.026
 microsc+0.024
arter+0.023
 Su+0.023
Neg logit contributions
ars-0.023
oster-0.022
ers-0.022
ze-0.020
Elsa-0.019
Token walking
Token ID6155
Feature activation-1.37e+00

Pos logit contributions
ars+0.012
oster+0.011
ers+0.010
Elsa+0.010
Finally+0.010
Neg logit contributions
 Suz-0.010
 Sus-0.009
 Joe-0.008
 Su-0.008
arter-0.008
Token in
Token ID287
Feature activation+2.29e+00

Pos logit contributions
ars+0.033
oster+0.030
ers+0.030
Elsa+0.027
ze+0.026
Neg logit contributions
 Suz-0.037
 Sus-0.037
 microsc-0.034
 Su-0.033
 Joe-0.033
Token got
Token ID1392
Feature activation+1.99e+00

Pos logit contributions
 Suz+0.005
 Sus+0.005
 Joe+0.005
 microsc+0.005
 Su+0.005
Neg logit contributions
ars-0.006
oster-0.005
ers-0.005
Elsa-0.005
Finally-0.004
Token very
Token ID845
Feature activation+4.42e-01

Pos logit contributions
 Suz+0.053
 Sus+0.053
 Joe+0.048
 Su+0.048
 microsc+0.047
Neg logit contributions
ars-0.049
oster-0.045
ers-0.044
Elsa-0.041
Finally-0.039
Token alert
Token ID7995
Feature activation-2.37e-01

Pos logit contributions
 Suz+0.013
 Sus+0.013
 Nice+0.013
arter+0.013
 microsc+0.013
Neg logit contributions
ars-0.009
oster-0.008
ers-0.007
ze-0.007
Elsa-0.007
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
ars+0.006
oster+0.006
Elsa+0.005
Finally+0.005
ers+0.005
Neg logit contributions
 Sus-0.007
 Suz-0.007
 Joe-0.006
 Su-0.006
 M-0.006
Token called
Token ID1444
Feature activation+7.46e-01

Pos logit contributions
ars+0.034
oster+0.031
ers+0.030
Elsa+0.028
ze+0.028
Neg logit contributions
 Suz-0.039
 Sus-0.039
arter-0.037
 microsc-0.036
 Joe-0.035
Token out
Token ID503
Feature activation+4.71e+00

Pos logit contributions
 Suz+0.019
 Sus+0.019
 microsc+0.017
 Joe+0.017
 Su+0.017
Neg logit contributions
ars-0.019
oster-0.018
ers-0.017
Elsa-0.016
ze-0.015
Token to
Token ID284
Feature activation-6.39e-01

Pos logit contributions
 Suz+0.133
 Sus+0.132
arter+0.129
 microsc+0.127
 Su+0.124
Neg logit contributions
ars-0.080
"-0.075
ers-0.074
ze-0.071
oster-0.071
Token her
Token ID607
Feature activation+4.57e-01

Pos logit contributions
ars+0.012
oster+0.011
ers+0.011
Elsa+0.010
ze+0.010
Neg logit contributions
 Suz-0.019
 Sus-0.018
 microsc-0.018
arter-0.018
yt-0.017
Token mom
Token ID1995
Feature activation+2.63e+00

Pos logit contributions
 Suz+0.013
 Sus+0.013
 microsc+0.012
arter+0.012
 Su+0.012
Neg logit contributions
ars-0.009
oster-0.009
ers-0.009
Elsa-0.008
ze-0.008
Tokenmy
Token ID1820
Feature activation+1.26e+00

Pos logit contributions
 Suz+0.069
 Sus+0.069
 microsc+0.063
arter+0.063
 Su+0.062
Neg logit contributions
ars-0.061
ers-0.057
oster-0.055
Elsa-0.052
ze-0.048
Token.
Token ID13
Feature activation+7.07e-01

Pos logit contributions
 Sus+0.032
 Suz+0.032
 Joe+0.029
 M+0.028
 microsc+0.028
Neg logit contributions
ars-0.035
oster-0.032
ers-0.030
Elsa-0.029
Finally-0.028
Token It
Token ID632
Feature activation+8.97e-01

Pos logit contributions
 Suz+0.060
 Sus+0.059
 Joe+0.054
 Su+0.054
 microsc+0.054
Neg logit contributions
ars-0.052
oster-0.047
ers-0.046
Elsa-0.043
Finally-0.040
Token is
Token ID318
Feature activation+4.86e-01

Pos logit contributions
 Suz+0.025
 Sus+0.024
 Joe+0.022
 microsc+0.022
 Su+0.022
Neg logit contributions
ars-0.021
oster-0.019
ers-0.018
Elsa-0.018
Finally-0.017
Token hard
Token ID1327
Feature activation-1.71e+00

Pos logit contributions
 Suz+0.014
 Sus+0.014
 microsc+0.013
 Su+0.013
 Joe+0.013
Neg logit contributions
ars-0.011
oster-0.010
ers-0.009
Elsa-0.009
ze-0.008
Token to
Token ID284
Feature activation+1.19e+00

Pos logit contributions
ars+0.038
oster+0.035
ers+0.034
Elsa+0.032
Finally+0.029
Neg logit contributions
 Suz-0.049
 Sus-0.049
 Joe-0.044
 Su-0.044
 microsc-0.044
Token get
Token ID651
Feature activation+1.32e+00

Pos logit contributions
 Suz+0.028
 Sus+0.028
 Joe+0.025
 Su+0.025
 microsc+0.025
Neg logit contributions
ars-0.032
oster-0.030
ers-0.029
Elsa-0.028
Finally-0.026
Token out
Token ID503
Feature activation+4.69e+00

Pos logit contributions
 Sus+0.035
 Suz+0.034
 Joe+0.031
 Su+0.031
 M+0.030
Neg logit contributions
ars-0.033
oster-0.031
ers-0.029
Elsa-0.028
Finally-0.026
Token.
Token ID13
Feature activation+2.03e+00

Pos logit contributions
 Suz+0.127
 Sus+0.126
 microsc+0.114
 Joe+0.114
 Su+0.113
Neg logit contributions
ars-0.112
oster-0.102
ers-0.101
Elsa-0.092
ze-0.088
Token You
Token ID921
Feature activation-1.00e+00

Pos logit contributions
 Suz+0.054
 Sus+0.054
 Joe+0.049
 Su+0.049
 microsc+0.048
Neg logit contributions
ars-0.050
oster-0.046
ers-0.045
Elsa-0.041
Finally-0.039
Token are
Token ID389
Feature activation+4.74e-01

Pos logit contributions
ars+0.027
oster+0.025
ers+0.024
Elsa+0.023
Finally+0.022
Neg logit contributions
 Suz-0.024
 Sus-0.024
 Joe-0.021
 Su-0.021
 microsc-0.021
Token in
Token ID287
Feature activation+2.68e+00

Pos logit contributions
 Suz+0.013
 Sus+0.013
 Joe+0.012
 microsc+0.011
 Su+0.011
Neg logit contributions
ars-0.011
oster-0.010
ers-0.010
Elsa-0.010
Finally-0.009
Token big
Token ID1263
Feature activation+1.59e+00

Pos logit contributions
 Suz+0.066
 Sus+0.065
 Joe+0.058
 Su+0.058
 microsc+0.058
Neg logit contributions
ars-0.072
oster-0.067
ers-0.065
Elsa-0.061
Finally-0.058
TokenThe
Token ID464
Feature activation-1.86e+00

Pos logit contributions
ars+0.007
ers+0.006
oster+0.006
ze+0.006
jected+0.005
Neg logit contributions
 Suz-0.012
 Sus-0.011
 Su-0.011
 Joe-0.011
 Jill-0.010
Token yacht
Token ID43207
Feature activation+4.95e-01

Pos logit contributions
ars+0.048
ers+0.044
oster+0.042
Elsa+0.040
bes+0.040
Neg logit contributions
 Suz-0.045
 Sus-0.045
 microsc-0.041
 Joe-0.040
 M-0.039
Token sailed
Token ID33835
Feature activation-1.33e+00

Pos logit contributions
 Suz+0.012
 Sus+0.012
 Joe+0.011
 Mike+0.010
 Su+0.010
Neg logit contributions
ars-0.013
oster-0.012
Elsa-0.012
ers-0.012
Finally-0.011
Token away
Token ID1497
Feature activation+1.46e+00

Pos logit contributions
ars+0.030
oster+0.028
Elsa+0.027
ers+0.025
Finally+0.024
Neg logit contributions
 Sus-0.037
 Suz-0.037
 Joe-0.035
 Su-0.034
 Jill-0.033
Token with
Token ID351
Feature activation+1.44e+00

Pos logit contributions
 Suz+0.043
 Sus+0.042
 microsc+0.040
arter+0.040
 Su+0.040
Neg logit contributions
ars-0.030
ers-0.027
oster-0.027
Elsa-0.023
ze-0.022
Token all
Token ID477
Feature activation+4.81e+00

Pos logit contributions
 Suz+0.037
 Sus+0.037
arter+0.033
 microsc+0.033
 Su+0.033
Neg logit contributions
ars-0.036
ers-0.033
oster-0.033
Elsa-0.030
ze-0.028
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 Suz+0.102
 Sus+0.102
arter+0.095
 microsc+0.094
 Su+0.089
Neg logit contributions
ars-0.134
ers-0.129
oster-0.125
Elsa-0.114
jected-0.109
Token fun
Token ID1257
Feature activation+1.68e-02

Pos logit contributions
ars+0.031
oster+0.029
ers+0.028
Elsa+0.026
Finally+0.025
Neg logit contributions
 Suz-0.031
 Sus-0.031
 Joe-0.028
 microsc-0.028
 Su-0.028
Token things
Token ID1243
Feature activation+1.16e+00

Pos logit contributions
 Suz+0.000
 Sus+0.000
arter+0.000
 microsc+0.000
 Su+0.000
Neg logit contributions
ars-0.000
ers-0.000
oster-0.000
ze-0.000
Elsa-0.000
Token the
Token ID262
Feature activation-1.84e+00

Pos logit contributions
 Suz+0.030
 Sus+0.029
 microsc+0.027
arter+0.027
 Su+0.027
Neg logit contributions
ars-0.029
oster-0.027
ers-0.026
Elsa-0.024
ze-0.023
Token little
Token ID1310
Feature activation-1.33e+00

Pos logit contributions
ars+0.049
oster+0.045
ers+0.044
Elsa+0.043
Finally+0.041
Neg logit contributions
 Sus-0.044
 Suz-0.044
 Joe-0.040
 microsc-0.039
 Su-0.038
Token the
Token ID262
Feature activation-8.94e-01

Pos logit contributions
 Suz+0.049
 Sus+0.048
arter+0.046
 Nice+0.046
 microsc+0.045
Neg logit contributions
ars-0.025
ers-0.023
oster-0.021
 End-0.020
Elsa-0.019
Token clouds
Token ID15114
Feature activation+2.71e-01

Pos logit contributions
ars+0.020
oster+0.019
ers+0.018
Elsa+0.017
bes+0.016
Neg logit contributions
 Sus-0.025
 Suz-0.025
 microsc-0.023
 Joe-0.023
 Su-0.023
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 Suz+0.007
 Sus+0.007
 Joe+0.007
 Su+0.006
 microsc+0.006
Neg logit contributions
ars-0.007
oster-0.006
ers-0.006
Elsa-0.006
Finally-0.005
Token Sometimes
Token ID8975
Feature activation+4.37e-01

Pos logit contributions
 Suz+0.048
arter+0.048
 Sus+0.047
 microsc+0.047
yt+0.046
Neg logit contributions
ars-0.026
oster-0.022
Finally-0.022
ers-0.022
Eventually-0.020
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
 Suz+0.014
arter+0.013
 Sus+0.013
 microsc+0.013
 Su+0.012
Neg logit contributions
ars-0.008
ers-0.007
oster-0.006
"-0.006
'-0.006
Token he
Token ID339
Feature activation+9.01e-01

Pos logit contributions
 Suz+0.044
arter+0.043
 Sus+0.042
 microsc+0.040
 Nice+0.039
Neg logit contributions
ars-0.038
ers-0.035
oster-0.033
There-0.031
 End-0.031
Token would
Token ID561
Feature activation-1.08e+00

Pos logit contributions
 Suz+0.022
 Sus+0.022
 microsc+0.020
arter+0.020
 Su+0.020
Neg logit contributions
ars-0.024
oster-0.022
ers-0.021
Elsa-0.020
ze-0.019
Token see
Token ID766
Feature activation+2.78e+00

Pos logit contributions
ars+0.030
oster+0.028
ers+0.028
Elsa+0.026
Finally+0.024
Neg logit contributions
 Suz-0.025
 Sus-0.025
 Joe-0.022
 Su-0.022
 microsc-0.022
Token a
Token ID257
Feature activation+1.41e+00

Pos logit contributions
 Suz+0.091
 Sus+0.089
arter+0.087
 Su+0.083
 microsc+0.082
Neg logit contributions
ars-0.043
ers-0.041
oster-0.036
ze-0.035
The-0.031
Token bird
Token ID6512
Feature activation+8.75e-01

Pos logit contributions
 Suz+0.037
 Sus+0.037
 Joe+0.033
 Su+0.033
 microsc+0.033
Neg logit contributions
ars-0.034
oster-0.032
ers-0.030
Elsa-0.029
ze-0.027
Token fly
Token ID6129
Feature activation+2.69e-01

Pos logit contributions
 Suz+0.022
 Sus+0.021
arter+0.019
 Joe+0.019
 Su+0.019
Neg logit contributions
ars-0.023
oster-0.021
ers-0.020
Elsa-0.020
Finally-0.019
Token light
Token ID1657
Feature activation+1.09e+00

Pos logit contributions
ars+0.009
oster+0.009
ers+0.008
Elsa+0.008
jected+0.008
Neg logit contributions
 Sus-0.010
 Suz-0.010
 Joe-0.009
 microsc-0.009
 Su-0.008
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
 Suz+0.030
 Sus+0.030
 Joe+0.027
 Su+0.027
 microsc+0.027
Neg logit contributions
ars-0.026
oster-0.024
ers-0.022
Elsa-0.021
Finally-0.020
Token She
Token ID1375
Feature activation+1.88e-01

Pos logit contributions
 Sus+0.007
 Suz+0.007
 Joe+0.006
 Su+0.006
 M+0.006
Neg logit contributions
ars-0.007
oster-0.007
ers-0.007
ze-0.006
Elsa-0.006
Token eagerly
Token ID30130
Feature activation-6.91e-01

Pos logit contributions
 Suz+0.005
 Sus+0.005
 Joe+0.004
 Su+0.004
 microsc+0.004
Neg logit contributions
ars-0.005
oster-0.005
ers-0.004
Elsa-0.004
Finally-0.004
Token helped
Token ID4193
Feature activation+2.88e-01

Pos logit contributions
ars+0.018
oster+0.017
ers+0.016
Elsa+0.016
Finally+0.015
Neg logit contributions
 Suz-0.017
 Sus-0.017
 Joe-0.015
 Su-0.015
 microsc-0.015
Token light
Token ID1657
Feature activation+2.13e+00

Pos logit contributions
 Sus+0.007
 Suz+0.007
 Joe+0.006
 M+0.006
 Su+0.006
Neg logit contributions
ars-0.009
oster-0.008
jected-0.008
ers-0.007
Elsa-0.007
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
 Suz+0.056
 Sus+0.056
 Joe+0.050
 Su+0.049
 microsc+0.048
Neg logit contributions
ars-0.056
oster-0.053
ers-0.048
Elsa-0.048
Finally-0.046
Token candle
Token ID26839
Feature activation+1.18e-01

Pos logit contributions
ars+0.038
oster+0.036
ers+0.033
Finally+0.033
Elsa+0.033
Neg logit contributions
 Sus-0.030
 Suz-0.028
 microsc-0.027
 M-0.026
 J-0.024
Token,
Token ID11
Feature activation+3.20e-01

Pos logit contributions
 Suz+0.003
 Sus+0.003
 Joe+0.003
 Su+0.003
 M+0.003
Neg logit contributions
ars-0.003
oster-0.003
Elsa-0.003
ers-0.002
Finally-0.002
Token and
Token ID290
Feature activation-1.53e+00

Pos logit contributions
 Suz+0.009
 Sus+0.009
 Joe+0.008
 Su+0.008
 microsc+0.008
Neg logit contributions
ars-0.008
oster-0.007
ers-0.007
Elsa-0.007
Finally-0.006
Token everyone
Token ID2506
Feature activation+3.00e+00

Pos logit contributions
ars+0.039
oster+0.036
ers+0.034
Elsa+0.033
Finally+0.030
Neg logit contributions
 Suz-0.041
 Sus-0.041
 Joe-0.037
 Su-0.036
 microsc-0.036
Token stepped
Token ID10764
Feature activation+1.12e-01

Pos logit contributions
 Suz+0.043
 Sus+0.042
 Joe+0.038
 Su+0.037
 microsc+0.036
Neg logit contributions
ars-0.057
oster-0.053
ers-0.051
Elsa-0.051
Finally-0.047
Token back
Token ID736
Feature activation+1.84e+00

Pos logit contributions
 Suz+0.003
 Sus+0.003
 Joe+0.003
 Su+0.003
 microsc+0.003
Neg logit contributions
ars-0.003
oster-0.003
ers-0.002
Elsa-0.002
Finally-0.002
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
 Suz+0.064
 Sus+0.064
arter+0.061
 microsc+0.060
 Su+0.059
Neg logit contributions
ars-0.026
ers-0.023
oster-0.020
Elsa-0.018
'-0.017
Token said
Token ID531
Feature activation-1.63e+00

Pos logit contributions
ars+0.034
oster+0.031
ers+0.030
Elsa+0.029
ze+0.027
Neg logit contributions
 Suz-0.034
 Sus-0.034
arter-0.031
 microsc-0.031
 Su-0.031
Token nothing
Token ID2147
Feature activation+2.54e+00

Pos logit contributions
ars+0.036
oster+0.033
ers+0.032
Elsa+0.029
ze+0.028
Neg logit contributions
 Suz-0.046
 Sus-0.046
 microsc-0.042
arter-0.042
 Su-0.042
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
 Suz+0.070
 Sus+0.070
 Joe+0.063
 Su+0.063
 microsc+0.063
Neg logit contributions
ars-0.060
oster-0.055
ers-0.053
Elsa-0.050
Finally-0.046
Token\n
Token ID198
Feature activation+1.54e+00

Pos logit contributions
 Suz+0.026
 Sus+0.026
 Joe+0.023
 Su+0.023
 microsc+0.023
Neg logit contributions
ars-0.030
oster-0.027
ers-0.027
Elsa-0.025
ze-0.024
Token\n
Token ID198
Feature activation+1.26e+00

Pos logit contributions
 Suz+0.040
 Sus+0.039
 Joe+0.036
 Su+0.036
 M+0.035
Neg logit contributions
ars-0.038
oster-0.036
ers-0.035
ze-0.034
jected-0.033
TokenThe
Token ID464
Feature activation-1.87e+00

Pos logit contributions
 Suz+0.038
 Sus+0.037
 Jill+0.035
 Su+0.035
 Joe+0.035
Neg logit contributions
ars-0.020
ers-0.019
oster-0.018
ze-0.017
bes-0.017
Token little
Token ID1310
Feature activation-1.88e+00

Pos logit contributions
ars+0.054
oster+0.049
ers+0.047
Elsa+0.044
Finally+0.043
Neg logit contributions
 Sus-0.043
 Suz-0.042
 microsc-0.038
 Joe-0.037
arter-0.037
Token girl
Token ID2576
Feature activation-6.43e-02

Pos logit contributions
ars+0.042
oster+0.038
ers+0.036
Elsa+0.034
Finally+0.032
Neg logit contributions
 Suz-0.055
 Sus-0.055
 Joe-0.050
 Su-0.049
 microsc-0.049
Token tells
Token ID4952
Feature activation-1.66e-01

Pos logit contributions
 Suz+0.006
 Sus+0.006
 Joe+0.005
 Su+0.005
 microsc+0.005
Neg logit contributions
ars-0.006
oster-0.005
ers-0.005
Elsa-0.005
Finally-0.005
Token his
Token ID465
Feature activation-5.22e-01

Pos logit contributions
ars+0.005
ers+0.005
oster+0.005
Elsa+0.005
ze+0.005
Neg logit contributions
 Suz-0.003
 Sus-0.003
 microsc-0.003
arter-0.003
 Su-0.003
Token mom
Token ID1995
Feature activation+1.06e+00

Pos logit contributions
ars+0.013
oster+0.012
ers+0.011
Elsa+0.011
ze+0.010
Neg logit contributions
 Suz-0.014
 Sus-0.014
 Su-0.012
 Joe-0.012
 microsc-0.012
Token about
Token ID546
Feature activation-3.05e-01

Pos logit contributions
 Sus+0.024
 Suz+0.023
 Joe+0.021
 Su+0.021
 microsc+0.020
Neg logit contributions
ars-0.032
oster-0.030
Elsa-0.028
ers-0.027
Finally-0.027
Token his
Token ID465
Feature activation+2.48e-01

Pos logit contributions
ars+0.008
ers+0.008
oster+0.008
Elsa+0.007
ze+0.007
Neg logit contributions
 Suz-0.007
 Sus-0.006
arter-0.006
 Su-0.006
 microsc-0.006
Token broken
Token ID5445
Feature activation+7.75e-01

Pos logit contributions
 Sus+0.005
 Suz+0.005
 microsc+0.005
 Joe+0.005
 Su+0.005
Neg logit contributions
ars-0.007
oster-0.007
ers-0.006
Elsa-0.006
Finally-0.006
Token math
Token ID10688
Feature activation-1.96e+00

Pos logit contributions
 Suz+0.020
 Sus+0.020
 microsc+0.018
 Joe+0.018
 Su+0.018
Neg logit contributions
ars-0.020
oster-0.018
ers-0.017
Elsa-0.016
Finally-0.016
Token book
Token ID1492
Feature activation-2.10e+00

Pos logit contributions
ars+0.047
oster+0.045
ers+0.041
Elsa+0.041
Finally+0.039
Neg logit contributions
 Sus-0.050
 Suz-0.049
 microsc-0.046
 Joe-0.045
 Su-0.044
Token.
Token ID13
Feature activation+7.89e-01

Pos logit contributions
ars+0.049
oster+0.047
Elsa+0.043
ers+0.042
Finally+0.040
Neg logit contributions
 Sus-0.056
 Suz-0.055
 Joe-0.050
 microsc-0.049
 Su-0.049
Token His
Token ID2399
Feature activation+6.98e-01

Pos logit contributions
 Suz+0.017
 Sus+0.016
arter+0.016
 microsc+0.016
 Su+0.015
Neg logit contributions
ars-0.020
oster-0.020
ers-0.019
Elsa-0.018
Finally-0.018
Token mom
Token ID1995
Feature activation+1.49e+00

Pos logit contributions
 Suz+0.017
 Sus+0.017
 Joe+0.015
 microsc+0.015
 Su+0.015
Neg logit contributions
ars-0.019
oster-0.018
ers-0.017
Elsa-0.016
jected-0.015
Token to
Token ID284
Feature activation+2.94e-01

Pos logit contributions
ars+0.031
ers+0.029
oster+0.028
Elsa+0.026
ze+0.025
Neg logit contributions
 Suz-0.029
 Sus-0.028
 microsc-0.026
arter-0.025
 Su-0.025
Token know
Token ID760
Feature activation+1.56e+00

Pos logit contributions
 Suz+0.007
 Sus+0.007
 Joe+0.007
 Su+0.006
 microsc+0.006
Neg logit contributions
ars-0.008
oster-0.007
ers-0.007
Elsa-0.007
ze-0.006
Token how
Token ID703
Feature activation+7.93e-01

Pos logit contributions
 Suz+0.040
 Sus+0.039
 Joe+0.035
 microsc+0.035
 Su+0.035
Neg logit contributions
ars-0.040
oster-0.036
ers-0.036
Elsa-0.033
ze-0.032
Token much
Token ID881
Feature activation+1.11e+00

Pos logit contributions
 Suz+0.018
 Sus+0.018
 Joe+0.016
 microsc+0.016
 Su+0.016
Neg logit contributions
ars-0.022
oster-0.021
ers-0.020
Elsa-0.019
ze-0.018
Token they
Token ID484
Feature activation+4.43e-01

Pos logit contributions
 Suz+0.033
 Sus+0.032
 microsc+0.030
arter+0.030
 Su+0.029
Neg logit contributions
ars-0.024
ers-0.021
oster-0.021
Elsa-0.019
Finally-0.018
Token weigh
Token ID10164
Feature activation+1.69e+00

Pos logit contributions
 Suz+0.010
 Sus+0.010
 Joe+0.009
 microsc+0.009
 Su+0.008
Neg logit contributions
ars-0.012
oster-0.012
ers-0.011
Elsa-0.011
ze-0.010
Token before
Token ID878
Feature activation-1.15e+00

Pos logit contributions
 Sus+0.046
 Suz+0.046
 Joe+0.042
 Su+0.041
 microsc+0.041
Neg logit contributions
ars-0.040
oster-0.037
ers-0.034
Elsa-0.034
Finally-0.032
Token we
Token ID356
Feature activation-3.40e-01

Pos logit contributions
ars+0.026
oster+0.024
ers+0.023
Elsa+0.022
Finally+0.020
Neg logit contributions
 Suz-0.033
 Sus-0.033
 Joe-0.030
 Su-0.029
 microsc-0.029
Token make
Token ID787
Feature activation-7.53e-01

Pos logit contributions
ars+0.009
oster+0.009
ers+0.009
Elsa+0.008
ze+0.008
Neg logit contributions
 Suz-0.008
 Sus-0.008
 Joe-0.007
 microsc-0.007
 Su-0.007
Token apple
Token ID17180
Feature activation-2.48e+00

Pos logit contributions
ars+0.019
oster+0.018
ers+0.016
Elsa+0.016
jected+0.016
Neg logit contributions
 Sus-0.020
 Suz-0.019
 Joe-0.018
 Su-0.017
 Joey-0.017
Token pie
Token ID2508
Feature activation-1.04e+00

Pos logit contributions
ars+0.062
oster+0.059
Elsa+0.055
ers+0.053
jected+0.052
Neg logit contributions
 Suz-0.063
 Sus-0.062
 Joe-0.057
 Su-0.055
arter-0.055
Token found
Token ID1043
Feature activation+2.10e+00

Pos logit contributions
ars+0.025
oster+0.023
ers+0.023
Elsa+0.022
ze+0.020
Neg logit contributions
 Suz-0.021
 Sus-0.020
 Joe-0.018
 Su-0.018
 microsc-0.018
Token a
Token ID257
Feature activation+1.84e+00

Pos logit contributions
 Suz+0.063
 Sus+0.062
 Joe+0.057
 Su+0.056
 microsc+0.056
Neg logit contributions
ars-0.046
oster-0.042
ers-0.039
Elsa-0.038
Finally-0.035
Token pretty
Token ID2495
Feature activation+1.48e+00

Pos logit contributions
 Suz+0.049
 Sus+0.048
 Joe+0.043
 Su+0.043
 microsc+0.043
Neg logit contributions
ars-0.045
oster-0.042
ers-0.040
Elsa-0.038
ze-0.036
Token flower
Token ID15061
Feature activation-1.97e-01

Pos logit contributions
 Suz+0.038
 Sus+0.038
 Joe+0.034
 Su+0.034
 microsc+0.034
Neg logit contributions
ars-0.037
oster-0.034
ers-0.033
Elsa-0.031
ze-0.029
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
ars+0.005
oster+0.004
ers+0.004
Elsa+0.004
Finally+0.004
Neg logit contributions
 Suz-0.005
 Sus-0.005
 Joe-0.005
 microsc-0.005
 Su-0.005
Token The
Token ID383
Feature activation-7.51e-01

Pos logit contributions
 Suz+0.023
 Sus+0.023
 Joe+0.021
 Su+0.020
 microsc+0.020
Neg logit contributions
ars-0.031
oster-0.028
ers-0.028
Elsa-0.026
ze-0.025
Token bear
Token ID6842
Feature activation+1.15e+00

Pos logit contributions
ars+0.022
ers+0.019
oster+0.019
Finally+0.018
Eventually+0.018
Neg logit contributions
 Suz-0.016
 Sus-0.016
 M-0.015
 microsc-0.015
arter-0.014
Token decided
Token ID3066
Feature activation-1.03e+00

Pos logit contributions
 Suz+0.027
 Sus+0.026
 Joe+0.024
 M+0.023
 Su+0.023
Neg logit contributions
ars-0.033
oster-0.031
ers-0.029
Elsa-0.029
Finally-0.027
Token to
Token ID284
Feature activation-1.16e+00

Pos logit contributions
ars+0.034
oster+0.032
ers+0.030
Elsa+0.030
Finally+0.028
Neg logit contributions
 Sus-0.020
 Suz-0.020
 Joe-0.018
 Su-0.017
 microsc-0.017
Token give
Token ID1577
Feature activation+7.77e-01

Pos logit contributions
ars+0.032
oster+0.029
ers+0.029
Elsa+0.027
Finally+0.026
Neg logit contributions
 Suz-0.028
 Sus-0.028
 Joe-0.025
 Su-0.024
 microsc-0.024
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
 Sus+0.019
 Suz+0.019
 Joe+0.017
 M+0.017
 Joey+0.016
Neg logit contributions
ars-0.023
oster-0.022
Finally-0.020
ers-0.019
bes-0.019
Token the
Token ID262
Feature activation-1.63e+00

Pos logit contributions
ars+0.032
oster+0.029
ers+0.029
Elsa+0.027
ze+0.026
Neg logit contributions
 Suz-0.037
 Sus-0.036
arter-0.033
 microsc-0.033
 Su-0.033
Token other
Token ID584
Feature activation-4.81e-01

Pos logit contributions
ars+0.041
oster+0.038
ers+0.036
Elsa+0.035
Finally+0.034
Neg logit contributions
 Suz-0.042
 Sus-0.042
 Joe-0.038
 microsc-0.037
 Su-0.037
Token trees
Token ID7150
Feature activation+4.36e-01

Pos logit contributions
ars+0.011
oster+0.010
ers+0.009
Elsa+0.009
Finally+0.008
Neg logit contributions
 Suz-0.014
 Sus-0.014
 Joe-0.013
 Su-0.013
 microsc-0.013
Token.
Token ID13
Feature activation+9.93e-01

Pos logit contributions
 Suz+0.011
 Sus+0.011
 Joe+0.010
 Su+0.009
 microsc+0.009
Neg logit contributions
ars-0.011
oster-0.011
ers-0.010
Elsa-0.010
Finally-0.009
Token They
Token ID1119
Feature activation-2.47e-01

Pos logit contributions
 Suz+0.024
 Sus+0.024
 Joe+0.021
 Su+0.021
 microsc+0.021
Neg logit contributions
ars-0.027
oster-0.025
ers-0.024
Elsa-0.023
ze-0.021
Token felt
Token ID2936
Feature activation-4.41e-01

Pos logit contributions
ars+0.007
oster+0.006
ers+0.006
Elsa+0.006
ze+0.006
Neg logit contributions
 Suz-0.006
 Sus-0.006
 Joe-0.005
 microsc-0.005
 Su-0.005
Token happy
Token ID3772
Feature activation-2.50e+00

Pos logit contributions
ars+0.012
oster+0.011
ers+0.010
Elsa+0.010
bes+0.009
Neg logit contributions
 Sus-0.011
 Suz-0.011
 Joe-0.010
 Su-0.010
 Joey-0.010
Token and
Token ID290
Feature activation-3.34e+00

Pos logit contributions
ars+0.050
oster+0.044
ers+0.044
Elsa+0.039
ze+0.037
Neg logit contributions
 Suz-0.078
 Sus-0.077
 Su-0.071
 microsc-0.071
arter-0.070
Token proud
Token ID6613
Feature activation-1.25e+00

Pos logit contributions
ars+0.076
oster+0.069
ers+0.067
Elsa+0.062
ze+0.058
Neg logit contributions
 Suz-0.095
 Sus-0.094
 Joe-0.086
 microsc-0.085
 Su-0.085
Token.
Token ID13
Feature activation+5.64e-01

Pos logit contributions
ars+0.024
ers+0.023
oster+0.021
Elsa+0.018
ze+0.018
Neg logit contributions
 Suz-0.038
 Sus-0.037
 microsc-0.035
 Su-0.035
arter-0.035
Token They
Token ID1119
Feature activation-7.09e-01

Pos logit contributions
 Suz+0.016
 Sus+0.015
arter+0.015
 microsc+0.015
 Su+0.014
Neg logit contributions
ars-0.012
oster-0.011
Elsa-0.010
ers-0.010
Finally-0.010
Token wanted
Token ID2227
Feature activation-2.92e-01

Pos logit contributions
ars+0.009
oster+0.009
ers+0.008
Elsa+0.008
ze+0.008
Neg logit contributions
 Suz-0.009
 Sus-0.009
 microsc-0.008
 Joe-0.008
 Su-0.008
Token to
Token ID284
Feature activation-8.80e-01

Pos logit contributions
ars+0.007
ers+0.006
oster+0.006
Elsa+0.006
ze+0.005
Neg logit contributions
 Suz-0.008
 Sus-0.008
arter-0.007
yt-0.007
 Su-0.007
Token swing
Token ID9628
Feature activation-1.11e+00

Pos logit contributions
ars+0.020
oster+0.018
ers+0.017
Elsa+0.017
ze+0.017
Neg logit contributions
 Sus-0.024
 Suz-0.024
 microsc-0.023
arter-0.022
 Su-0.022
Token higher
Token ID2440
Feature activation-1.02e+00

Pos logit contributions
ars+0.024
oster+0.022
ers+0.021
Elsa+0.020
ze+0.018
Neg logit contributions
 Suz-0.032
 Sus-0.032
 Joe-0.029
 microsc-0.029
 Su-0.029
Token and
Token ID290
Feature activation-1.38e+00

Pos logit contributions
ars+0.022
oster+0.021
ers+0.020
Elsa+0.019
Finally+0.017
Neg logit contributions
 Suz-0.030
 Sus-0.030
 Joe-0.027
 Su-0.027
 microsc-0.027
Token higher
Token ID2440
Feature activation-7.12e-01

Pos logit contributions
ars+0.029
oster+0.027
ers+0.026
Elsa+0.024
ze+0.022
Neg logit contributions
 Suz-0.041
 Sus-0.041
 Joe-0.038
 Su-0.037
 Jill-0.037
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
ars+0.016
oster+0.015
ers+0.014
Elsa+0.013
ze+0.013
Neg logit contributions
 Suz-0.020
 Sus-0.020
 Joe-0.018
 microsc-0.018
 Su-0.018
Token He
Token ID679
Feature activation-4.23e-02

Pos logit contributions
 Suz+0.028
 Sus+0.028
 microsc+0.025
arter+0.025
 Su+0.024
Neg logit contributions
ars-0.035
oster-0.032
ers-0.031
Elsa-0.030
Finally-0.029
Token let
Token ID1309
Feature activation-1.80e+00

Pos logit contributions
ars+0.001
oster+0.001
ers+0.001
Elsa+0.001
Finally+0.001
Neg logit contributions
 Suz-0.001
 Sus-0.001
 Joe-0.001
 Su-0.001
 microsc-0.001
Token go
Token ID467
Feature activation-3.30e-01

Pos logit contributions
ars+0.049
oster+0.045
ers+0.043
Elsa+0.041
Finally+0.040
Neg logit contributions
 Suz-0.044
 Sus-0.043
 Joe-0.039
 Su-0.039
 microsc-0.038
Token of
Token ID286
Feature activation+6.24e-01

Pos logit contributions
ars+0.009
oster+0.008
ers+0.008
Elsa+0.007
Finally+0.007
Neg logit contributions
 Suz-0.008
 Sus-0.008
 microsc-0.007
 Joe-0.007
arter-0.007
Token time
Token ID640
Feature activation-9.08e-01

Pos logit contributions
ars+0.007
ers+0.007
oster+0.006
Elsa+0.006
ze+0.005
Neg logit contributions
 Suz-0.016
 Sus-0.015
 microsc-0.015
arter-0.015
 Nice-0.014
Token for
Token ID329
Feature activation-2.31e+00

Pos logit contributions
ars+0.023
ers+0.021
oster+0.020
Elsa+0.019
Finally+0.018
Neg logit contributions
 Suz-0.023
 Sus-0.022
 microsc-0.021
arter-0.020
 Su-0.020
Token bed
Token ID3996
Feature activation-1.83e+00

Pos logit contributions
ars+0.062
oster+0.057
ers+0.056
Elsa+0.053
ze+0.050
Neg logit contributions
 Suz-0.055
 Sus-0.054
 microsc-0.049
 Joe-0.049
 Su-0.048
Token.
Token ID13
Feature activation+9.32e-01

Pos logit contributions
ars+0.045
oster+0.042
ers+0.039
Elsa+0.038
Finally+0.036
Neg logit contributions
 Sus-0.049
 Suz-0.048
 Joe-0.044
 Su-0.044
 microsc-0.042
Token But
Token ID887
Feature activation+5.68e-01

Pos logit contributions
 Suz+0.022
 Sus+0.021
 microsc+0.019
 Joe+0.019
arter+0.019
Neg logit contributions
ars-0.026
oster-0.024
ers-0.023
Elsa-0.022
Finally-0.021
Token when
Token ID618
Feature activation+4.22e-01

Pos logit contributions
 Suz+0.015
 Sus+0.014
 microsc+0.013
arter+0.013
 Su+0.013
Neg logit contributions
ars-0.014
oster-0.013
ers-0.012
Elsa-0.012
ze-0.011
Token Alex
Token ID4422
Feature activation+1.21e-01

Pos logit contributions
 Suz+0.011
 Sus+0.011
arter+0.011
 microsc+0.011
 Su+0.010
Neg logit contributions
ars-0.009
oster-0.008
ers-0.008
Elsa-0.008
Finally-0.008
Token lay
Token ID3830
Feature activation-1.09e-01

Pos logit contributions
 Sus+0.002
 Suz+0.002
 Joe+0.002
 M+0.002
 Su+0.002
Neg logit contributions
ars-0.004
oster-0.003
ers-0.003
Finally-0.003
Elsa-0.003
Token down
Token ID866
Feature activation+8.80e-02

Pos logit contributions
ars+0.003
oster+0.003
ers+0.003
Elsa+0.003
Finally+0.003
Neg logit contributions
 Suz-0.003
 Sus-0.003
 Joe-0.002
 Su-0.002
 M-0.002
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
 Suz+0.003
 Sus+0.003
 microsc+0.002
arter+0.002
 Joe+0.002
Neg logit contributions
ars-0.002
ers-0.002
oster-0.002
Elsa-0.001
ze-0.001
Token he
Token ID339
Feature activation+1.97e-01

Pos logit contributions
 Suz+0.033
 Sus+0.032
arter+0.031
 microsc+0.029
 Su+0.029
Neg logit contributions
ars-0.036
ers-0.033
oster-0.032
Elsa-0.031
ze-0.030
Token a
Token ID257
Feature activation+1.38e+00

Pos logit contributions
ars+0.055
oster+0.049
ers+0.049
Elsa+0.045
ze+0.042
Neg logit contributions
 Suz-0.073
 Sus-0.072
arter-0.066
 microsc-0.066
 Joe-0.066
Token nice
Token ID3621
Feature activation-1.20e-01

Pos logit contributions
 Suz+0.037
 Sus+0.037
 Joe+0.034
 Su+0.033
 microsc+0.033
Neg logit contributions
ars-0.033
oster-0.030
ers-0.029
Elsa-0.028
Finally-0.027
Token bath
Token ID7837
Feature activation-2.31e+00

Pos logit contributions
ars+0.003
oster+0.003
ers+0.003
Elsa+0.003
jected+0.003
Neg logit contributions
 Suz-0.003
 Sus-0.003
 Joe-0.003
 Su-0.003
 microsc-0.003
Token and
Token ID290
Feature activation-2.63e+00

Pos logit contributions
ars+0.056
oster+0.051
Elsa+0.048
ers+0.047
Finally+0.046
Neg logit contributions
 Suz-0.062
 Sus-0.062
 Joe-0.056
 Su-0.055
 microsc-0.054
Token a
Token ID257
Feature activation+8.73e-01

Pos logit contributions
ars+0.055
ers+0.049
oster+0.049
Elsa+0.045
ze+0.042
Neg logit contributions
 Suz-0.078
 Sus-0.077
arter-0.071
 microsc-0.071
 Joe-0.070
Token nice
Token ID3621
Feature activation-2.74e-01

Pos logit contributions
 Suz+0.024
 Sus+0.024
 Joe+0.022
 Su+0.022
 microsc+0.022
Neg logit contributions
ars-0.020
oster-0.019
ers-0.018
Elsa-0.017
Finally-0.016
Token bed
Token ID3996
Feature activation-2.63e+00

Pos logit contributions
ars+0.007
oster+0.006
ers+0.006
Elsa+0.006
ze+0.006
Neg logit contributions
 Suz-0.007
 Sus-0.007
 microsc-0.006
 Joe-0.006
 Su-0.006
Tokentime
Token ID2435
Feature activation-3.38e+00

Pos logit contributions
ars+0.074
oster+0.070
Elsa+0.067
Finally+0.064
Eventually+0.062
Neg logit contributions
 Sus-0.059
 Suz-0.056
 Su-0.049
 Jill-0.049
 Joe-0.049
Token story
Token ID1621
Feature activation+3.45e-01

Pos logit contributions
ars+0.088
oster+0.081
ers+0.079
Elsa+0.075
Finally+0.070
Neg logit contributions
 Suz-0.085
 Sus-0.084
 Joe-0.076
 Su-0.075
 microsc-0.075
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
 Sus+0.007
 Suz+0.006
 Jill+0.006
 Jo+0.006
 Su+0.006
Neg logit contributions
ars-0.010
oster-0.010
Elsa-0.009
Finally-0.009
jected-0.009
Token They
Token ID1119
Feature activation-9.41e-01

Pos logit contributions
 Suz+0.009
 Sus+0.009
arter+0.008
 microsc+0.008
 Joe+0.008
Neg logit contributions
ars-0.006
Elsa-0.006
oster-0.006
ers-0.005
Finally-0.005
Token friends
Token ID2460
Feature activation+6.24e-02

Pos logit contributions
ars+0.048
ers+0.041
oster+0.040
Elsa+0.035
ze+0.035
Neg logit contributions
 Suz-0.070
 Sus-0.068
 Su-0.063
 microsc-0.063
arter-0.062
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
 Sus+0.001
 Suz+0.001
 Joe+0.001
 microsc+0.001
 Mike+0.001
Neg logit contributions
ars-0.002
oster-0.002
Elsa-0.001
ers-0.001
Finally-0.001
Token<|endoftext|>
Token ID50256
Feature activation-6.12e+00

Pos logit contributions
 Suz+0.015
 Sus+0.014
arter+0.014
 microsc+0.014
 Su+0.013
Neg logit contributions
ars-0.007
oster-0.006
Elsa-0.006
ers-0.006
Finally-0.006
TokenL
Token ID43
Feature activation-6.62e-01

Pos logit contributions
ars+0.160
oster+0.158
ze+0.140
Finally+0.138
ers+0.138
Neg logit contributions
 Sus-0.143
 Suz-0.143
 Joe-0.135
 M-0.131
 Mike-0.128
Tokenily
Token ID813
Feature activation-5.98e-01

Pos logit contributions
ars+0.016
oster+0.015
ers+0.014
Elsa+0.014
jected+0.013
Neg logit contributions
 Suz-0.018
 Sus-0.018
 Su-0.016
 Joe-0.016
 microsc-0.016
Token and
Token ID290
Feature activation-3.57e+00

Pos logit contributions
ars+0.014
oster+0.013
ers+0.012
Elsa+0.012
Finally+0.011
Neg logit contributions
 Sus-0.017
 Suz-0.016
 Joe-0.015
 Su-0.015
 Mike-0.015
Token Ben
Token ID3932
Feature activation-1.51e+00

Pos logit contributions
ars+0.103
oster+0.096
ers+0.094
Elsa+0.088
Finally+0.084
Neg logit contributions
 Suz-0.082
 Sus-0.082
 Joe-0.073
 Su-0.072
 microsc-0.071
Token were
Token ID547
Feature activation-1.27e+00

Pos logit contributions
ars+0.043
oster+0.040
ers+0.039
Elsa+0.037
Finally+0.035
Neg logit contributions
 Suz-0.034
 Sus-0.034
 Joe-0.030
 Su-0.030
 microsc-0.030
Token playing
Token ID2712
Feature activation-4.39e+00

Pos logit contributions
ars+0.036
oster+0.035
ers+0.033
Elsa+0.031
bes+0.030
Neg logit contributions
 Suz-0.029
 Sus-0.029
 Joe-0.026
 Su-0.025
arter-0.025
Token by
Token ID416
Feature activation+1.99e+00

Pos logit contributions
ars+0.121
oster+0.110
ers+0.104
Elsa+0.102
Finally+0.101
Neg logit contributions
 Suz-0.097
 Sus-0.097
 Joe-0.091
 M-0.090
 J-0.087
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
 Suz+0.057
 Sus+0.056
arter+0.055
 microsc+0.054
 Nice+0.053
Neg logit contributions
ars-0.039
ers-0.038
oster-0.035
Elsa-0.034
bes-0.033
Token's
Token ID338
Feature activation+3.79e-01

Pos logit contributions
 Sus+0.006
 Suz+0.006
 Joe+0.006
 microsc+0.006
 Su+0.006
Neg logit contributions
ars-0.004
oster-0.004
ers-0.004
Elsa-0.003
Finally-0.003
Token right
Token ID826
Feature activation+1.99e+00

Pos logit contributions
 Suz+0.010
 Sus+0.010
 Joe+0.010
 Su+0.009
 Joey+0.009
Neg logit contributions
ars-0.009
oster-0.009
ers-0.008
Elsa-0.008
ze-0.007
Token.
Token ID13
Feature activation+3.31e-01

Pos logit contributions
 Suz+0.056
 Sus+0.055
 microsc+0.051
arter+0.051
 Su+0.051
Neg logit contributions
ars-0.044
ers-0.041
oster-0.040
Elsa-0.036
ze-0.035
Token Now
Token ID2735
Feature activation-1.01e+00

Pos logit contributions
 Suz+0.009
 Sus+0.009
 Joe+0.008
 Su+0.008
 microsc+0.008
Neg logit contributions
ars-0.008
oster-0.008
ers-0.008
Elsa-0.007
ze-0.007
Token,
Token ID11
Feature activation+7.02e-01

Pos logit contributions
ars+0.029
oster+0.027
ers+0.026
Elsa+0.024
jected+0.023
Neg logit contributions
 Sus-0.025
 Suz-0.024
 Joe-0.022
 Su-0.021
 M-0.021
Token don
Token ID836
Feature activation-4.41e+00

Pos logit contributions
 Sus+0.017
 Suz+0.017
 Joe+0.015
 Su+0.015
 M+0.014
Neg logit contributions
ars-0.020
oster-0.019
ers-0.018
Elsa-0.017
bes-0.016
Token't
Token ID470
Feature activation+1.82e+00

Pos logit contributions
ars+0.094
oster+0.092
ers+0.084
Elsa+0.080
jected+0.071
Neg logit contributions
 Sus-0.130
 Suz-0.130
 Joe-0.123
 Su-0.121
 M-0.117
Token forget
Token ID6044
Feature activation+1.04e+00

Pos logit contributions
 Suz+0.045
 Sus+0.044
 microsc+0.042
 Su+0.041
arter+0.041
Neg logit contributions
ars-0.044
oster-0.040
ze-0.039
Elsa-0.038
ers-0.037
Token."
Token ID526
Feature activation-1.88e+00

Pos logit contributions
 Sus+0.023
 Suz+0.023
 Joe+0.021
 Su+0.020
 microsc+0.020
Neg logit contributions
ars-0.031
oster-0.029
ers-0.028
Elsa-0.027
jected-0.026
Token\n
Token ID198
Feature activation-7.51e-01

Pos logit contributions
ars+0.048
oster+0.045
ers+0.043
ze+0.039
jected+0.038
Neg logit contributions
 Sus-0.054
 Suz-0.053
 Joe-0.049
 Su-0.048
 M-0.047
Token\n
Token ID198
Feature activation-7.08e-01

Pos logit contributions
ars+0.020
oster+0.018
ers+0.018
jected+0.018
ze+0.017
Neg logit contributions
 Suz-0.019
 Sus-0.019
 Joe-0.018
 Su-0.017
 M-0.017
Token her
Token ID607
Feature activation+2.01e-01

Pos logit contributions
 Suz+0.039
 Sus+0.039
 microsc+0.035
 Su+0.035
 Joe+0.035
Neg logit contributions
ars-0.032
oster-0.029
ers-0.029
Elsa-0.027
ze-0.025
Token toy
Token ID13373
Feature activation-8.36e-01

Pos logit contributions
 Suz+0.006
 Sus+0.005
 Su+0.005
arter+0.005
 Joe+0.005
Neg logit contributions
ars-0.004
ers-0.004
oster-0.004
Elsa-0.004
ze-0.004
Token.
Token ID13
Feature activation-2.43e-01

Pos logit contributions
ars+0.019
oster+0.017
ers+0.017
Elsa+0.015
ze+0.014
Neg logit contributions
 Suz-0.024
 Sus-0.024
 Su-0.022
 Joe-0.022
 microsc-0.022
Token From
Token ID3574
Feature activation-1.23e+00

Pos logit contributions
ars+0.006
oster+0.005
ers+0.005
Elsa+0.005
ze+0.005
Neg logit contributions
 Suz-0.007
 Sus-0.007
 Joe-0.006
 Su-0.006
 microsc-0.006
Token that
Token ID326
Feature activation-6.69e-01

Pos logit contributions
ars+0.029
oster+0.027
ers+0.025
Elsa+0.024
ze+0.024
Neg logit contributions
 Sus-0.036
 Suz-0.035
 Joe-0.034
 Joey-0.032
 Su-0.032
Token day
Token ID1110
Feature activation-4.02e+00

Pos logit contributions
ars+0.020
oster+0.019
ers+0.019
Elsa+0.017
ze+0.017
Neg logit contributions
 Sus-0.015
 Suz-0.015
 Joe-0.013
 Su-0.012
 M-0.012
Token on
Token ID319
Feature activation-1.34e-01

Pos logit contributions
ars+0.084
oster+0.080
ers+0.076
Elsa+0.071
ze+0.066
Neg logit contributions
 Suz-0.118
 Sus-0.117
 Su-0.109
 Joe-0.108
 microsc-0.104
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
ars+0.003
oster+0.003
ers+0.003
Elsa+0.003
ze+0.003
Neg logit contributions
 Suz-0.004
 Sus-0.004
 Joe-0.003
 microsc-0.003
 Su-0.003
Token Z
Token ID1168
Feature activation+9.03e-01

Pos logit contributions
 Suz+0.026
 Sus+0.026
 Joe+0.023
 Su+0.023
 microsc+0.023
Neg logit contributions
ars-0.041
oster-0.038
ers-0.037
Elsa-0.035
ze-0.034
Tokenippy
Token ID41214
Feature activation+1.44e+00

Pos logit contributions
 Suz+0.029
 Sus+0.028
 Joe+0.026
 microsc+0.026
 Su+0.026
Neg logit contributions
ars-0.018
oster-0.016
ers-0.015
Elsa-0.014
Finally-0.013
Token made
Token ID925
Feature activation-6.04e-01

Pos logit contributions
 Suz+0.036
 Sus+0.035
 Joe+0.032
 Su+0.031
 microsc+0.031
Neg logit contributions
ars-0.037
oster-0.034
Elsa-0.033
ers-0.032
Finally-0.031
Token have
Token ID423
Feature activation-1.01e+00

Pos logit contributions
ars+0.017
oster+0.016
ers+0.016
Elsa+0.014
jected+0.014
Neg logit contributions
 Sus-0.016
 Suz-0.016
 Joe-0.014
 Su-0.014
 M-0.014
Token sweet
Token ID6029
Feature activation+5.46e-01

Pos logit contributions
ars+0.022
ers+0.019
oster+0.019
Elsa+0.018
ze+0.016
Neg logit contributions
 Suz-0.030
 Sus-0.029
arter-0.028
 microsc-0.027
 Su-0.027
Token dreams
Token ID10625
Feature activation-7.48e-01

Pos logit contributions
 Suz+0.015
 Sus+0.014
 Joe+0.013
 Su+0.013
 microsc+0.013
Neg logit contributions
ars-0.014
oster-0.013
ers-0.012
Elsa-0.011
Finally-0.011
Token.
Token ID13
Feature activation+8.50e-01

Pos logit contributions
ars+0.021
oster+0.021
Elsa+0.019
Finally+0.019
jected+0.018
Neg logit contributions
 Sus-0.016
 Suz-0.015
 Joe-0.014
 M-0.013
 Su-0.013
Token Good
Token ID4599
Feature activation-1.96e+00

Pos logit contributions
 Suz+0.023
 Sus+0.022
 microsc+0.020
 Su+0.020
arter+0.020
Neg logit contributions
ars-0.020
oster-0.018
Elsa-0.018
ers-0.018
Finally-0.016
Token night
Token ID1755
Feature activation-3.74e+00

Pos logit contributions
ars+0.054
oster+0.051
ers+0.049
Elsa+0.048
Finally+0.047
Neg logit contributions
 Suz-0.045
 Sus-0.044
 Joe-0.040
arter-0.039
 Su-0.039
Token,
Token ID11
Feature activation-2.18e-02

Pos logit contributions
ars+0.107
oster+0.102
ers+0.094
Elsa+0.093
Finally+0.091
Neg logit contributions
 Sus-0.084
 Suz-0.081
 Joe-0.075
 M-0.074
 Su-0.072
Token sweet
Token ID6029
Feature activation+3.72e-01

Pos logit contributions
ars+0.001
oster+0.001
ers+0.001
Elsa+0.001
ze+0.000
Neg logit contributions
 Suz-0.001
 Sus-0.000
 microsc-0.000
arter-0.000
 Joe-0.000
Tokenie
Token ID494
Feature activation+2.52e-01

Pos logit contributions
 Suz+0.008
 Sus+0.007
 Joe+0.007
 Su+0.006
arter+0.006
Neg logit contributions
ars-0.012
oster-0.011
ers-0.011
Elsa-0.010
Finally-0.010
Token.
Token ID13
Feature activation+5.01e-01

Pos logit contributions
 Sus+0.005
 Suz+0.005
 M+0.004
 Joe+0.004
 Su+0.004
Neg logit contributions
ars-0.008
Finally-0.007
Eventually-0.007
oster-0.007
Elsa-0.007
Token I
Token ID314
Feature activation-6.65e-01

Pos logit contributions
 Suz+0.013
 Sus+0.013
 microsc+0.012
 Su+0.012
arter+0.012
Neg logit contributions
ars-0.012
oster-0.011
Elsa-0.010
ers-0.010
Finally-0.009
Token make
Token ID787
Feature activation-1.29e+00

Pos logit contributions
 Suz+0.010
 Sus+0.010
 microsc+0.009
 Joe+0.009
 Su+0.009
Neg logit contributions
ars-0.010
oster-0.009
ers-0.009
Elsa-0.009
ze-0.008
Token it
Token ID340
Feature activation-7.90e-01

Pos logit contributions
ars+0.038
oster+0.035
jected+0.033
Elsa+0.031
Finally+0.030
Neg logit contributions
 Sus-0.033
 Suz-0.032
 M-0.029
 Joe-0.028
 Su-0.028
Token smell
Token ID8508
Feature activation+4.77e-01

Pos logit contributions
ars+0.019
oster+0.018
ers+0.018
Elsa+0.016
ze+0.016
Neg logit contributions
 Suz-0.020
 Sus-0.020
 microsc-0.018
 Su-0.018
 Joe-0.018
Token nice
Token ID3621
Feature activation-5.22e-01

Pos logit contributions
 Suz+0.013
 Sus+0.013
 Joe+0.012
 microsc+0.012
 Su+0.012
Neg logit contributions
ars-0.011
oster-0.010
ers-0.010
Elsa-0.009
ze-0.009
Token.
Token ID13
Feature activation-8.68e-02

Pos logit contributions
ars+0.011
ers+0.010
oster+0.010
Elsa+0.009
ze+0.009
Neg logit contributions
 Suz-0.015
 Sus-0.015
 Su-0.014
 microsc-0.014
 Joe-0.014
Token<|endoftext|>
Token ID50256
Feature activation-5.94e+00

Pos logit contributions
ars+0.002
oster+0.001
ers+0.001
Elsa+0.001
Finally+0.001
Neg logit contributions
 Suz-0.003
 Sus-0.003
 microsc-0.002
 Joe-0.002
arter-0.002
TokenOnce
Token ID7454
Feature activation+2.32e+00

Pos logit contributions
ars+0.141
oster+0.135
ers+0.120
Elsa+0.118
ze+0.115
Neg logit contributions
 Sus-0.161
 Suz-0.160
 Joe-0.149
 Su-0.145
 M-0.144
Token upon
Token ID2402
Feature activation-3.34e+00

Pos logit contributions
 Suz+0.046
 Sus+0.045
 microsc+0.042
arter+0.041
 Su+0.040
Neg logit contributions
ars-0.067
oster-0.060
ers-0.059
Elsa-0.058
 End-0.057
Token a
Token ID257
Feature activation-5.04e-01

Pos logit contributions
ars+0.090
oster+0.087
ers+0.079
Elsa+0.075
jected+0.074
Neg logit contributions
 Sus-0.079
 Suz-0.078
 Joe-0.072
 M-0.071
 Jill-0.070
Token time
Token ID640
Feature activation-7.07e-01

Pos logit contributions
ars+0.012
oster+0.011
ers+0.011
Elsa+0.010
ze+0.010
Neg logit contributions
 Suz-0.013
 Sus-0.013
 Joe-0.012
 Su-0.012
arter-0.012
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
ars+0.013
ers+0.011
oster+0.011
'+0.010
"+0.010
Neg logit contributions
 Suz-0.022
 Sus-0.022
 microsc-0.021
arter-0.021
 Su-0.020