TOP ACTIVATIONS
MAX = 4.591

ie . They both had birth days coming up , and
if she could go get an X - ray .
, " One , two , three !" As soon as
s mom explained that an X - ray was a
was too young to get an X - ray , but
. They had many toys , but their favorite was a
They had many number toys , like blocks , puzzles ,
them . They could count , add , and subtract with
. They had many toys , but their favorite was a
cards . They could count , add , and subtract very
. They had many toys , but their favorite was a
. They had many toys , but their favorite ones were
. " One , two , three , four ," she
. " One , two , three !" they shouted ,
steps . One , two , three - all the way
. They had many toys , but their favorite was a
to count and she counted to five . Then her head
blocks . One , two , three , four ... The
. They had many toys , but their favorite was a
so high . How did you learn to count so well

MIN ACTIVATIONS
MIN = -4.131

the mud all the time . <|endoftext|> Once upon a time
little girl and her family . <|endoftext|> Once upon a time
and grateful to her friend . From that day on ,
crawled out from under the leather and surprised Sammy . Sammy
first ." Tim my promised and quickly ran out of the
for her everywhere , but couldn 't find her . A
apple and her tum my couldn 't handle it . She
careful and sat on the c actus . They became good
you not to touch the c actus !" Tom
She hurt her head and couldn 't wake up . Lucy
't make you touch the c actus ," Sam said .
Sadly , Nut ty 's sickness got worse and he couldn
!" Mr . Fox smiled and said , " You 're
ear ." Fl oppy apologized and they continued to have a
y flew away , happy and clean . <|endoftext|> Once upon
happy and thanked Farmer Brown . They brought the hay back
, he crawled under the leather and hid .
help when he needed it . <|endoftext|> Once upon a time
not make Tom touch the c actus . He tried to
She hurt herself badly and couldn 't get out of the

INTERVAL 2.410 - 4.591
CONTAINS 0.287%

, there was a swing . A girl named Lily loved
They like to play outside . One day , they see
whatever you like ," his mom my said .
to count !" So , they started counting the days on
golf . He asked his dad , " Why do we

INTERVAL 0.230 - 2.410
CONTAINS 38.085%

Every day the pin would slide down the leaves . It
planting seeds in the ground . One day , he found
Lily nodded and said , " Yes , Mom ."
and Jill wanted to buy a present for their mom .
mom said , " I am measuring this box for a

INTERVAL -1.951 - 0.230
CONTAINS 60.649%

red balloon shrunk itself down to join the jog . All
He started picking some special oats . He picked and picked
day , they saw something shiny in the hole . They
do things we don 't want to do in order to
outside his room . He got out of bed to investigate

INTERVAL -4.131 - -1.951
CONTAINS 0.979%

don 't belong to you !" The thief was surprised and
pillar for the wonderful gift and said goodbye .
honest and very excited because it was a special day .
Sam says , " I don 't know . Maybe it
a bee flying near a da isy . He wanted to
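
NOTE ON THE INTERVALS
The four INTERVAL blocks above look like equal-width bins spanning MIN to MAX, with CONTAINS giving each bin's share of all activations. That reading is an assumption drawn from the numbers, not something the dump states. A minimal Python sketch to check it, using only values copied from this page (the helper name equal_width_edges is hypothetical):

```python
# Values copied from the MAX / MIN lines of the dump.
MIN_ACT, MAX_ACT = -4.131, 4.591

# (low, high, percent of activations) for each INTERVAL block, as listed.
intervals = [
    (2.410, 4.591, 0.287),
    (0.230, 2.410, 38.085),
    (-1.951, 0.230, 60.649),
    (-4.131, -1.951, 0.979),
]


def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Edges we would expect if the bins are equal-width (assumption).
expected_edges = equal_width_edges(MIN_ACT, MAX_ACT, len(intervals))

# Edges actually printed in the dump, deduplicated and sorted ascending.
listed_edges = sorted({edge for lo, hi, _ in intervals for edge in (lo, hi)})

# Largest gap between an expected edge and the printed one.
max_edge_error = max(abs(a - b) for a, b in zip(expected_edges, listed_edges))

# Sum of the CONTAINS percentages across all four bins.
total_percent = sum(pct for _, _, pct in intervals)

print("expected edges:", [round(e, 4) for e in expected_edges])
print("listed edges:  ", listed_edges)
print("max edge error:", max_edge_error)
print("total percent: ", total_percent)
```

If the assumption holds, the edge error stays within the three-decimal rounding of the printed boundaries and the percentages sum to about 100.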
Tokenie
Token ID494
Feature activation-2.16e-01

Pos logit contributions
 guess+0.031
 count+0.030
 ten+0.028
 imagine+0.028
 three+0.028
Neg logit contributions
actus-0.020
 therapist-0.020
 Midnight-0.020
iflower-0.018
 drowned-0.017
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 therapist+0.010
actus+0.010
 Midnight+0.010
 masse+0.009
lor+0.009
Neg logit contributions
 guess-0.008
 count-0.008
 three-0.008
 ten-0.008
 counted-0.007
Token They
Token ID1119
Feature activation+1.25e+00

Pos logit contributions
 guess+0.004
 count+0.004
 three+0.004
 ten+0.004
 imagine+0.003
Neg logit contributions
 therapist-0.005
 Midnight-0.005
actus-0.005
 masse-0.004
 Woo-0.004
Token both
Token ID1111
Feature activation+1.86e+00

Pos logit contributions
 guess+0.049
 count+0.047
 ten+0.043
ruits+0.041
 1+0.041
Neg logit contributions
 Midnight-0.056
actus-0.051
 therapist-0.051
 Woo-0.049
 drowned-0.048
Token had
Token ID550
Feature activation+3.90e-01

Pos logit contributions
 guess+0.065
 count+0.064
 ten+0.058
 whether+0.057
ptions+0.057
Neg logit contributions
 Midnight-0.083
 therapist-0.082
actus-0.080
 died-0.079
 drowned-0.079
Token birth
Token ID4082
Feature activation+4.33e+00

Pos logit contributions
 guess+0.012
 count+0.012
 imagine+0.010
 ten+0.010
 counted+0.010
Neg logit contributions
 Midnight-0.021
 therapist-0.020
actus-0.020
 Woo-0.019
 drowned-0.019
Tokendays
Token ID12545
Feature activation+1.49e+00

Pos logit contributions
 guess+0.164
 blond+0.161
 Many+0.161
 One+0.161
 Two+0.159
Neg logit contributions
ugg-0.170
eddy-0.167
uggle-0.153
eds-0.147
ucer-0.146
Token coming
Token ID2406
Feature activation+1.14e+00

Pos logit contributions
 guess+0.063
 count+0.060
 imagine+0.055
 ten+0.054
 blond+0.054
Neg logit contributions
 Midnight-0.059
 therapist-0.058
ugg-0.056
actus-0.055
 Woo-0.052
Token up
Token ID510
Feature activation+7.93e-01

Pos logit contributions
 guess+0.055
 count+0.055
 blond+0.052
 imagine+0.052
 counted+0.052
Neg logit contributions
 Midnight-0.036
 therapist-0.035
actus-0.033
 Woo-0.033
 masse-0.030
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
 count+0.033
 guess+0.033
 imagine+0.030
 counted+0.030
 three+0.029
Neg logit contributions
 therapist-0.031
 Midnight-0.031
actus-0.031
 Woo-0.028
 masse-0.028
Token and
Token ID290
Feature activation+3.58e-01

Pos logit contributions
 guess+0.046
 count+0.044
 ten+0.041
ff+0.040
 three+0.040
Neg logit contributions
 Midnight-0.060
 therapist-0.058
 Woo-0.054
actus-0.053
 masse-0.050
Token if
Token ID611
Feature activation-6.47e-01

Pos logit contributions
 blond+0.095
 blonde+0.093
 guess+0.091
 counted+0.084
 turrets+0.083
Neg logit contributions
 Midnight-0.097
 therapist-0.094
actus-0.086
iflower-0.080
 bottle-0.079
Token she
Token ID673
Feature activation+1.39e+00

Pos logit contributions
actus+0.022
 therapist+0.021
 Midnight+0.021
 masse+0.019
iflower+0.018
Neg logit contributions
 guess-0.034
 count-0.033
 three-0.032
 ten-0.031
 counted-0.030
Token could
Token ID714
Feature activation+1.25e+00

Pos logit contributions
 count+0.063
 ten+0.061
 guess+0.061
ptions+0.058
 three+0.058
Neg logit contributions
 therapist-0.050
actus-0.047
 Midnight-0.046
 died-0.042
 drowned-0.041
Token go
Token ID467
Feature activation+1.39e+00

Pos logit contributions
 guess+0.044
ptions+0.044
 ten+0.043
 three+0.043
 blond+0.042
Neg logit contributions
 therapist-0.052
 Midnight-0.050
actus-0.050
 masse-0.047
 died-0.045
Token get
Token ID651
Feature activation+7.43e-01

Pos logit contributions
 guess+0.061
 blond+0.060
 count+0.060
ruits+0.059
 counted+0.059
Neg logit contributions
actus-0.047
 therapist-0.047
 Midnight-0.044
 Woo-0.041
 masse-0.040
Token an
Token ID281
Feature activation+4.32e+00

Pos logit contributions
 guess+0.032
 count+0.032
 counted+0.029
 imagine+0.029
 ten+0.029
Neg logit contributions
 therapist-0.029
actus-0.028
 Midnight-0.027
iflower-0.025
 masse-0.025
Token X
Token ID1395
Feature activation+1.53e+00

Pos logit contributions
 blond+0.189
ptions+0.187
 blonde+0.187
 counted+0.174
ruits+0.174
Neg logit contributions
actus-0.139
iflower-0.133
 medicine-0.126
 Tooth-0.125
 band-0.113
Token-
Token ID12
Feature activation+2.12e+00

Pos logit contributions
 guess+0.081
 count+0.078
 One+0.075
 counted+0.075
 blond+0.075
Neg logit contributions
 therapist-0.045
actus-0.044
lor-0.041
iflower-0.040
 Midnight-0.037
Tokenray
Token ID2433
Feature activation+8.40e-01

Pos logit contributions
 guess+0.106
 count+0.103
 imagine+0.099
 blonde+0.096
 blond+0.096
Neg logit contributions
actus-0.068
 therapist-0.064
iflower-0.061
lor-0.059
ugg-0.055
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
 guess+0.038
 count+0.037
 ten+0.035
 three+0.034
 imagine+0.034
Neg logit contributions
 therapist-0.033
actus-0.031
 Midnight-0.029
 masse-0.028
iflower-0.027
Token
Token ID198
Feature activation-3.64e-01

Pos logit contributions
 therapist+0.007
actus+0.007
 Midnight+0.007
 masse+0.006
iflower+0.006
Neg logit contributions
 guess-0.006
 count-0.006
 three-0.006
 ten-0.006
 counted-0.005
Token,
Token ID11
Feature activation+1.81e+00

Pos logit contributions
 guess+0.110
 count+0.106
 imagine+0.103
ruits+0.102
adies+0.100
Neg logit contributions
 therapist-0.063
 Midnight-0.057
actus-0.054
ls-0.052
iflower-0.052
Token "
Token ID366
Feature activation+2.06e+00

Pos logit contributions
 guess+0.062
ruits+0.060
 count+0.059
 imagine+0.058
ff+0.056
Neg logit contributions
 therapist-0.081
 Midnight-0.076
 masse-0.070
actus-0.067
 Woo-0.066
TokenOne
Token ID3198
Feature activation+1.88e+00

Pos logit contributions
 guess+0.091
 count+0.090
 imagine+0.089
ruits+0.081
 blond+0.079
Neg logit contributions
 therapist-0.075
 masse-0.067
Now-0.066
 Midnight-0.063
lor-0.062
Token,
Token ID11
Feature activation+3.39e+00

Pos logit contributions
 guess+0.071
 blond+0.068
ptions+0.067
 imagine+0.066
 blonde+0.066
Neg logit contributions
 therapist-0.072
iflower-0.069
actus-0.069
 masse-0.064
 Midnight-0.063
Token two
Token ID734
Feature activation+2.31e+00

Pos logit contributions
illed+0.096
 blond+0.092
ff+0.092
ruits+0.091
 blonde+0.087
Neg logit contributions
 therapist-0.147
 masse-0.143
 band-0.143
 Midnight-0.140
 Woo-0.135
Token,
Token ID11
Feature activation+4.22e+00

Pos logit contributions
 guess+0.088
 count+0.080
 imagine+0.077
ruits+0.076
 One+0.076
Neg logit contributions
 therapist-0.091
 Midnight-0.090
actus-0.085
iflower-0.082
 masse-0.081
Token three
Token ID1115
Feature activation+2.22e+00

Pos logit contributions
illed+0.108
ff+0.094
ruits+0.093
 exception+0.091
adows+0.088
Neg logit contributions
 band-0.194
 masse-0.189
 Midnight-0.186
 therapist-0.185
 jar-0.183
Token!"
Token ID2474
Feature activation+2.20e-01

Pos logit contributions
 guess+0.096
 count+0.092
 imagine+0.088
ruits+0.084
 One+0.083
Neg logit contributions
 therapist-0.080
iflower-0.076
actus-0.075
 Midnight-0.072
ls-0.069
Token As
Token ID1081
Feature activation+1.62e-01

Pos logit contributions
 guess+0.008
 count+0.008
 ten+0.008
 three+0.007
 imagine+0.007
Neg logit contributions
 therapist-0.010
actus-0.010
 Midnight-0.010
 masse-0.009
iflower-0.009
Token soon
Token ID2582
Feature activation+2.83e-01

Pos logit contributions
 guess+0.008
 count+0.008
 ten+0.007
 three+0.007
 imagine+0.007
Neg logit contributions
 therapist-0.006
 Midnight-0.005
actus-0.005
 masse-0.005
iflower-0.005
Token as
Token ID355
Feature activation-1.85e-01

Pos logit contributions
 guess+0.009
 count+0.009
 ten+0.008
 three+0.008
 imagine+0.008
Neg logit contributions
 therapist-0.015
 Midnight-0.015
actus-0.015
 masse-0.013
 drowned-0.013
Token
Token ID8151
Feature activation-9.49e-03

Pos logit contributions
 guess+0.011
 count+0.011
 three+0.011
 ten+0.010
 imagine+0.010
Neg logit contributions
actus-0.012
 therapist-0.012
 Midnight-0.011
lor-0.010
iflower-0.010
Tokens
Token ID82
Feature activation+4.59e-01

Pos logit contributions
 therapist+0.000
actus+0.000
 Midnight+0.000
 masse+0.000
lor+0.000
Neg logit contributions
 guess-0.000
 count-0.000
 three-0.000
 ten-0.000
 imagine-0.000
Token mom
Token ID1995
Feature activation+8.29e-01

Pos logit contributions
 guess+0.021
 count+0.021
 ten+0.019
 three+0.019
 counted+0.019
Neg logit contributions
actus-0.017
 therapist-0.017
 Midnight-0.017
lor-0.015
 drowned-0.014
Token explained
Token ID4893
Feature activation+2.28e-02

Pos logit contributions
 count+0.037
 guess+0.036
 ten+0.036
 imagine+0.034
 three+0.034
Neg logit contributions
 therapist-0.031
 Midnight-0.030
actus-0.030
 died-0.027
 drowned-0.026
Token that
Token ID326
Feature activation+9.33e-01

Pos logit contributions
 guess+0.001
 count+0.001
 three+0.001
 ten+0.001
 counted+0.001
Neg logit contributions
 therapist-0.001
actus-0.001
 Midnight-0.001
 masse-0.001
iflower-0.001
Token an
Token ID281
Feature activation+4.20e+00

Pos logit contributions
 guess+0.047
 count+0.046
 ten+0.044
 imagine+0.043
 three+0.043
Neg logit contributions
 therapist-0.030
 Midnight-0.028
actus-0.028
iflower-0.025
 masse-0.025
Token X
Token ID1395
Feature activation+1.14e+00

Pos logit contributions
 yes+0.190
 blond+0.189
 guess+0.188
 blonde+0.186
ill+0.186
Neg logit contributions
actus-0.130
iflower-0.126
 Tooth-0.103
orn-0.102
 masse-0.101
Token-
Token ID12
Feature activation+2.24e+00

Pos logit contributions
 guess+0.055
 count+0.053
 ten+0.051
 three+0.051
 imagine+0.050
Neg logit contributions
 therapist-0.040
actus-0.040
iflower-0.036
lor-0.036
 Midnight-0.035
Tokenray
Token ID2433
Feature activation+1.07e+00

Pos logit contributions
 guess+0.123
 count+0.119
 imagine+0.115
 blond+0.109
 yes+0.109
Neg logit contributions
actus-0.064
 therapist-0.061
iflower-0.059
lor-0.058
ugg-0.049
Token was
Token ID373
Feature activation+8.90e-01

Pos logit contributions
 guess+0.050
 count+0.048
 ten+0.046
 three+0.045
 imagine+0.045
Neg logit contributions
 therapist-0.039
actus-0.038
 Midnight-0.034
iflower-0.034
lor-0.034
Token a
Token ID257
Feature activation+1.95e+00

Pos logit contributions
 guess+0.034
 count+0.033
 imagine+0.031
 three+0.030
 ten+0.030
Neg logit contributions
 therapist-0.039
 Midnight-0.038
actus-0.037
iflower-0.035
 masse-0.035
Token was
Token ID373
Feature activation+2.78e-01

Pos logit contributions
 guess+0.015
 count+0.015
 ten+0.014
 three+0.014
 imagine+0.013
Neg logit contributions
 therapist-0.016
actus-0.015
 Midnight-0.015
iflower-0.014
 masse-0.013
Token too
Token ID1165
Feature activation+8.59e-01

Pos logit contributions
 guess+0.010
 count+0.010
 ten+0.009
 imagine+0.009
 three+0.009
Neg logit contributions
 therapist-0.013
 Midnight-0.013
actus-0.013
 masse-0.012
iflower-0.012
Token young
Token ID1862
Feature activation-9.59e-01

Pos logit contributions
 count+0.037
 guess+0.035
 counted+0.033
 ten+0.033
 imagine+0.032
Neg logit contributions
 therapist-0.036
actus-0.035
 Midnight-0.033
 masse-0.033
 severe-0.033
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
actus+0.047
 therapist+0.046
 Midnight+0.044
lor+0.042
iflower+0.042
Neg logit contributions
 guess-0.036
 count-0.036
 three-0.034
 ten-0.033
 like-0.032
Token get
Token ID651
Feature activation+8.40e-02

Pos logit contributions
 ten+0.045
ruits+0.045
 three+0.044
ptions+0.044
 guess+0.044
Neg logit contributions
 therapist-0.060
 Midnight-0.059
 masse-0.057
actus-0.056
 drowned-0.055
Token an
Token ID281
Feature activation+4.18e+00

Pos logit contributions
 guess+0.004
 count+0.004
 three+0.003
 ten+0.003
 imagine+0.003
Neg logit contributions
 therapist-0.004
actus-0.003
 Midnight-0.003
 masse-0.003
iflower-0.003
Token X
Token ID1395
Feature activation+1.35e+00

Pos logit contributions
 blond+0.185
 blonde+0.184
ptions+0.180
 yes+0.177
aney+0.171
Neg logit contributions
actus-0.128
iflower-0.121
 Tooth-0.112
 medicine-0.112
 masse-0.107
Token-
Token ID12
Feature activation+1.85e+00

Pos logit contributions
 guess+0.071
 count+0.068
 One+0.065
 ten+0.065
 three+0.065
Neg logit contributions
 therapist-0.042
actus-0.041
lor-0.037
iflower-0.037
 Midnight-0.036
Tokenray
Token ID2433
Feature activation+8.15e-01

Pos logit contributions
 guess+0.095
 count+0.092
 imagine+0.089
 blond+0.085
 blonde+0.084
Neg logit contributions
actus-0.057
 therapist-0.056
iflower-0.052
lor-0.050
ugg-0.047
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
 guess+0.036
 count+0.035
 ten+0.033
 three+0.033
 imagine+0.032
Neg logit contributions
 therapist-0.032
actus-0.031
 Midnight-0.029
 masse-0.028
iflower-0.027
Token but
Token ID475
Feature activation+1.23e+00

Pos logit contributions
 count+0.052
 guess+0.049
 ten+0.047
 counted+0.045
ruits+0.044
Neg logit contributions
 therapist-0.081
 Midnight-0.080
 Woo-0.075
 masse-0.073
actus-0.072
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
 guess+0.017
 count+0.016
 three+0.015
 ten+0.015
 imagine+0.015
Neg logit contributions
 therapist-0.016
actus-0.016
 Midnight-0.015
 masse-0.014
iflower-0.013
Token They
Token ID1119
Feature activation+2.26e+00

Pos logit contributions
 count+0.041
 guess+0.039
 counted+0.035
 imagine+0.032
 ten+0.031
Neg logit contributions
 Midnight-0.105
 Woo-0.099
 therapist-0.098
actus-0.097
 Lucky-0.095
Token had
Token ID550
Feature activation+1.62e+00

Pos logit contributions
 guess+0.111
teen+0.107
adies+0.106
 Today+0.106
 count+0.103
Neg logit contributions
 Midnight-0.068
 therapist-0.057
 drowned-0.054
 died-0.054
 Woo-0.053
Token many
Token ID867
Feature activation+2.13e+00

Pos logit contributions
 guess+0.066
 count+0.064
ruits+0.059
 imagine+0.059
ff+0.058
Neg logit contributions
 Midnight-0.068
 Woo-0.063
 therapist-0.059
 drowned-0.056
 Velvet-0.056
Token toys
Token ID14958
Feature activation+5.46e-01

Pos logit contributions
 guess+0.112
 count+0.104
ill+0.103
 ten+0.102
adies+0.101
Neg logit contributions
actus-0.060
 Midnight-0.060
 band-0.055
 swollen-0.053
ls-0.053
Token,
Token ID11
Feature activation+4.14e+00

Pos logit contributions
 guess+0.021
 count+0.020
 three+0.018
 ten+0.018
 imagine+0.018
Neg logit contributions
 therapist-0.025
actus-0.025
 Midnight-0.025
ugg-0.022
 Woo-0.022
Token but
Token ID475
Feature activation+1.78e+00

Pos logit contributions
ff+0.185
 1+0.152
rr+0.151
illed+0.151
used+0.144
Neg logit contributions
 Midnight-0.168
 Woo-0.150
 Velvet-0.149
 band-0.145
 medicine-0.138
Token their
Token ID511
Feature activation+1.08e+00

Pos logit contributions
 count+0.085
 guess+0.083
ff+0.079
 ten+0.078
adows+0.077
Neg logit contributions
 Midnight-0.064
 therapist-0.061
 Woo-0.056
 Velvet-0.052
 Lucky-0.050
Token favorite
Token ID4004
Feature activation+1.01e+00

Pos logit contributions
 guess+0.034
 count+0.032
 counted+0.030
 ten+0.029
 imagine+0.028
Neg logit contributions
actus-0.056
 Midnight-0.055
 therapist-0.055
 drowned-0.049
 masse-0.049
Token was
Token ID373
Feature activation+7.23e-01

Pos logit contributions
 guess+0.051
 count+0.050
 ten+0.048
 three+0.047
 1+0.047
Neg logit contributions
 therapist-0.033
actus-0.032
 Midnight-0.032
 drowned-0.027
 died-0.026
Token a
Token ID257
Feature activation-2.13e-01

Pos logit contributions
 guess+0.028
 count+0.026
 imagine+0.025
 three+0.023
 counted+0.023
Neg logit contributions
 Midnight-0.034
 therapist-0.033
actus-0.031
 Woo-0.030
 masse-0.029
Token They
Token ID1119
Feature activation+2.04e+00

Pos logit contributions
 guess+0.039
 count+0.039
 counted+0.034
 imagine+0.031
 ten+0.030
Neg logit contributions
 Midnight-0.099
 Woo-0.092
actus-0.091
 therapist-0.090
 Lucky-0.089
Token had
Token ID550
Feature activation+2.08e+00

Pos logit contributions
 guess+0.085
 count+0.079
adies+0.078
 Today+0.076
 whether+0.075
Neg logit contributions
 Midnight-0.073
 therapist-0.069
actus-0.065
 drowned-0.063
 died-0.063
Token many
Token ID867
Feature activation+3.03e+00

Pos logit contributions
 guess+0.091
immer+0.089
ruits+0.086
 count+0.086
 imagine+0.086
Neg logit contributions
 Midnight-0.067
 Woo-0.063
 therapist-0.061
 band-0.057
 Velvet-0.056
Token number
Token ID1271
Feature activation+3.53e+00

Pos logit contributions
 guess+0.142
ill+0.140
 mate+0.140
 yes+0.133
adies+0.131
Neg logit contributions
iflower-0.082
ls-0.078
actus-0.076
 lands-0.075
 band-0.075
Token toys
Token ID14958
Feature activation+1.01e+00

Pos logit contributions
 guess+0.157
 imagine+0.144
 blond+0.138
 count+0.137
illed+0.135
Neg logit contributions
 therapist-0.108
ls-0.107
actus-0.100
 lands-0.098
ic-0.097
Token,
Token ID11
Feature activation+4.14e+00

Pos logit contributions
 guess+0.042
 count+0.040
 imagine+0.037
 three+0.037
 blond+0.036
Neg logit contributions
 therapist-0.042
actus-0.040
 Midnight-0.040
lor-0.035
 masse-0.035
Token like
Token ID588
Feature activation+2.61e+00

Pos logit contributions
ff+0.150
illed+0.141
adies+0.126
int+0.125
 Today+0.117
Neg logit contributions
 Midnight-0.159
 Velvet-0.150
 band-0.148
 therapist-0.146
 chip-0.144
Token blocks
Token ID7021
Feature activation+7.59e-01

Pos logit contributions
 guess+0.099
illed+0.092
ruits+0.091
 imagine+0.086
 count+0.085
Neg logit contributions
 Midnight-0.106
 therapist-0.104
 masse-0.097
 chip-0.093
 band-0.092
Token,
Token ID11
Feature activation+3.51e+00

Pos logit contributions
 guess+0.029
 count+0.027
 three+0.026
 imagine+0.025
 ten+0.025
Neg logit contributions
actus-0.034
 therapist-0.034
 Midnight-0.033
iflower-0.030
 masse-0.030
Token puzzles
Token ID24367
Feature activation-1.15e-01

Pos logit contributions
illed+0.166
ff+0.155
 guess+0.136
int+0.135
 exception+0.134
Neg logit contributions
 Midnight-0.112
 band-0.102
 Velvet-0.102
 sy-0.099
 chip-0.099
Token,
Token ID11
Feature activation+2.79e+00

Pos logit contributions
 therapist+0.006
actus+0.006
 Midnight+0.006
 masse+0.005
lor+0.005
Neg logit contributions
 guess-0.004
 count-0.004
 three-0.004
 ten-0.004
 imagine-0.003
Token them
Token ID606
Feature activation+1.72e+00

Pos logit contributions
 guess+0.075
 count+0.073
 imagine+0.070
 counted+0.068
 want+0.067
Neg logit contributions
 Midnight-0.049
 therapist-0.045
actus-0.043
iflower-0.042
 Woo-0.040
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
 guess+0.081
 count+0.077
 three+0.076
 ten+0.075
 blond+0.073
Neg logit contributions
 therapist-0.054
 Midnight-0.054
actus-0.053
 died-0.046
iflower-0.046
Token They
Token ID1119
Feature activation+2.20e+00

Pos logit contributions
 count+0.073
 guess+0.071
 counted+0.069
 imagine+0.069
 ten+0.067
Neg logit contributions
 Midnight-0.057
 Woo-0.052
actus-0.049
 therapist-0.048
 Velvet-0.048
Token could
Token ID714
Feature activation+2.12e+00

Pos logit contributions
 guess+0.086
 Today+0.083
ruits+0.082
 whether+0.082
illed+0.081
Neg logit contributions
 therapist-0.079
 Midnight-0.078
 died-0.076
 drowned-0.073
actus-0.071
Token count
Token ID954
Feature activation+2.89e+00

Pos logit contributions
illed+0.077
ruits+0.073
 blonde+0.070
ff+0.070
adows+0.069
Neg logit contributions
 Midnight-0.085
 therapist-0.083
 band-0.076
actus-0.075
 masse-0.074
Token,
Token ID11
Feature activation+4.08e+00

Pos logit contributions
 guess+0.143
 count+0.133
 imagine+0.131
 counted+0.129
illed+0.129
Neg logit contributions
 Midnight-0.076
ls-0.074
 therapist-0.073
actus-0.073
iflower-0.071
Token add
Token ID751
Feature activation+2.50e+00

Pos logit contributions
illed+0.148
ruits+0.125
ff+0.124
adows+0.119
 exception+0.116
Neg logit contributions
 Midnight-0.153
 therapist-0.147
 chip-0.146
 masse-0.146
 band-0.145
Token,
Token ID11
Feature activation+3.92e+00

Pos logit contributions
 guess+0.101
 imagine+0.094
 count+0.093
illed+0.090
 counted+0.088
Neg logit contributions
 therapist-0.090
 Midnight-0.090
actus-0.087
ls-0.084
iflower-0.082
Token and
Token ID290
Feature activation+3.06e+00

Pos logit contributions
illed+0.157
ruits+0.135
 guess+0.129
 exception+0.129
ff+0.129
Neg logit contributions
 band-0.129
 therapist-0.128
 Midnight-0.126
 masse-0.125
 sy-0.124
Token subtract
Token ID34128
Feature activation+2.56e+00

Pos logit contributions
illed+0.103
ruits+0.097
ff+0.091
adies+0.089
adows+0.086
Neg logit contributions
 Midnight-0.125
 therapist-0.125
 masse-0.118
 band-0.118
 disappears-0.112
Token with
Token ID351
Feature activation+2.21e+00

Pos logit contributions
 guess+0.133
 imagine+0.121
 blond+0.120
 count+0.119
 wonder+0.119
Neg logit contributions
 therapist-0.068
iflower-0.064
 Midnight-0.064
actus-0.063
ic-0.060
Token.
Token ID13
Feature activation+2.25e+00

Pos logit contributions
 guess+0.003
 count+0.003
 three+0.003
 ten+0.003
 imagine+0.003
Neg logit contributions
 therapist-0.003
actus-0.003
 Midnight-0.003
 masse-0.003
iflower-0.003
Token They
Token ID1119
Feature activation+2.02e+00

Pos logit contributions
 count+0.058
 guess+0.057
ptions+0.052
 counted+0.050
 yes+0.049
Neg logit contributions
 Woo-0.113
actus-0.110
 Midnight-0.109
 therapist-0.105
 Mitt-0.103
Token had
Token ID550
Feature activation+1.43e+00

Pos logit contributions
 guess+0.101
 count+0.095
teen+0.095
 whether+0.094
 Spanish+0.093
Neg logit contributions
 Midnight-0.059
 therapist-0.051
 drowned-0.048
 died-0.047
 Woo-0.047
Token many
Token ID867
Feature activation+1.82e+00

Pos logit contributions
 guess+0.057
 count+0.056
 imagine+0.052
ruits+0.051
ff+0.050
Neg logit contributions
 Midnight-0.061
 Woo-0.056
 therapist-0.055
actus-0.051
 Velvet-0.050
Token toys
Token ID14958
Feature activation+3.77e-01

Pos logit contributions
 guess+0.095
 count+0.088
 ten+0.086
ill+0.086
 1+0.084
Neg logit contributions
actus-0.056
 Midnight-0.054
 band-0.049
 therapist-0.048
ls-0.048
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 guess+0.014
 count+0.013
 three+0.012
 ten+0.012
 imagine+0.012
Neg logit contributions
 therapist-0.018
actus-0.018
 Midnight-0.017
 masse-0.016
 Woo-0.016
Token but
Token ID475
Feature activation+1.99e+00

Pos logit contributions
ff+0.182
 1+0.151
illed+0.149
rr+0.143
used+0.140
Neg logit contributions
 Midnight-0.159
 Velvet-0.147
 band-0.145
 Woo-0.142
 medicine-0.132
Token their
Token ID511
Feature activation+6.89e-01

Pos logit contributions
 count+0.097
 guess+0.096
ff+0.092
rr+0.091
adows+0.091
Neg logit contributions
 Midnight-0.058
 therapist-0.058
 Woo-0.054
 Velvet-0.049
 swollen-0.048
Token favorite
Token ID4004
Feature activation+8.61e-01

Pos logit contributions
 guess+0.022
 count+0.021
 counted+0.019
 ten+0.019
 three+0.018
Neg logit contributions
actus-0.036
 therapist-0.036
 Midnight-0.036
 masse-0.032
 drowned-0.032
Token was
Token ID373
Feature activation+6.40e-01

Pos logit contributions
 guess+0.043
 count+0.042
 ten+0.040
 three+0.039
 1+0.039
Neg logit contributions
 therapist-0.029
actus-0.029
 Midnight-0.028
 drowned-0.024
 died-0.023
Token a
Token ID257
Feature activation-4.17e-01

Pos logit contributions
 guess+0.025
 count+0.023
 imagine+0.022
 three+0.021
 ten+0.020
Neg logit contributions
 Midnight-0.030
 therapist-0.029
actus-0.028
 masse-0.026
 Woo-0.026
Token cards
Token ID4116
Feature activation+5.18e-02

Pos logit contributions
illed+0.157
ff+0.154
int+0.145
ruits+0.143
immer+0.142
Neg logit contributions
 Midnight-0.123
 band-0.113
 therapist-0.112
 masse-0.110
 medicine-0.108
Token.
Token ID13
Feature activation+1.31e+00

Pos logit contributions
 guess+0.002
 count+0.002
 three+0.002
 ten+0.002
 imagine+0.002
Neg logit contributions
 therapist-0.002
actus-0.002
 Midnight-0.002
 masse-0.002
lor-0.002
Token They
Token ID1119
Feature activation+2.74e+00

Pos logit contributions
 count+0.038
 guess+0.035
 counted+0.034
 ten+0.032
 imagine+0.032
Neg logit contributions
 Midnight-0.075
actus-0.070
 Woo-0.069
 therapist-0.069
 Lucky-0.066
Token could
Token ID714
Feature activation+2.33e+00

Pos logit contributions
adies+0.109
 guess+0.107
 Today+0.107
illed+0.104
ruits+0.102
Neg logit contributions
 Midnight-0.087
 therapist-0.087
 died-0.084
 drowned-0.080
 band-0.076
Token count
Token ID954
Feature activation+2.91e+00

Pos logit contributions
illed+0.083
ruits+0.077
ff+0.073
adies+0.072
adows+0.070
Neg logit contributions
 Midnight-0.100
 therapist-0.093
 band-0.087
 masse-0.086
actus-0.086
Token,
Token ID11
Feature activation+4.06e+00

Pos logit contributions
 guess+0.147
 count+0.136
 imagine+0.133
illed+0.132
 counted+0.132
Neg logit contributions
 Midnight-0.078
ls-0.076
actus-0.075
 therapist-0.073
iflower-0.071
Token add
Token ID751
Feature activation+2.66e+00

Pos logit contributions
illed+0.154
ff+0.133
ruits+0.128
adies+0.127
adows+0.125
Neg logit contributions
 Midnight-0.153
 chip-0.145
 therapist-0.143
 sy-0.141
 masse-0.140
Token,
Token ID11
Feature activation+3.83e+00

Pos logit contributions
 guess+0.109
 count+0.101
 imagine+0.101
illed+0.098
 counted+0.097
Neg logit contributions
 Midnight-0.096
 therapist-0.095
actus-0.093
ls-0.088
 masse-0.088
Token and
Token ID290
Feature activation+3.32e+00

Pos logit contributions
illed+0.162
ruits+0.139
ff+0.137
 guess+0.137
adows+0.133
Neg logit contributions
 Midnight-0.121
 chip-0.119
 therapist-0.118
 band-0.117
 sy-0.116
Token subtract
Token ID34128
Feature activation+2.26e+00

Pos logit contributions
illed+0.119
adies+0.111
ruits+0.110
ff+0.107
adows+0.103
Neg logit contributions
 Midnight-0.131
 therapist-0.128
 masse-0.120
 band-0.117
 medicine-0.111
Token very
Token ID845
Feature activation+1.40e+00

Pos logit contributions
 guess+0.114
 imagine+0.103
 count+0.102
 wonder+0.100
 blond+0.099
Neg logit contributions
 therapist-0.067
 Midnight-0.065
iflower-0.064
actus-0.064
ic-0.060
Token.
Token ID13
Feature activation+2.71e+00

Pos logit contributions
 guess+0.006
 count+0.006
 three+0.006
 ten+0.006
 imagine+0.006
Neg logit contributions
 therapist-0.006
actus-0.006
 Midnight-0.006
 masse-0.005
iflower-0.005
Token They
Token ID1119
Feature activation+1.99e+00

Pos logit contributions
 guess+0.084
 imagine+0.076
 count+0.074
ptions+0.071
 believe+0.067
Neg logit contributions
 Midnight-0.120
 Woo-0.120
 Cathy-0.113
actus-0.112
 Olive-0.112
Token had
Token ID550
Feature activation+1.60e+00

Pos logit contributions
 guess+0.096
 count+0.089
 whether+0.086
 Today+0.086
teen+0.086
Neg logit contributions
 Midnight-0.064
 therapist-0.052
 Woo-0.051
 drowned-0.051
 died-0.050
Token many
Token ID867
Feature activation+2.14e+00

Pos logit contributions
 guess+0.067
 count+0.065
ruits+0.061
 imagine+0.061
ff+0.060
Neg logit contributions
 Midnight-0.064
 Woo-0.060
 therapist-0.054
 drowned-0.053
 Velvet-0.053
Token toys
Token ID14958
Feature activation+5.70e-01

Pos logit contributions
 guess+0.115
ill+0.109
 count+0.105
ff+0.104
 ten+0.104
Neg logit contributions
 Midnight-0.059
actus-0.056
 band-0.053
ls-0.052
 swollen-0.051
Token,
Token ID11
Feature activation+4.05e+00

Pos logit contributions
 guess+0.021
 count+0.020
 three+0.019
 ten+0.019
 imagine+0.018
Neg logit contributions
actus-0.026
 therapist-0.026
 Midnight-0.026
ugg-0.023
 Woo-0.023
Token but
Token ID475
Feature activation+2.09e+00

Pos logit contributions
ff+0.186
rr+0.153
illed+0.149
 amaz+0.146
 1+0.145
Neg logit contributions
 Midnight-0.154
 Velvet-0.139
 band-0.137
 Woo-0.137
 poison-0.127
Token their
Token ID511
Feature activation+5.06e-01

Pos logit contributions
 guess+0.102
 count+0.102
rr+0.099
ff+0.099
adows+0.098
Neg logit contributions
 Midnight-0.062
 therapist-0.059
 Woo-0.055
 swollen-0.049
 Velvet-0.049
Token favorite
Token ID4004
Feature activation+7.61e-01

Pos logit contributions
 guess+0.016
 count+0.016
 counted+0.014
 ten+0.014
 three+0.014
Neg logit contributions
actus-0.026
 therapist-0.026
 Midnight-0.026
 masse-0.023
 drowned-0.023
Token was
Token ID373
Feature activation+7.07e-01

Pos logit contributions
 guess+0.037
 count+0.036
 ten+0.035
 three+0.034
 counted+0.034
Neg logit contributions
 therapist-0.027
actus-0.026
 Midnight-0.026
iflower-0.022
 drowned-0.021
Token a
Token ID257
Feature activation-1.36e-01

Pos logit contributions
 guess+0.028
 count+0.026
 imagine+0.024
 three+0.023
 ten+0.023
Neg logit contributions
 Midnight-0.033
 therapist-0.032
actus-0.030
 Woo-0.029
 masse-0.028
Token.
Token ID13
Feature activation+2.90e+00

Pos logit contributions
 guess+0.023
 count+0.023
 three+0.021
 ten+0.021
 imagine+0.021
Neg logit contributions
 therapist-0.021
actus-0.020
 Midnight-0.020
iflower-0.017
 masse-0.017
Token They
Token ID1119
Feature activation+2.31e+00

Pos logit contributions
 guess+0.090
 count+0.088
 imagine+0.085
 believe+0.084
 yes+0.080
Neg logit contributions
 Woo-0.125
 Midnight-0.119
actus-0.118
 Mitt-0.115
 Lucky-0.110
Token had
Token ID550
Feature activation+1.57e+00

Pos logit contributions
 guess+0.104
 count+0.100
 Today+0.100
adies+0.097
teen+0.097
Neg logit contributions
 Midnight-0.076
 drowned-0.067
 therapist-0.067
 died-0.065
 Woo-0.063
Token many
Token ID867
Feature activation+2.14e+00

Pos logit contributions
 guess+0.062
 count+0.061
 imagine+0.057
ruits+0.057
ff+0.055
Neg logit contributions
 Midnight-0.064
 Woo-0.060
 therapist-0.058
 drowned-0.057
actus-0.055
Token toys
Token ID14958
Feature activation+4.58e-01

Pos logit contributions
 guess+0.110
ill+0.103
 count+0.100
ff+0.099
adies+0.097
Neg logit contributions
actus-0.060
 band-0.059
ls-0.058
 Midnight-0.057
 swollen-0.055
Token,
Token ID11
Feature activation+4.04e+00

Pos logit contributions
 guess+0.017
 count+0.017
 three+0.015
 ten+0.015
 imagine+0.015
Neg logit contributions
 therapist-0.021
actus-0.021
 Midnight-0.020
 Woo-0.018
ugg-0.018
Token but
Token ID475
Feature activation+2.33e+00

Pos logit contributions
ff+0.170
rr+0.143
illed+0.141
 Today+0.137
 1+0.135
Neg logit contributions
 Midnight-0.160
 Woo-0.151
 band-0.146
 Velvet-0.142
 medicine-0.134
Token their
Token ID511
Feature activation+1.14e+00

Pos logit contributions
 count+0.115
adows+0.114
 guess+0.113
ff+0.113
rr+0.111
Neg logit contributions
 Midnight-0.061
 Woo-0.058
 therapist-0.057
 Velvet-0.049
 Mitt-0.047
Token favorite
Token ID4004
Feature activation+1.12e+00

Pos logit contributions
 guess+0.038
 count+0.036
 counted+0.034
 ten+0.033
 imagine+0.032
Neg logit contributions
actus-0.056
 therapist-0.055
 Midnight-0.054
 drowned-0.049
 masse-0.048
Token ones
Token ID3392
Feature activation+6.22e-01

Pos logit contributions
 guess+0.057
 count+0.055
 ten+0.053
 1+0.052
 three+0.052
Neg logit contributions
 therapist-0.037
actus-0.036
 Midnight-0.034
 drowned-0.030
 died-0.029
Token were
Token ID547
Feature activation+7.28e-01

Pos logit contributions
 guess+0.024
 count+0.023
 ten+0.022
 three+0.022
 imagine+0.021
Neg logit contributions
 therapist-0.028
actus-0.028
 Midnight-0.027
iflower-0.024
 drowned-0.024
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 guess+0.071
 count+0.069
 three+0.067
 ten+0.065
 imagine+0.065
Neg logit contributions
 therapist-0.048
actus-0.047
 Midnight-0.047
iflower-0.040
 masse-0.040
Token "
Token ID366
Feature activation+2.04e+00

Pos logit contributions
 count+0.009
 guess+0.009
 ten+0.008
 three+0.008
 imagine+0.008
Neg logit contributions
 therapist-0.008
actus-0.008
 Midnight-0.008
 masse-0.007
 Woo-0.007
TokenOne
Token ID3198
Feature activation+1.43e+00

Pos logit contributions
 count+0.107
 guess+0.105
 imagine+0.101
ruits+0.099
 counted+0.097
Neg logit contributions
 therapist-0.057
 masse-0.052
actus-0.052
 Woo-0.051
Now-0.050
Token,
Token ID11
Feature activation+2.74e+00

Pos logit contributions
 guess+0.059
 counted+0.056
 count+0.056
 whether+0.054
 blond+0.054
Neg logit contributions
actus-0.056
 therapist-0.054
 bottle-0.051
 Midnight-0.050
iflower-0.049
Token two
Token ID734
Feature activation+1.87e+00

Pos logit contributions
illed+0.124
adows+0.123
ff+0.118
ruits+0.116
 exception+0.110
Neg logit contributions
 Midnight-0.091
 therapist-0.086
 Woo-0.086
 band-0.083
 bottle-0.083
Token,
Token ID11
Feature activation+4.04e+00

Pos logit contributions
 guess+0.075
 count+0.067
 imagine+0.063
 counted+0.063
 ten+0.063
Neg logit contributions
 Midnight-0.074
 therapist-0.072
actus-0.070
 Woo-0.068
 band-0.066
Token three
Token ID1115
Feature activation+1.82e+00

Pos logit contributions
illed+0.171
adows+0.165
ff+0.157
ruits+0.151
ill+0.145
Neg logit contributions
 Midnight-0.149
 Woo-0.149
 band-0.148
 Velvet-0.139
 bottle-0.136
Token,
Token ID11
Feature activation+3.68e+00

Pos logit contributions
 guess+0.077
 count+0.073
 imagine+0.068
 ten+0.066
 three+0.066
Neg logit contributions
 therapist-0.068
actus-0.067
 Midnight-0.065
iflower-0.063
ls-0.063
Token four
Token ID1440
Feature activation+1.66e+00

Pos logit contributions
illed+0.144
adows+0.133
ff+0.132
ruits+0.127
 exception+0.124
Neg logit contributions
 Woo-0.142
 band-0.140
 Midnight-0.140
 Velvet-0.132
 bottle-0.129
Token,"
Token ID553
Feature activation+1.09e+00

Pos logit contributions
 guess+0.067
 count+0.062
 imagine+0.059
 ten+0.058
ruits+0.058
Neg logit contributions
 therapist-0.066
 Midnight-0.064
actus-0.063
iflower-0.058
ls-0.058
Token she
Token ID673
Feature activation+1.50e+00

Pos logit contributions
 count+0.048
 ten+0.046
ptions+0.045
 guess+0.045
ruits+0.044
Neg logit contributions
 therapist-0.041
 Midnight-0.039
actus-0.036
 masse-0.035
 ble-0.033
Token.
Token ID13
Feature activation+2.48e+00

Pos logit contributions
 guess+0.044
 count+0.042
 ten+0.039
 three+0.039
 imagine+0.039
Neg logit contributions
 therapist-0.035
actus-0.035
 Midnight-0.034
 drowned-0.031
 died-0.031
Token "
Token ID366
Feature activation+2.34e+00

Pos logit contributions
 guess+0.118
 count+0.114
 ten+0.113
 imagine+0.112
 yes+0.109
Neg logit contributions
 Midnight-0.079
 Woo-0.062
 Cathy-0.062
 therapist-0.062
actus-0.058
TokenOne
Token ID3198
Feature activation+1.80e+00

Pos logit contributions
 count+0.135
 guess+0.132
 imagine+0.127
ruits+0.125
 yes+0.125
Neg logit contributions
 Midnight-0.050
Now-0.049
 therapist-0.049
 Woo-0.044
burn-0.044
Token,
Token ID11
Feature activation+3.37e+00

Pos logit contributions
 guess+0.081
 blond+0.080
 counted+0.078
 blonde+0.078
ptions+0.077
Neg logit contributions
actus-0.057
ww-0.052
iflower-0.051
 therapist-0.050
 mosquito-0.047
Token two
Token ID734
Feature activation+2.15e+00

Pos logit contributions
illed+0.122
 blond+0.120
adows+0.118
ill+0.118
ff+0.114
Neg logit contributions
 Midnight-0.135
 Woo-0.129
 poppy-0.118
 bunny-0.116
 you-0.115
Token,
Token ID11
Feature activation+4.04e+00

Pos logit contributions
 guess+0.089
 count+0.079
 imagine+0.077
ruits+0.077
 whether+0.074
Neg logit contributions
 Midnight-0.079
actus-0.078
 drowned-0.074
 therapist-0.073
ls-0.073
Token three
Token ID1115
Feature activation+1.87e+00

Pos logit contributions
illed+0.118
adows+0.111
ff+0.105
owed+0.103
ruits+0.102
Neg logit contributions
 Woo-0.181
 Midnight-0.181
 band-0.175
 poppy-0.174
 mosquito-0.172
Token!"
Token ID2474
Feature activation+1.31e+00

Pos logit contributions
 guess+0.078
 count+0.075
 imagine+0.070
 ten+0.068
 three+0.066
Neg logit contributions
actus-0.072
 therapist-0.070
iflower-0.069
ls-0.068
 Midnight-0.068
Token they
Token ID484
Feature activation+2.27e+00

Pos logit contributions
 count+0.051
 ten+0.051
 guess+0.049
ruits+0.046
 three+0.046
Neg logit contributions
 therapist-0.058
 Midnight-0.058
actus-0.052
 masse-0.049
 Woo-0.049
Token shouted
Token ID17293
Feature activation+1.61e+00

Pos logit contributions
 count+0.112
 ten+0.106
ruits+0.105
 guess+0.103
 whether+0.102
Neg logit contributions
 Midnight-0.071
 therapist-0.068
 Woo-0.059
 drowned-0.056
actus-0.054
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
 count+0.069
 guess+0.067
 ten+0.063
 counted+0.062
ruits+0.062
Neg logit contributions
 Midnight-0.063
 therapist-0.062
 Woo-0.054
actus-0.054
 masse-0.052
Token steps
Token ID4831
Feature activation+1.71e+00

Pos logit contributions
 guess+0.066
ruits+0.062
 count+0.062
 counted+0.060
 imagine+0.060
Neg logit contributions
 therapist-0.040
actus-0.039
 masse-0.037
 Midnight-0.034
 disappears-0.032
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
 guess+0.078
 count+0.074
 imagine+0.070
ruits+0.067
 counted+0.066
Neg logit contributions
 therapist-0.063
actus-0.058
 Midnight-0.058
 masse-0.054
iflower-0.051
Token One
Token ID1881
Feature activation+1.79e+00

Pos logit contributions
 guess+0.087
 count+0.084
 imagine+0.081
 mate+0.078
ruits+0.075
Neg logit contributions
 Midnight-0.067
 therapist-0.066
 masse-0.059
actus-0.055
ugg-0.055
Token,
Token ID11
Feature activation+2.46e+00

Pos logit contributions
racted+0.079
 guess+0.078
ruits+0.078
 blond+0.077
ptions+0.076
Neg logit contributions
 therapist-0.055
 masse-0.052
actus-0.049
iflower-0.046
 band-0.046
Token two
Token ID734
Feature activation+2.74e+00

Pos logit contributions
ruits+0.093
ff+0.089
 guess+0.089
 count+0.087
 imagine+0.086
Neg logit contributions
 therapist-0.101
 masse-0.095
 Midnight-0.095
 band-0.089
 poppy-0.082
Token,
Token ID11
Feature activation+4.04e+00

Pos logit contributions
 guess+0.117
ruits+0.109
 imagine+0.106
ff+0.104
 whether+0.103
Neg logit contributions
ls-0.088
 masse-0.084
 Midnight-0.082
 therapist-0.081
actus-0.075
Token three
Token ID1115
Feature activation+2.79e+00

Pos logit contributions
ruits+0.119
immer+0.113
illed+0.110
 exception+0.104
ff+0.100
Neg logit contributions
 band-0.170
 therapist-0.166
 masse-0.159
 chip-0.159
 po-0.158
Token -
Token ID532
Feature activation+3.59e+00

Pos logit contributions
 guess+0.121
 count+0.117
ruits+0.116
 imagine+0.114
epad+0.112
Neg logit contributions
ls-0.097
 therapist-0.088
actus-0.084
iflower-0.083
 Midnight-0.080
Token all
Token ID477
Feature activation+1.04e+00

Pos logit contributions
ruits+0.125
illed+0.122
 guess+0.116
ff+0.116
immer+0.115
Neg logit contributions
 therapist-0.143
 Midnight-0.131
 masse-0.125
 Woo-0.119
 chip-0.114
Token the
Token ID262
Feature activation+1.94e+00

Pos logit contributions
 guess+0.052
 count+0.049
 ten+0.047
ruits+0.046
 three+0.046
Neg logit contributions
 therapist-0.032
 Midnight-0.029
actus-0.028
 masse-0.026
iflower-0.026
Token way
Token ID835
Feature activation-3.51e-01

Pos logit contributions
ruits+0.096
 guess+0.094
ff+0.093
 yes+0.092
immer+0.091
Neg logit contributions
 therapist-0.053
 masse-0.049
actus-0.046
 Midnight-0.043
 band-0.041
Token.
Token ID13
Feature activation+1.91e+00

Pos logit contributions
 guess+0.011
 count+0.011
 three+0.010
 ten+0.010
 imagine+0.010
Neg logit contributions
 therapist-0.013
actus-0.013
 Midnight-0.012
 masse-0.011
iflower-0.011
Token They
Token ID1119
Feature activation+1.68e+00

Pos logit contributions
 count+0.045
 guess+0.043
 counted+0.042
 ten+0.038
 turrets+0.037
Neg logit contributions
 Midnight-0.109
 Lucky-0.103
actus-0.101
 Woo-0.101
 Velvet-0.098
Token had
Token ID550
Feature activation+1.51e+00

Pos logit contributions
 guess+0.079
 count+0.073
 Today+0.072
ff+0.071
 exception+0.070
Neg logit contributions
 Midnight-0.058
 therapist-0.050
 drowned-0.048
 Woo-0.047
 died-0.047
Token many
Token ID867
Feature activation+2.04e+00

Pos logit contributions
 guess+0.064
 count+0.061
ruits+0.059
ff+0.058
 imagine+0.058
Neg logit contributions
 Midnight-0.060
 Woo-0.055
 drowned-0.050
 therapist-0.050
 band-0.050
Token toys
Token ID14958
Feature activation+3.41e-01

Pos logit contributions
 guess+0.108
ill+0.101
 count+0.101
ff+0.097
 ten+0.097
Neg logit contributions
actus-0.055
 Midnight-0.054
 band-0.054
 swollen-0.049
ls-0.049
Token,
Token ID11
Feature activation+4.04e+00

Pos logit contributions
 guess+0.013
 count+0.012
 three+0.011
 ten+0.011
 imagine+0.011
Neg logit contributions
 therapist-0.016
actus-0.016
 Midnight-0.016
ugg-0.014
 masse-0.014
Token but
Token ID475
Feature activation+1.59e+00

Pos logit contributions
ff+0.181
illed+0.150
 1+0.147
rr+0.143
used+0.142
Neg logit contributions
 Midnight-0.155
 band-0.144
 Velvet-0.143
 Woo-0.139
 poison-0.130
Token their
Token ID511
Feature activation+5.67e-01

Pos logit contributions
 count+0.082
 guess+0.082
ff+0.080
adows+0.078
 ten+0.078
Neg logit contributions
 Midnight-0.046
 therapist-0.042
 Woo-0.039
 Velvet-0.036
 Lucky-0.036
Token favorite
Token ID4004
Feature activation+9.34e-01

Pos logit contributions
 guess+0.019
 count+0.018
 counted+0.016
 ten+0.016
 three+0.016
Neg logit contributions
actus-0.029
 therapist-0.028
 Midnight-0.028
 drowned-0.026
 masse-0.025
Token was
Token ID373
Feature activation+7.35e-01

Pos logit contributions
 guess+0.046
 count+0.045
 ten+0.043
 three+0.042
 1+0.042
Neg logit contributions
 therapist-0.031
actus-0.031
 Midnight-0.030
 drowned-0.026
 died-0.026
Token a
Token ID257
Feature activation-3.05e-01

Pos logit contributions
 guess+0.030
 count+0.028
 imagine+0.026
 three+0.025
 counted+0.025
Neg logit contributions
 Midnight-0.033
 therapist-0.031
actus-0.030
 Woo-0.029
 masse-0.028
Token to
Token ID284
Feature activation+3.16e+00

Pos logit contributions
 guess+0.055
 count+0.052
ruits+0.052
 yes+0.052
 imagine+0.050
Neg logit contributions
 therapist-0.036
actus-0.032
 masse-0.031
 Midnight-0.031
 drowned-0.029
Token count
Token ID954
Feature activation+2.52e+00

Pos logit contributions
ruits+0.106
 yes+0.097
 mate+0.095
immer+0.094
adies+0.093
Neg logit contributions
 tree-0.125
 ble-0.120
 lands-0.119
 band-0.117
 dri-0.117
Token and
Token ID290
Feature activation+3.00e+00

Pos logit contributions
 guess+0.137
 count+0.128
ruits+0.127
 imagine+0.126
 counted+0.124
Neg logit contributions
ls-0.054
 therapist-0.049
 lands-0.045
actus-0.044
iflower-0.043
Token she
Token ID673
Feature activation+2.43e+00

Pos logit contributions
ruits+0.109
illed+0.099
immer+0.098
ff+0.096
 blond+0.096
Neg logit contributions
 therapist-0.117
 Midnight-0.104
 lands-0.103
 she-0.099
 disappears-0.099
Token counted
Token ID14789
Feature activation+3.40e+00

Pos logit contributions
ruits+0.068
 Today+0.064
 yes+0.061
 blond+0.059
 guess+0.058
Neg logit contributions
 therapist-0.118
 drowned-0.113
 died-0.112
actus-0.110
 Midnight-0.104
Token to
Token ID284
Feature activation+4.03e+00

Pos logit contributions
 guess+0.179
ruits+0.173
 want+0.168
 imagine+0.167
 Many+0.166
Neg logit contributions
ls-0.066
 lands-0.064
 therapist-0.056
iflower-0.055
actus-0.054
Token five
Token ID1936
Feature activation+2.56e+00

Pos logit contributions
ruits+0.110
immer+0.102
 yes+0.097
 mate+0.095
 exception+0.093
Neg logit contributions
 tree-0.175
 lands-0.170
 band-0.153
 safety-0.152
 bush-0.149
Token.
Token ID13
Feature activation+8.84e-01

Pos logit contributions
 guess+0.120
ruits+0.114
 imagine+0.113
 mate+0.113
 yes+0.112
Neg logit contributions
ls-0.080
actus-0.071
 lands-0.070
 therapist-0.068
iflower-0.067
Token Then
Token ID3244
Feature activation-2.15e-01

Pos logit contributions
 guess+0.037
 count+0.035
 imagine+0.032
 ten+0.032
 counted+0.031
Neg logit contributions
 Midnight-0.039
 therapist-0.038
actus-0.037
 masse-0.034
 Woo-0.033
Token her
Token ID607
Feature activation+8.32e-01

Pos logit contributions
 therapist+0.012
actus+0.012
 Midnight+0.011
 masse+0.011
lor+0.011
Neg logit contributions
 guess-0.007
 count-0.007
 three-0.006
 ten-0.006
 counted-0.006
Token head
Token ID1182
Feature activation+1.77e+00

Pos logit contributions
 guess+0.037
 count+0.036
 whether+0.034
 imagine+0.034
 ten+0.034
Neg logit contributions
 therapist-0.031
actus-0.030
 Midnight-0.029
 masse-0.028
iflower-0.026
Token blocks
Token ID7021
Feature activation+1.87e+00

Pos logit contributions
 guess+0.098
ruits+0.093
 yes+0.091
 imagine+0.091
 blond+0.089
Neg logit contributions
iflower-0.072
actus-0.072
 band-0.069
 therapist-0.068
 masse-0.068
Token.
Token ID13
Feature activation+2.18e+00

Pos logit contributions
 guess+0.091
 blond+0.085
 count+0.084
 imagine+0.083
 blonde+0.082
Neg logit contributions
 therapist-0.055
actus-0.051
iflower-0.049
 Midnight-0.047
 masse-0.046
Token One
Token ID1881
Feature activation+1.95e+00

Pos logit contributions
 guess+0.101
 imagine+0.096
 count+0.096
 yes+0.091
ruits+0.087
Neg logit contributions
 Midnight-0.074
 therapist-0.066
 Woo-0.063
 Mitt-0.059
 Lucky-0.058
Token,
Token ID11
Feature activation+3.11e+00

Pos logit contributions
 blond+0.092
 blonde+0.090
adies+0.088
ptions+0.087
racted+0.086
Neg logit contributions
iflower-0.050
 therapist-0.049
 band-0.049
 chip-0.049
 jar-0.047
Token two
Token ID734
Feature activation+3.26e+00

Pos logit contributions
ff+0.139
ruits+0.134
adows+0.132
iner+0.129
illed+0.128
Neg logit contributions
 band-0.098
 Midnight-0.095
 therapist-0.093
 bottle-0.086
 Woo-0.085
Token,
Token ID11
Feature activation+4.02e+00

Pos logit contributions
 guess+0.144
ruits+0.133
 imagine+0.133
ff+0.133
adies+0.132
Neg logit contributions
ls-0.091
 drowned-0.088
 died-0.085
 chip-0.085
 Midnight-0.083
Token three
Token ID1115
Feature activation+3.00e+00

Pos logit contributions
immer+0.124
ruits+0.120
illed+0.118
ff+0.113
 exception+0.110
Neg logit contributions
 band-0.172
 chip-0.155
 Midnight-0.152
 therapist-0.152
 jar-0.149
Token,
Token ID11
Feature activation+3.80e+00

Pos logit contributions
 guess+0.133
 count+0.129
 imagine+0.128
ruits+0.125
epad+0.124
Neg logit contributions
ls-0.096
iflower-0.084
 therapist-0.082
actus-0.081
 drowned-0.080
Token four
Token ID1440
Feature activation+3.15e+00

Pos logit contributions
immer+0.122
ruits+0.122
illed+0.115
ff+0.112
ust+0.108
Neg logit contributions
 band-0.147
 therapist-0.138
 chip-0.136
 Midnight-0.135
 she-0.132
Token...
Token ID986
Feature activation+3.54e+00

Pos logit contributions
 guess+0.137
ruits+0.133
 imagine+0.131
adies+0.129
 count+0.126
Neg logit contributions
ls-0.094
 therapist-0.084
 drowned-0.083
 died-0.081
 chip-0.081
Token The
Token ID383
Feature activation+2.22e+00

Pos logit contributions
 imagine+0.133
 mate+0.127
 guess+0.125
 count+0.124
 blond+0.121
Neg logit contributions
 therapist-0.131
Now-0.125
we-0.124
 Midnight-0.118
-0.115
Token.
Token ID13
Feature activation+2.02e+00

Pos logit contributions
 guess+0.006
 count+0.006
 three+0.005
 ten+0.005
 imagine+0.005
Neg logit contributions
 therapist-0.006
actus-0.006
 Midnight-0.005
 masse-0.005
iflower-0.005
Token They
Token ID1119
Feature activation+1.88e+00

Pos logit contributions
 count+0.057
 guess+0.054
 counted+0.051
 yes+0.049
 ten+0.046
Neg logit contributions
 Midnight-0.103
 Lucky-0.097
 Woo-0.096
actus-0.094
 therapist-0.093
Token had
Token ID550
Feature activation+1.66e+00

Pos logit contributions
 guess+0.082
 count+0.080
adies+0.078
teen+0.076
 ten+0.076
Neg logit contributions
 Midnight-0.068
 therapist-0.059
 drowned-0.057
 Woo-0.057
 died-0.056
Token many
Token ID867
Feature activation+1.84e+00

Pos logit contributions
 guess+0.070
 count+0.070
ruits+0.065
 imagine+0.065
ff+0.064
Neg logit contributions
 Midnight-0.066
 Woo-0.063
 therapist-0.055
 drowned-0.054
 Velvet-0.054
Token toys
Token ID14958
Feature activation+3.09e-01

Pos logit contributions
 guess+0.097
 count+0.091
ill+0.089
 ten+0.089
ff+0.086
Neg logit contributions
actus-0.055
 Midnight-0.053
 band-0.050
ls-0.048
 swollen-0.047
Token,
Token ID11
Feature activation+4.01e+00

Pos logit contributions
 guess+0.011
 count+0.011
 three+0.010
 ten+0.010
 imagine+0.010
Neg logit contributions
 therapist-0.015
actus-0.015
 Midnight-0.014
 masse-0.013
ugg-0.013
Token but
Token ID475
Feature activation+1.61e+00

Pos logit contributions
ff+0.173
 1+0.148
rr+0.137
used+0.136
teen+0.135
Neg logit contributions
 Midnight-0.159
 Woo-0.144
 band-0.144
 Velvet-0.142
 poison-0.139
Token their
Token ID511
Feature activation+7.64e-01

Pos logit contributions
 count+0.083
 guess+0.081
 ten+0.078
ff+0.076
 counted+0.075
Neg logit contributions
 Midnight-0.048
 therapist-0.046
 Woo-0.041
 Lucky-0.038
actus-0.037
Token favorite
Token ID4004
Feature activation+9.67e-01

Pos logit contributions
 guess+0.025
 count+0.024
 counted+0.023
 ten+0.022
 three+0.021
Neg logit contributions
actus-0.039
 Midnight-0.038
 therapist-0.038
 drowned-0.034
 masse-0.034
Token was
Token ID373
Feature activation+8.52e-01

Pos logit contributions
 guess+0.049
 count+0.048
 ten+0.046
 three+0.045
 1+0.045
Neg logit contributions
 therapist-0.032
actus-0.031
 Midnight-0.030
 drowned-0.026
 died-0.026
Token a
Token ID257
Feature activation-4.04e-01

Pos logit contributions
 guess+0.033
 count+0.031
 imagine+0.029
 three+0.027
 counted+0.027
Neg logit contributions
 Midnight-0.040
 therapist-0.038
actus-0.036
 Woo-0.035
 masse-0.034
Token so
Token ID523
Feature activation+1.48e+00

Pos logit contributions
 guess+0.144
illed+0.134
 count+0.128
 imagine+0.127
 counted+0.127
Neg logit contributions
actus-0.097
iflower-0.095
 therapist-0.093
 Midnight-0.093
ls-0.090
Token high
Token ID1029
Feature activation-1.93e-01

Pos logit contributions
 guess+0.062
 count+0.061
 counted+0.058
 imagine+0.057
 blond+0.054
Neg logit contributions
 therapist-0.058
 Midnight-0.056
actus-0.054
 masse-0.053
iflower-0.052
Token.
Token ID13
Feature activation+2.80e+00

Pos logit contributions
 therapist+0.009
actus+0.009
 Midnight+0.008
 masse+0.008
lor+0.008
Neg logit contributions
 guess-0.008
 count-0.008
 three-0.007
 ten-0.007
 counted-0.007
Token How
Token ID1374
Feature activation+3.12e+00

Pos logit contributions
 guess+0.126
illed+0.117
 count+0.115
 imagine+0.114
ptions+0.114
Neg logit contributions
 Midnight-0.097
 therapist-0.092
 masse-0.092
 Woo-0.088
 Velvet-0.082
Token did
Token ID750
Feature activation+1.68e-01

Pos logit contributions
 count+0.120
ptions+0.117
 blond+0.116
 counted+0.115
 guess+0.111
Neg logit contributions
actus-0.099
iflower-0.090
 therapist-0.086
 well-0.085
ww-0.085
Token you
Token ID345
Feature activation+4.00e+00

Pos logit contributions
 guess+0.010
 count+0.010
 three+0.009
 ten+0.009
 counted+0.009
Neg logit contributions
 therapist-0.004
 Midnight-0.004
actus-0.004
 masse-0.004
iflower-0.003
Token learn
Token ID2193
Feature activation+1.67e+00

Pos logit contributions
 blond+0.150
 blonde+0.145
ruits+0.140
illed+0.132
 English+0.123
Neg logit contributions
 hurt-0.132
 band-0.129
 tree-0.128
 you-0.122
 healed-0.122
Token to
Token ID284
Feature activation+2.52e+00

Pos logit contributions
 guess+0.073
 count+0.070
 counted+0.068
 imagine+0.067
ruits+0.067
Neg logit contributions
actus-0.060
 therapist-0.060
 Midnight-0.060
lor-0.056
 masse-0.056
Token count
Token ID954
Feature activation+2.80e+00

Pos logit contributions
ruits+0.068
 blonde+0.066
illed+0.065
 blond+0.065
 yes+0.063
Neg logit contributions
iflower-0.108
 band-0.108
actus-0.107
 therapist-0.106
 Midnight-0.105
Token so
Token ID523
Feature activation+1.46e+00

Pos logit contributions
 guess+0.129
 counted+0.120
 count+0.119
 blond+0.116
illed+0.115
Neg logit contributions
actus-0.085
 therapist-0.085
iflower-0.083
 Midnight-0.081
 masse-0.078
Token well
Token ID880
Feature activation+3.87e-01

Pos logit contributions
 count+0.061
 guess+0.060
 counted+0.058
 imagine+0.056
 blond+0.055
Neg logit contributions
 therapist-0.057
actus-0.054
 Midnight-0.052
 masse-0.052
iflower-0.051
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
 therapist+0.017
actus+0.017
 Midnight+0.016
 masse+0.015
lor+0.015
Neg logit contributions
 guess-0.018
 count-0.018
 three-0.017
 ten-0.017
 imagine-0.016
Token mud
Token ID17492
Feature activation-1.47e+00

Pos logit contributions
 therapist+0.056
 Midnight+0.052
actus+0.050
lor+0.050
 masse+0.047
Neg logit contributions
 guess-0.043
 count-0.043
 three-0.041
 ten-0.039
 One-0.038
Token all
Token ID477
Feature activation-6.05e-01

Pos logit contributions
 therapist+0.083
 Midnight+0.081
actus+0.081
 masse+0.080
lor+0.074
Neg logit contributions
 count-0.046
 guess-0.045
 ten-0.042
 three-0.040
 counted-0.040
Token the
Token ID262
Feature activation-4.74e-01

Pos logit contributions
 therapist+0.025
actus+0.025
 Midnight+0.024
 masse+0.022
lor+0.022
Neg logit contributions
 count-0.027
 guess-0.027
 three-0.026
 ten-0.026
 like-0.025
Token time
Token ID640
Feature activation-2.17e+00

Pos logit contributions
 therapist+0.023
actus+0.022
 Midnight+0.022
lor+0.020
 masse+0.020
Neg logit contributions
 guess-0.019
 count-0.018
 three-0.017
 ten-0.017
 imagine-0.016
Token.
Token ID13
Feature activation-4.13e+00

Pos logit contributions
 therapist+0.128
actus+0.125
 masse+0.119
lor+0.116
 Midnight+0.111
Neg logit contributions
 like-0.059
 guess-0.057
 three-0.057
 ten-0.055
 count-0.054
Token<|endoftext|>
Token ID50256
Feature activation-1.63e+00

Pos logit contributions
 therapist+0.285
 swollen+0.279
guard+0.271
actus+0.270
eds+0.270
Neg logit contributions
 One-0.107
 Many-0.058
 three-0.057
 like-0.056
 Today-0.055
TokenOnce
Token ID7454
Feature activation-6.34e-01

Pos logit contributions
 therapist+0.086
actus+0.083
 masse+0.082
 Midnight+0.080
 swollen+0.078
Neg logit contributions
 guess-0.054
 three-0.051
 One-0.051
 count-0.050
 ten-0.048
Token upon
Token ID2402
Feature activation+9.36e-01

Pos logit contributions
 therapist+0.043
actus+0.042
 Midnight+0.042
 masse+0.040
lor+0.039
Neg logit contributions
 guess-0.011
 count-0.011
 three-0.010
 ten-0.009
 imagine-0.008
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
 count+0.031
 guess+0.030
 1+0.028
 counted+0.027
 ten+0.027
Neg logit contributions
 Midnight-0.047
 therapist-0.046
actus-0.046
iflower-0.042
 Woo-0.042
Token time
Token ID640
Feature activation+1.50e+00

Pos logit contributions
 count+0.046
 guess+0.045
 counted+0.042
 three+0.042
 ten+0.042
Neg logit contributions
 Midnight-0.052
actus-0.052
 therapist-0.050
iflower-0.046
 masse-0.046
Token little
Token ID1310
Feature activation-1.14e+00

Pos logit contributions
actus+0.040
 therapist+0.040
lor+0.040
iflower+0.038
 Midnight+0.036
Neg logit contributions
 guess-0.073
 three-0.072
 count-0.069
 One-0.065
 like-0.064
Token girl
Token ID2576
Feature activation-1.96e-01

Pos logit contributions
actus+0.043
 therapist+0.041
iflower+0.040
lor+0.040
 Midnight+0.039
Neg logit contributions
 guess-0.055
 three-0.054
 count-0.052
 imagine-0.049
 like-0.048
Token and
Token ID290
Feature activation-2.52e+00

Pos logit contributions
 therapist+0.010
actus+0.010
 Midnight+0.009
 masse+0.009
lor+0.008
Neg logit contributions
 guess-0.007
 count-0.007
 three-0.006
 ten-0.006
 imagine-0.006
Token her
Token ID607
Feature activation-8.48e-01

Pos logit contributions
actus+0.141
lor+0.126
iflower+0.124
 therapist+0.120
eds+0.120
Neg logit contributions
 guess-0.093
 count-0.089
 three-0.087
 counted-0.083
 imagine-0.083
Token family
Token ID1641
Feature activation-2.06e+00

Pos logit contributions
actus+0.038
 therapist+0.038
 Midnight+0.036
lor+0.036
 masse+0.034
Neg logit contributions
 guess-0.035
 count-0.034
 three-0.033
 ten-0.031
 imagine-0.030
Token.
Token ID13
Feature activation-4.04e+00

Pos logit contributions
actus+0.113
 Midnight+0.109
 masse+0.109
 therapist+0.108
 swollen+0.103
Neg logit contributions
 count-0.064
 guess-0.064
 three-0.062
 like-0.060
 counted-0.058
Token<|endoftext|>
Token ID50256
Feature activation-1.55e+00

Pos logit contributions
 swollen+0.268
 therapist+0.266
eds+0.260
iflower+0.260
 drowned+0.255
Neg logit contributions
 One-0.110
 Many-0.064
 three-0.063
 Today-0.060
 Some-0.060
TokenOnce
Token ID7454
Feature activation-1.78e-01

Pos logit contributions
 therapist+0.077
actus+0.076
 masse+0.073
 Midnight+0.073
 swollen+0.071
Neg logit contributions
 guess-0.055
 three-0.052
 One-0.051
 count-0.051
 ten-0.049
Token upon
Token ID2402
Feature activation+1.48e+00

Pos logit contributions
 therapist+0.012
actus+0.012
 Midnight+0.011
 masse+0.011
lor+0.011
Neg logit contributions
 guess-0.004
 count-0.003
 three-0.003
 ten-0.003
 imagine-0.003
Token a
Token ID257
Feature activation+1.40e+00

Pos logit contributions
 count+0.055
 guess+0.052
 1+0.051
 counted+0.049
ruits+0.048
Neg logit contributions
 Midnight-0.069
 therapist-0.066
actus-0.064
 Woo-0.060
iflower-0.060
Token time
Token ID640
Feature activation+1.89e+00

Pos logit contributions
 count+0.056
 guess+0.054
 counted+0.052
 ten+0.051
 three+0.051
Neg logit contributions
actus-0.059
 Midnight-0.059
 therapist-0.056
iflower-0.052
 masse-0.051
Token and
Token ID290
Feature activation-3.36e+00

Pos logit contributions
actus+0.123
 therapist+0.123
 masse+0.119
 severe+0.115
 Midnight+0.115
Neg logit contributions
 like-0.061
 guess-0.058
 count-0.052
 three-0.051
 whether-0.048
Token grateful
Token ID14066
Feature activation-2.08e+00

Pos logit contributions
actus+0.197
lor+0.192
orns+0.179
rob+0.172
 therapist+0.171
Neg logit contributions
 counted-0.108
 guess-0.106
 like-0.096
 excited-0.096
 count-0.095
Token to
Token ID284
Feature activation-2.54e+00

Pos logit contributions
actus+0.119
 therapist+0.114
 masse+0.110
lor+0.110
 Midnight+0.110
Neg logit contributions
 like-0.066
 guess-0.061
 three-0.058
 count-0.055
 ten-0.053
Token her
Token ID607
Feature activation-7.64e-01

Pos logit contributions
actus+0.162
 therapist+0.155
lor+0.148
iflower+0.139
uddled+0.136
Neg logit contributions
 count-0.075
 One-0.074
 guess-0.073
 three-0.073
 like-0.066
Token friend
Token ID1545
Feature activation-8.93e-01

Pos logit contributions
actus+0.038
 therapist+0.037
 Midnight+0.035
lor+0.034
ugg+0.033
Neg logit contributions
 guess-0.031
 count-0.030
 three-0.028
 imagine-0.026
 ten-0.026
Token.
Token ID13
Feature activation-4.02e+00

Pos logit contributions
actus+0.043
 therapist+0.041
 Midnight+0.039
 masse+0.038
iflower+0.038
Neg logit contributions
 guess-0.035
 count-0.035
 three-0.032
 ten-0.032
 counted-0.030
Token From
Token ID3574
Feature activation-1.72e+00

Pos logit contributions
 severe+0.249
eds+0.249
 thorn+0.244
 drowned+0.243
 swollen+0.243
Neg logit contributions
 One-0.117
 three-0.073
 Today-0.063
 Many-0.063
 As-0.059
Token that
Token ID326
Feature activation-1.19e+00

Pos logit contributions
 therapist+0.081
lor+0.079
actus+0.075
 Midnight+0.073
 sy+0.073
Neg logit contributions
 guess-0.066
 like-0.064
 three-0.064
 count-0.060
 ten-0.058
Token day
Token ID1110
Feature activation-1.99e+00

Pos logit contributions
actus+0.059
 therapist+0.056
lor+0.053
 Midnight+0.053
 masse+0.051
Neg logit contributions
 guess-0.047
 count-0.047
 three-0.045
 One-0.042
 ten-0.041
Token on
Token ID319
Feature activation-2.27e+00

Pos logit contributions
actus+0.107
 therapist+0.101
 Midnight+0.092
lor+0.089
keeping+0.089
Neg logit contributions
 three-0.067
 count-0.067
 guess-0.066
 like-0.063
 ten-0.062
Token,
Token ID11
Feature activation-3.07e+00

Pos logit contributions
actus+0.187
 therapist+0.179
lor+0.178
keeping+0.174
eds+0.170
Neg logit contributions
 One-0.012
 count-0.007
 three-0.004
 like-0.002
 guess-0.002
Token crawled
Token ID45668
Feature activation-7.34e-02

Pos logit contributions
 count+0.012
 guess+0.012
 ten+0.011
 three+0.011
 imagine+0.011
Neg logit contributions
 therapist-0.015
actus-0.015
 Midnight-0.015
 drowned-0.014
 masse-0.013
Token out
Token ID503
Feature activation+6.07e-01

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.003
lor+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.003
 ten-0.003
 imagine-0.003
Token from
Token ID422
Feature activation+2.78e-01

Pos logit contributions
 guess+0.028
 count+0.028
 imagine+0.026
 counted+0.026
ruits+0.025
Neg logit contributions
 therapist-0.022
 Midnight-0.021
actus-0.021
 masse-0.019
 Woo-0.019
Token under
Token ID739
Feature activation+2.54e-01

Pos logit contributions
 guess+0.011
 count+0.011
 three+0.010
 ten+0.010
 counted+0.010
Neg logit contributions
actus-0.012
 therapist-0.012
 Midnight-0.012
 masse-0.011
 Woo-0.011
Token the
Token ID262
Feature activation-2.25e+00

Pos logit contributions
 guess+0.011
 count+0.011
 imagine+0.010
 ten+0.010
 counted+0.010
Neg logit contributions
 Midnight-0.010
actus-0.010
 therapist-0.010
 Woo-0.009
 masse-0.009
Token leather
Token ID11620
Feature activation-4.01e+00

Pos logit contributions
 therapist+0.106
lor+0.105
ugg+0.099
iflower+0.098
ic+0.096
Neg logit contributions
 three-0.085
 count-0.081
 guess-0.079
 like-0.079
 ten-0.076
Token and
Token ID290
Feature activation-3.97e-01

Pos logit contributions
iflower+0.177
 swollen+0.170
live+0.168
ww+0.168
 therapist+0.167
Neg logit contributions
 three-0.128
 like-0.119
 guess-0.119
 One-0.116
 but-0.116
Token surprised
Token ID6655
Feature activation+5.83e-02

Pos logit contributions
 therapist+0.019
actus+0.018
 Midnight+0.018
 masse+0.017
lor+0.017
Neg logit contributions
 guess-0.016
 count-0.015
 three-0.014
 ten-0.014
 counted-0.014
Token Sammy
Token ID44067
Feature activation-3.50e-01

Pos logit contributions
 guess+0.002
 count+0.002
 three+0.002
 ten+0.002
 counted+0.002
Neg logit contributions
 therapist-0.003
actus-0.003
 Midnight-0.003
 masse-0.003
 Woo-0.002
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
 therapist+0.016
actus+0.015
 Midnight+0.015
 masse+0.014
iflower+0.013
Neg logit contributions
 guess-0.014
 count-0.014
 three-0.013
 ten-0.013
 imagine-0.013
Token Sammy
Token ID44067
Feature activation-6.89e-01

Pos logit contributions
 guess+0.008
 count+0.008
 ten+0.007
 three+0.007
 imagine+0.007
Neg logit contributions
 Midnight-0.010
 therapist-0.010
actus-0.010
 masse-0.009
 Woo-0.009
Token first
Token ID717
Feature activation+9.98e-02

Pos logit contributions
 guess+0.007
 count+0.007
 three+0.006
 ten+0.006
 imagine+0.006
Neg logit contributions
 therapist-0.005
actus-0.005
 Midnight-0.005
 masse-0.005
iflower-0.005
Token."
Token ID526
Feature activation-8.17e-01

Pos logit contributions
 guess+0.004
 count+0.004
 three+0.004
 ten+0.004
 counted+0.004
Neg logit contributions
 therapist-0.004
actus-0.004
 Midnight-0.004
 masse-0.004
iflower-0.004
Token Tim
Token ID5045
Feature activation+5.24e-02

Pos logit contributions
 therapist+0.049
actus+0.048
 Midnight+0.045
 masse+0.044
lor+0.044
Neg logit contributions
 guess-0.023
 count-0.021
 three-0.020
 One-0.020
 ten-0.019
Tokenmy
Token ID1820
Feature activation-8.20e-01

Pos logit contributions
 guess+0.001
 count+0.001
 three+0.001
 ten+0.001
 imagine+0.001
Neg logit contributions
 therapist-0.003
actus-0.003
 Midnight-0.003
 masse-0.003
iflower-0.003
Token promised
Token ID8072
Feature activation-1.81e+00

Pos logit contributions
actus+0.040
 therapist+0.039
 Midnight+0.039
 masse+0.036
lor+0.036
Neg logit contributions
 guess-0.031
 count-0.029
 three-0.028
 counted-0.027
 ten-0.026
Token and
Token ID290
Feature activation-3.98e+00

Pos logit contributions
actus+0.108
 therapist+0.100
iflower+0.099
 Midnight+0.095
lor+0.092
Neg logit contributions
 guess-0.057
 three-0.054
 count-0.053
 like-0.049
 One-0.049
Token quickly
Token ID2952
Feature activation-2.06e+00

Pos logit contributions
actus+0.245
orns+0.212
eds+0.211
lor+0.205
iflower+0.202
Neg logit contributions
 three-0.116
 counted-0.115
 guess-0.114
 count-0.099
 yes-0.099
Token ran
Token ID4966
Feature activation-8.31e-01

Pos logit contributions
actus+0.113
 therapist+0.107
iflower+0.101
 Midnight+0.100
ich+0.097
Neg logit contributions
 guess-0.072
 counted-0.070
 count-0.070
 three-0.063
 like-0.060
Token out
Token ID503
Feature activation+1.40e-01

Pos logit contributions
actus+0.030
 therapist+0.030
iflower+0.028
 Midnight+0.028
lor+0.026
Neg logit contributions
 guess-0.042
 count-0.042
 three-0.040
 ten-0.040
 like-0.038
Token of
Token ID286
Feature activation-1.31e+00

Pos logit contributions
 guess+0.006
 count+0.006
 ten+0.005
 three+0.005
 counted+0.005
Neg logit contributions
 therapist-0.006
actus-0.006
 Midnight-0.006
 masse-0.005
 drowned-0.005
Token the
Token ID262
Feature activation-5.54e-01

Pos logit contributions
actus+0.073
 therapist+0.073
lor+0.070
ugg+0.064
iflower+0.064
Neg logit contributions
 ten-0.044
 count-0.044
 guess-0.043
 three-0.043
 One-0.036
Token for
Token ID329
Feature activation-9.38e-01

Pos logit contributions
 guess+0.009
 count+0.008
 ten+0.008
 three+0.008
 counted+0.008
Neg logit contributions
actus-0.008
 therapist-0.008
 Midnight-0.008
 masse-0.007
 Woo-0.007
Token her
Token ID607
Feature activation-2.51e-01

Pos logit contributions
actus+0.039
 therapist+0.038
lor+0.035
iflower+0.034
ugg+0.033
Neg logit contributions
 guess-0.043
 count-0.043
 three-0.041
 ten-0.041
 One-0.039
Token everywhere
Token ID8347
Feature activation-2.61e-01

Pos logit contributions
 therapist+0.011
actus+0.011
 Midnight+0.011
 masse+0.010
lor+0.010
Neg logit contributions
 guess-0.010
 count-0.010
 three-0.009
 ten-0.009
 imagine-0.009
Token,
Token ID11
Feature activation-1.14e-01

Pos logit contributions
 therapist+0.015
actus+0.015
 Midnight+0.014
 masse+0.013
lor+0.013
Neg logit contributions
 guess-0.008
 count-0.008
 three-0.007
 ten-0.007
 counted-0.006
Token but
Token ID475
Feature activation-1.41e+00

Pos logit contributions
 therapist+0.008
actus+0.008
 Midnight+0.007
 masse+0.007
iflower+0.007
Neg logit contributions
 guess-0.002
 count-0.002
 three-0.002
 ten-0.002
 counted-0.001
Token couldn
Token ID3521
Feature activation-3.96e+00

Pos logit contributions
actus+0.067
 therapist+0.062
lor+0.062
iflower+0.059
 masse+0.058
Neg logit contributions
 guess-0.057
 count-0.054
 three-0.053
 like-0.052
 ten-0.051
Token't
Token ID470
Feature activation-3.70e-01

Pos logit contributions
actus+0.248
 therapist+0.228
rows+0.223
ugg+0.219
iflower+0.218
Neg logit contributions
 One-0.121
 count-0.084
 guess-0.083
 Two-0.081
 three-0.081
Token find
Token ID1064
Feature activation-1.52e+00

Pos logit contributions
 therapist+0.021
actus+0.021
 Midnight+0.020
 masse+0.019
lor+0.019
Neg logit contributions
 guess-0.011
 count-0.011
 three-0.010
 ten-0.010
 imagine-0.009
Token her
Token ID607
Feature activation-6.51e-01

Pos logit contributions
actus+0.076
 therapist+0.076
iflower+0.068
lor+0.067
ugg+0.066
Neg logit contributions
 ten-0.058
 count-0.058
 guess-0.056
 three-0.056
 One-0.053
Token.
Token ID13
Feature activation-2.11e+00

Pos logit contributions
 therapist+0.031
actus+0.030
 Midnight+0.029
 masse+0.028
lor+0.027
Neg logit contributions
 guess-0.025
 count-0.025
 three-0.023
 ten-0.023
 imagine-0.022
Token A
Token ID317
Feature activation-1.07e+00

Pos logit contributions
actus+0.115
 therapist+0.114
iflower+0.113
lor+0.105
eds+0.104
Neg logit contributions
 One-0.070
 guess-0.062
 three-0.060
 Many-0.055
 count-0.052
Token apple
Token ID17180
Feature activation-2.30e+00

Pos logit contributions
 therapist+0.033
actus+0.032
 Midnight+0.031
lor+0.031
ugg+0.028
Neg logit contributions
 count-0.041
 guess-0.040
 three-0.039
 ten-0.038
 imagine-0.037
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
actus+0.130
 therapist+0.130
lor+0.126
 masse+0.122
ugg+0.122
Neg logit contributions
 count-0.064
 guess-0.064
 like-0.062
 ten-0.058
 three-0.056
Token her
Token ID607
Feature activation-8.77e-01

Pos logit contributions
actus+0.130
lor+0.128
 therapist+0.123
 masse+0.117
ugg+0.112
Neg logit contributions
 guess-0.083
 counted-0.083
 like-0.081
 three-0.081
 count-0.080
Token tum
Token ID11814
Feature activation-5.01e-01

Pos logit contributions
 therapist+0.040
actus+0.039
 Midnight+0.037
lor+0.037
 masse+0.035
Neg logit contributions
 guess-0.036
 count-0.035
 three-0.034
 ten-0.032
 imagine-0.031
Tokenmy
Token ID1820
Feature activation-1.89e+00

Pos logit contributions
 therapist+0.032
actus+0.031
 Midnight+0.031
 masse+0.030
lor+0.029
Neg logit contributions
 guess-0.011
 count-0.011
 three-0.009
 ten-0.009
 counted-0.009
Token couldn
Token ID3521
Feature activation-3.94e+00

Pos logit contributions
 therapist+0.074
actus+0.073
 Midnight+0.069
lor+0.069
 Woo+0.067
Neg logit contributions
 count-0.083
 guess-0.083
 three-0.080
 like-0.079
 counted-0.078
Token't
Token ID470
Feature activation-1.11e+00

Pos logit contributions
actus+0.236
 therapist+0.230
ugg+0.210
iflower+0.208
ich+0.206
Neg logit contributions
 One-0.119
 count-0.099
 guess-0.090
 three-0.086
 Many-0.082
Token handle
Token ID5412
Feature activation-8.21e-01

Pos logit contributions
 therapist+0.064
actus+0.062
lor+0.059
 Midnight+0.059
 masse+0.057
Neg logit contributions
 guess-0.036
 count-0.035
 imagine-0.030
 like-0.030
 three-0.030
Token it
Token ID340
Feature activation-5.53e-01

Pos logit contributions
 therapist+0.039
actus+0.038
 Midnight+0.036
lor+0.035
 masse+0.035
Neg logit contributions
 count-0.031
 guess-0.031
 ten-0.030
 three-0.029
 like-0.028
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 therapist+0.027
actus+0.026
 Midnight+0.025
 masse+0.024
lor+0.024
Neg logit contributions
 guess-0.021
 count-0.021
 ten-0.019
 three-0.019
 counted-0.018
Token She
Token ID1375
Feature activation-1.91e+00

Pos logit contributions
 therapist+0.054
actus+0.053
iflower+0.049
 Midnight+0.047
lor+0.047
Neg logit contributions
 One-0.034
 guess-0.034
 three-0.032
 count-0.032
 ten-0.030
Token careful
Token ID8161
Feature activation+3.77e-01

Pos logit contributions
 guess+0.006
 count+0.006
 ten+0.005
 three+0.005
 imagine+0.005
Neg logit contributions
 therapist-0.008
 Midnight-0.007
actus-0.007
 masse-0.007
iflower-0.007
Token and
Token ID290
Feature activation-9.90e-01

Pos logit contributions
 guess+0.014
 count+0.014
 ten+0.013
 three+0.013
 counted+0.013
Neg logit contributions
 therapist-0.018
 Midnight-0.017
actus-0.017
 masse-0.015
iflower-0.015
Token sat
Token ID3332
Feature activation-8.50e-01

Pos logit contributions
actus+0.050
 therapist+0.049
 Midnight+0.047
lor+0.046
 masse+0.045
Neg logit contributions
 guess-0.037
 count-0.036
 three-0.032
 counted-0.032
 imagine-0.032
Token on
Token ID319
Feature activation-8.68e-01

Pos logit contributions
 therapist+0.039
 Midnight+0.038
actus+0.038
 masse+0.036
lor+0.035
Neg logit contributions
 guess-0.033
 count-0.032
 three-0.032
 ten-0.031
 like-0.030
Token the
Token ID262
Feature activation-2.08e+00

Pos logit contributions
actus+0.041
 therapist+0.040
 Midnight+0.038
 masse+0.036
lor+0.036
Neg logit contributions
 guess-0.034
 count-0.033
 three-0.032
 ten-0.032
 One-0.030
Token c
Token ID269
Feature activation-3.91e+00

Pos logit contributions
 Midnight+0.092
 therapist+0.090
lor+0.084
ugg+0.080
+0.078
Neg logit contributions
 three-0.082
 count-0.081
 guess-0.081
 imagine-0.076
 ten-0.074
Tokenactus
Token ID34144
Feature activation-2.98e-01

Pos logit contributions
 Midnight+0.099
 therapist+0.089
 swollen+0.087
 drowned+0.079
 severe+0.077
Neg logit contributions
 One-0.198
 like-0.197
 guess-0.194
ruits-0.192
 ten-0.191
Token.
Token ID13
Feature activation-2.22e-01

Pos logit contributions
 therapist+0.014
actus+0.014
 Midnight+0.013
 masse+0.012
iflower+0.012
Neg logit contributions
 guess-0.012
 count-0.011
 three-0.011
 ten-0.010
 imagine-0.010
Token They
Token ID1119
Feature activation+7.63e-02

Pos logit contributions
 therapist+0.010
actus+0.010
 Midnight+0.010
 masse+0.009
iflower+0.009
Neg logit contributions
 guess-0.009
 count-0.009
 three-0.008
 ten-0.008
 counted-0.008
Token became
Token ID2627
Feature activation-7.53e-01

Pos logit contributions
 guess+0.003
 count+0.003
 three+0.003
 ten+0.003
 imagine+0.002
Neg logit contributions
 therapist-0.004
actus-0.004
 Midnight-0.003
 masse-0.003
iflower-0.003
Token good
Token ID922
Feature activation-5.34e-01

Pos logit contributions
 therapist+0.035
actus+0.035
lor+0.033
 Midnight+0.032
ugg+0.031
Neg logit contributions
 guess-0.031
 count-0.030
 three-0.029
 ten-0.029
 like-0.027
Token you
Token ID345
Feature activation+3.51e-01

Pos logit contributions
 therapist+0.014
actus+0.014
 Midnight+0.013
lor+0.012
 masse+0.012
Neg logit contributions
 guess-0.025
 count-0.025
 three-0.024
 ten-0.024
 imagine-0.023
Token not
Token ID407
Feature activation-3.36e-01

Pos logit contributions
 guess+0.016
 count+0.016
 three+0.015
 ten+0.015
 imagine+0.014
Neg logit contributions
 therapist-0.014
 Midnight-0.013
actus-0.013
 drowned-0.012
iflower-0.012
Token to
Token ID284
Feature activation-5.48e-01

Pos logit contributions
 therapist+0.015
actus+0.015
 Midnight+0.014
 masse+0.013
lor+0.013
Neg logit contributions
 guess-0.014
 count-0.014
 three-0.013
 ten-0.013
 counted-0.013
Token touch
Token ID3638
Feature activation-1.35e+00

Pos logit contributions
 therapist+0.025
actus+0.025
 Midnight+0.023
lor+0.022
iflower+0.022
Neg logit contributions
 guess-0.023
 count-0.023
 ten-0.021
 three-0.021
 imagine-0.020
Token the
Token ID262
Feature activation-2.06e+00

Pos logit contributions
 therapist+0.061
actus+0.059
ugg+0.055
lor+0.054
 Midnight+0.054
Neg logit contributions
 guess-0.053
 count-0.053
 ten-0.053
 three-0.053
 like-0.050
Token c
Token ID269
Feature activation-3.89e+00

Pos logit contributions
 Midnight+0.081
 therapist+0.079
lor+0.077
ugg+0.075
live+0.070
Neg logit contributions
 three-0.092
 guess-0.091
 like-0.087
 count-0.085
 ten-0.083
Tokenactus
Token ID34144
Feature activation-9.23e-01

Pos logit contributions
 Midnight+0.102
 therapist+0.099
 drowned+0.092
 swollen+0.085
 masse+0.083
Neg logit contributions
 like-0.179
ruits-0.176
bell-0.176
iner-0.174
 One-0.170
Token!"
Token ID2474
Feature activation-7.31e-02

Pos logit contributions
 therapist+0.045
actus+0.044
 Midnight+0.043
 masse+0.040
lor+0.040
Neg logit contributions
 guess-0.034
 count-0.033
 ten-0.031
 three-0.031
 like-0.029
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.003
lor+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.003
 ten-0.003
 imagine-0.003
Token
Token ID198
Feature activation-1.84e-01

Pos logit contributions
 therapist+0.057
actus+0.055
 Midnight+0.053
 masse+0.051
iflower+0.049
Neg logit contributions
 guess-0.040
 count-0.037
 One-0.035
 ten-0.034
 three-0.034
TokenTom
Token ID13787
Feature activation+7.81e-01

Pos logit contributions
 therapist+0.008
actus+0.008
 Midnight+0.008
 masse+0.007
lor+0.007
Neg logit contributions
 guess-0.008
 count-0.007
 three-0.007
 ten-0.007
 counted-0.007
Token She
Token ID1375
Feature activation-1.23e+00

Pos logit contributions
actus+0.029
 therapist+0.029
 Midnight+0.028
iflower+0.027
 masse+0.026
Neg logit contributions
 guess-0.021
 count-0.020
 three-0.019
 ten-0.018
 One-0.018
Token hurt
Token ID5938
Feature activation-7.14e-01

Pos logit contributions
actus+0.058
 therapist+0.058
lor+0.054
 Midnight+0.054
 masse+0.053
Neg logit contributions
 guess-0.049
 count-0.046
 counted-0.044
 three-0.044
 ten-0.043
Token her
Token ID607
Feature activation-7.05e-01

Pos logit contributions
actus+0.033
 therapist+0.032
 Midnight+0.029
lor+0.029
iflower+0.029
Neg logit contributions
 guess-0.029
 count-0.028
 three-0.028
 ten-0.028
 like-0.026
Token head
Token ID1182
Feature activation-3.39e-01

Pos logit contributions
 therapist+0.025
actus+0.025
 Midnight+0.023
lor+0.022
 masse+0.021
Neg logit contributions
 guess-0.036
 count-0.035
 three-0.034
 ten-0.033
 imagine-0.032
Token and
Token ID290
Feature activation-2.01e+00

Pos logit contributions
 therapist+0.017
actus+0.017
 Midnight+0.017
 masse+0.016
iflower+0.015
Neg logit contributions
 guess-0.012
 count-0.011
 three-0.010
 ten-0.010
 counted-0.010
Token couldn
Token ID3521
Feature activation-3.88e+00

Pos logit contributions
actus+0.102
lor+0.098
 therapist+0.095
iflower+0.088
ugg+0.088
Neg logit contributions
 guess-0.077
 count-0.075
 like-0.072
 counted-0.072
 three-0.070
Token't
Token ID470
Feature activation-6.34e-01

Pos logit contributions
actus+0.245
 therapist+0.237
iflower+0.221
ugg+0.217
rows+0.210
Neg logit contributions
 One-0.111
 count-0.087
 guess-0.083
 three-0.078
 ten-0.074
Token wake
Token ID7765
Feature activation-1.39e+00

Pos logit contributions
 therapist+0.033
actus+0.032
 Midnight+0.031
lor+0.030
 masse+0.029
Neg logit contributions
 guess-0.023
 count-0.023
 ten-0.020
 imagine-0.020
 three-0.020
Token up
Token ID510
Feature activation-1.08e+00

Pos logit contributions
 therapist+0.067
actus+0.066
ugg+0.062
lor+0.062
iflower+0.059
Neg logit contributions
 guess-0.051
 count-0.051
 ten-0.050
 three-0.048
 like-0.046
Token.
Token ID13
Feature activation-8.11e-01

Pos logit contributions
actus+0.056
 therapist+0.055
 Midnight+0.052
lor+0.051
iflower+0.050
Neg logit contributions
 guess-0.037
 count-0.037
 three-0.034
 like-0.034
 ten-0.034
Token Lucy
Token ID22162
Feature activation-1.88e+00

Pos logit contributions
actus+0.040
 therapist+0.039
 Midnight+0.036
iflower+0.036
 masse+0.035
Neg logit contributions
 guess-0.030
 count-0.028
 three-0.027
 One-0.027
 ten-0.026
Token't
Token ID470
Feature activation+7.73e-01

Pos logit contributions
actus+0.098
 therapist+0.097
ugg+0.086
lor+0.085
 Midnight+0.082
Neg logit contributions
 count-0.076
 guess-0.075
 three-0.071
 One-0.069
 ten-0.068
Token make
Token ID787
Feature activation-6.29e-01

Pos logit contributions
 guess+0.024
 count+0.023
 1+0.023
 three+0.023
 ten+0.022
Neg logit contributions
 Midnight-0.038
actus-0.036
 therapist-0.035
 Woo-0.035
 drowned-0.034
Token you
Token ID345
Feature activation-1.87e-01

Pos logit contributions
 therapist+0.028
actus+0.027
 Midnight+0.026
ugg+0.025
lor+0.024
Neg logit contributions
 guess-0.027
 count-0.026
 three-0.025
 ten-0.025
 imagine-0.023
Token touch
Token ID3638
Feature activation-1.09e+00

Pos logit contributions
 therapist+0.007
actus+0.007
 Midnight+0.007
 masse+0.006
lor+0.006
Neg logit contributions
 guess-0.009
 count-0.009
 three-0.008
 ten-0.008
 imagine-0.008
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
 therapist+0.048
actus+0.046
 Midnight+0.044
ugg+0.043
lor+0.042
Neg logit contributions
 guess-0.044
 count-0.044
 three-0.043
 ten-0.042
 like-0.041
Token c
Token ID269
Feature activation-3.87e+00

Pos logit contributions
 therapist+0.076
 Midnight+0.073
lor+0.069
 masse+0.068
ugg+0.067
Neg logit contributions
 guess-0.075
 three-0.071
 like-0.071
 imagine-0.069
 count-0.069
Tokenactus
Token ID34144
Feature activation-1.01e+00

Pos logit contributions
 Midnight+0.100
 therapist+0.100
 drowned+0.099
 masse+0.089
 hawk+0.084
Neg logit contributions
bell-0.176
 like-0.173
iner-0.171
ill-0.169
ruits-0.168
Token,"
Token ID553
Feature activation+6.91e-01

Pos logit contributions
 therapist+0.049
actus+0.048
 Midnight+0.047
 masse+0.044
ugg+0.043
Neg logit contributions
 guess-0.038
 count-0.037
 ten-0.034
 three-0.034
 imagine-0.033
Token Sam
Token ID3409
Feature activation+5.63e-01

Pos logit contributions
 count+0.029
 guess+0.028
 ten+0.027
ruits+0.027
 imagine+0.027
Neg logit contributions
 therapist-0.027
 Midnight-0.026
actus-0.026
 masse-0.024
 Woo-0.024
Token said
Token ID531
Feature activation-5.91e-01

Pos logit contributions
 count+0.026
 guess+0.026
 ten+0.025
 three+0.024
 imagine+0.024
Neg logit contributions
 therapist-0.021
 Midnight-0.020
actus-0.020
 drowned-0.018
 masse-0.018
Token.
Token ID13
Feature activation-7.70e-01

Pos logit contributions
actus+0.029
 therapist+0.028
 Midnight+0.027
lor+0.026
 masse+0.026
Neg logit contributions
 guess-0.022
 count-0.022
 three-0.021
 ten-0.020
 counted-0.019
TokenSadly
Token ID36725
Feature activation-1.34e+00

Pos logit contributions
 therapist+0.068
actus+0.065
 Midnight+0.061
ugg+0.056
 masse+0.056
Neg logit contributions
 guess-0.054
 count-0.051
 One-0.050
 like-0.047
 three-0.046
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
 therapist+0.087
actus+0.086
 Midnight+0.083
lor+0.081
 masse+0.081
Neg logit contributions
 guess-0.025
 three-0.024
 count-0.024
 ten-0.021
 like-0.020
Token Nut
Token ID11959
Feature activation-2.17e+00

Pos logit contributions
 therapist+0.058
actus+0.056
lor+0.049
ugg+0.046
 Midnight+0.046
Neg logit contributions
 three-0.075
 guess-0.075
 One-0.075
 like-0.074
 count-0.074
Tokenty
Token ID774
Feature activation-2.32e+00

Pos logit contributions
 therapist+0.093
 masse+0.086
actus+0.083
 Midnight+0.083
ugg+0.082
Neg logit contributions
 guess-0.087
 count-0.083
 counted-0.082
 like-0.079
 three-0.075
Token's
Token ID338
Feature activation-1.11e+00

Pos logit contributions
 therapist+0.117
actus+0.115
 masse+0.114
 Midnight+0.110
lor+0.106
Neg logit contributions
 guess-0.081
 counted-0.078
 count-0.075
 three-0.071
 ten-0.066
Token sickness
Token ID26359
Feature activation-3.86e+00

Pos logit contributions
 therapist+0.048
actus+0.046
 Midnight+0.045
lor+0.044
 masse+0.042
Neg logit contributions
 guess-0.047
 count-0.046
 three-0.045
 ten-0.043
 imagine-0.041
Token got
Token ID1392
Feature activation-1.35e+00

Pos logit contributions
actus+0.210
 masse+0.207
 therapist+0.203
iflower+0.199
lor+0.195
Neg logit contributions
 like-0.117
 counted-0.108
 guess-0.093
 count-0.092
 liked-0.089
Token worse
Token ID4785
Feature activation-1.28e+00

Pos logit contributions
 therapist+0.062
actus+0.060
lor+0.055
 Midnight+0.053
ugg+0.051
Neg logit contributions
 three-0.056
 ten-0.055
 count-0.055
 guess-0.055
 like-0.054
Token and
Token ID290
Feature activation-2.35e+00

Pos logit contributions
 therapist+0.073
actus+0.073
 Midnight+0.072
 masse+0.068
lor+0.067
Neg logit contributions
 guess-0.037
 count-0.036
 like-0.033
 three-0.033
 ten-0.032
Token he
Token ID339
Feature activation-1.70e+00

Pos logit contributions
actus+0.125
 therapist+0.124
lor+0.119
ugg+0.114
Charl+0.110
Neg logit contributions
 three-0.074
 guess-0.073
 One-0.071
 count-0.071
 counted-0.070
Token couldn
Token ID3521
Feature activation-3.54e+00

Pos logit contributions
 therapist+0.084
actus+0.081
 masse+0.079
 Midnight+0.078
lor+0.075
Neg logit contributions
 guess-0.063
 counted-0.061
 count-0.060
 three-0.056
 ten-0.054
Token!"
Token ID2474
Feature activation-2.69e+00

Pos logit contributions
 therapist+0.001
actus+0.001
 Midnight+0.001
 masse+0.000
iflower+0.000
Neg logit contributions
 guess-0.000
 count-0.000
 three-0.000
 ten-0.000
 counted-0.000
Token Mr
Token ID1770
Feature activation-1.50e+00

Pos logit contributions
actus+0.155
 therapist+0.144
eds+0.144
iflower+0.142
 hawk+0.140
Neg logit contributions
 One-0.075
 guess-0.068
 three-0.068
 count-0.058
 yes-0.058
Token.
Token ID13
Feature activation-1.89e+00

Pos logit contributions
 therapist+0.079
actus+0.075
 Midnight+0.074
 masse+0.071
lor+0.071
Neg logit contributions
 guess-0.048
 count-0.048
 three-0.047
 ten-0.045
 like-0.044
Token Fox
Token ID5426
Feature activation-6.59e-01

Pos logit contributions
actus+0.070
 therapist+0.067
ugg+0.060
iflower+0.055
 masse+0.054
Neg logit contributions
 three-0.094
 guess-0.092
 count-0.090
 One-0.089
 ten-0.087
Token smiled
Token ID13541
Feature activation-5.75e-01

Pos logit contributions
actus+0.033
 therapist+0.033
 Midnight+0.032
 masse+0.031
lor+0.030
Neg logit contributions
 guess-0.023
 count-0.022
 three-0.021
 ten-0.020
 counted-0.020
Token and
Token ID290
Feature activation-3.86e+00

Pos logit contributions
 therapist+0.031
actus+0.031
 Midnight+0.030
lor+0.028
iflower+0.028
Neg logit contributions
 guess-0.018
 count-0.018
 three-0.018
 ten-0.016
 like-0.016
Token said
Token ID531
Feature activation-6.89e-01

Pos logit contributions
eds+0.250
lor+0.249
actus+0.249
orns+0.238
iflower+0.232
Neg logit contributions
 counted-0.085
 three-0.081
 guess-0.081
 yes-0.074
 like-0.066
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
actus+0.042
 therapist+0.041
 Midnight+0.039
 masse+0.038
iflower+0.038
Neg logit contributions
 guess-0.017
 count-0.017
 three-0.016
 ten-0.015
 yes-0.014
Token "
Token ID366
Feature activation-2.09e+00

Pos logit contributions
actus+0.099
 therapist+0.093
 Midnight+0.088
 drowned+0.088
iflower+0.088
Neg logit contributions
 three-0.044
 guess-0.043
 count-0.042
 yes-0.040
 One-0.038
TokenYou
Token ID1639
Feature activation+2.19e-01

Pos logit contributions
actus+0.087
 disappears+0.077
 drowned+0.076
 therapist+0.075
 Midnight+0.073
Neg logit contributions
 three-0.092
 guess-0.088
 like-0.087
 count-0.083
 especially-0.082
Token're
Token ID821
Feature activation-1.18e+00

Pos logit contributions
 guess+0.006
 count+0.006
 ten+0.005
 three+0.005
 imagine+0.005
Neg logit contributions
 therapist-0.013
actus-0.013
 Midnight-0.013
iflower-0.011
 masse-0.011
Token ear
Token ID1027
Feature activation+2.55e-01

Pos logit contributions
 therapist+0.034
actus+0.033
 Midnight+0.032
lor+0.030
 masse+0.030
Neg logit contributions
 guess-0.031
 count-0.030
 three-0.028
 ten-0.028
 imagine-0.027
Token."
Token ID526
Feature activation-2.63e+00

Pos logit contributions
 guess+0.011
 count+0.011
 three+0.010
 ten+0.010
 counted+0.010
Neg logit contributions
 therapist-0.011
actus-0.011
 Midnight-0.010
lor-0.009
iflower-0.009
Token Fl
Token ID1610
Feature activation-1.73e+00

Pos logit contributions
actus+0.131
iflower+0.129
ich+0.126
 severe+0.126
 therapist+0.125
Neg logit contributions
 One-0.099
 three-0.085
 guess-0.082
 count-0.075
 yes-0.073
Tokenoppy
Token ID26696
Feature activation-1.48e+00

Pos logit contributions
 therapist+0.064
keeping+0.053
 Midnight+0.053
 masse+0.052
actus+0.052
Neg logit contributions
 guess-0.084
 three-0.077
 count-0.076
 One-0.074
 ten-0.073
Token apologized
Token ID22893
Feature activation-1.04e+00

Pos logit contributions
actus+0.076
 therapist+0.075
 Midnight+0.073
 masse+0.071
lor+0.070
Neg logit contributions
 guess-0.052
 count-0.049
 three-0.047
 counted-0.046
 ten-0.043
Token and
Token ID290
Feature activation-3.85e+00

Pos logit contributions
actus+0.063
 therapist+0.061
 Midnight+0.059
iflower+0.058
lor+0.058
Neg logit contributions
 guess-0.028
 count-0.026
 three-0.026
 like-0.024
 ten-0.024
Token they
Token ID484
Feature activation-1.56e+00

Pos logit contributions
actus+0.230
eds+0.223
iflower+0.222
lor+0.221
ich+0.211
Neg logit contributions
 guess-0.105
 counted-0.101
 three-0.099
 yes-0.093
 pretended-0.089
Token continued
Token ID3767
Feature activation-2.27e+00

Pos logit contributions
actus+0.084
 therapist+0.083
lor+0.077
iflower+0.077
 masse+0.077
Neg logit contributions
 guess-0.053
 count-0.051
 counted-0.050
 three-0.049
 ten-0.047
Token to
Token ID284
Feature activation-6.72e-01

Pos logit contributions
actus+0.147
 therapist+0.142
lor+0.135
iflower+0.130
 masse+0.125
Neg logit contributions
 three-0.063
 count-0.062
 guess-0.057
 One-0.056
 ten-0.055
Token have
Token ID423
Feature activation-1.50e+00

Pos logit contributions
 therapist+0.038
actus+0.038
 Midnight+0.036
lor+0.035
 masse+0.034
Neg logit contributions
 count-0.021
 guess-0.021
 three-0.018
 imagine-0.018
 ten-0.018
Token a
Token ID257
Feature activation-7.57e-01

Pos logit contributions
actus+0.096
 therapist+0.094
iflower+0.085
lor+0.084
ich+0.083
Neg logit contributions
 guess-0.039
 count-0.038
 three-0.037
 ten-0.036
 One-0.033
Tokeny
Token ID88
Feature activation-2.08e+00

Pos logit contributions
 therapist+0.055
actus+0.054
 Midnight+0.053
 masse+0.052
lor+0.050
Neg logit contributions
 guess-0.035
 count-0.033
 three-0.032
 counted-0.030
 ten-0.030
Token flew
Token ID13112
Feature activation-9.18e-01

Pos logit contributions
actus+0.109
 therapist+0.106
 Midnight+0.104
 masse+0.104
lor+0.102
Neg logit contributions
 guess-0.070
 count-0.066
 three-0.064
 counted-0.063
 imagine-0.057
Token away
Token ID1497
Feature activation-1.55e+00

Pos logit contributions
actus+0.038
 therapist+0.037
iflower+0.034
 Midnight+0.034
lor+0.034
Neg logit contributions
 guess-0.042
 three-0.041
 count-0.041
 ten-0.040
 like-0.040
Token,
Token ID11
Feature activation-1.97e+00

Pos logit contributions
actus+0.094
 therapist+0.094
lor+0.092
 Midnight+0.090
iflower+0.088
Neg logit contributions
 guess-0.042
 three-0.040
 count-0.038
 like-0.038
 ten-0.035
Token happy
Token ID3772
Feature activation-2.06e+00

Pos logit contributions
actus+0.106
 therapist+0.103
lor+0.102
iflower+0.096
 masse+0.094
Neg logit contributions
 guess-0.068
 like-0.064
 three-0.064
 count-0.063
 counted-0.060
Token and
Token ID290
Feature activation-3.84e+00

Pos logit contributions
 therapist+0.131
 Midnight+0.129
lor+0.128
 masse+0.126
actus+0.126
Neg logit contributions
 like-0.053
 three-0.047
 guess-0.045
 count-0.038
 imagine-0.037
Token clean
Token ID3424
Feature activation-2.10e+00

Pos logit contributions
lor+0.225
actus+0.215
orns+0.207
 therapist+0.203
pan+0.199
Neg logit contributions
 like-0.112
 excited-0.106
 guess-0.104
 counted-0.104
 want-0.095
Token.
Token ID13
Feature activation-3.33e+00

Pos logit contributions
actus+0.117
lor+0.111
ugg+0.111
 therapist+0.110
ich+0.109
Neg logit contributions
 like-0.058
 guess-0.054
 count-0.054
 three-0.054
 imagine-0.049
Token<|endoftext|>
Token ID50256
Feature activation-1.22e+00

Pos logit contributions
 therapist+0.197
actus+0.195
 masse+0.185
ich+0.185
 severe+0.183
Neg logit contributions
 One-0.097
 three-0.078
 like-0.065
 guess-0.064
 Many-0.063
TokenOnce
Token ID7454
Feature activation-5.83e-01

Pos logit contributions
 therapist+0.061
actus+0.058
 Midnight+0.056
 masse+0.056
iflower+0.053
Neg logit contributions
 guess-0.045
 count-0.043
 three-0.042
 One-0.041
 ten-0.040
Token upon
Token ID2402
Feature activation+5.20e-01

Pos logit contributions
 therapist+0.040
actus+0.039
 Midnight+0.039
 masse+0.038
lor+0.037
Neg logit contributions
 guess-0.010
 count-0.009
 three-0.008
 ten-0.007
 imagine-0.007
Token happy
Token ID3772
Feature activation-7.47e-01

Pos logit contributions
 guess+0.010
 count+0.010
 ten+0.009
 imagine+0.009
 three+0.009
Neg logit contributions
 therapist-0.013
 Midnight-0.012
actus-0.012
 masse-0.011
iflower-0.011
Token and
Token ID290
Feature activation-1.84e+00

Pos logit contributions
actus+0.037
 therapist+0.036
 Midnight+0.035
lor+0.034
iflower+0.033
Neg logit contributions
 guess-0.028
 count-0.028
 three-0.027
 ten-0.025
 like-0.025
Token thanked
Token ID26280
Feature activation-2.35e+00

Pos logit contributions
actus+0.102
lor+0.094
 therapist+0.090
 Midnight+0.085
iflower+0.084
Neg logit contributions
 guess-0.068
 count-0.067
 counted-0.062
 three-0.060
 imagine-0.058
Token Farmer
Token ID37755
Feature activation-1.30e+00

Pos logit contributions
actus+0.102
 therapist+0.093
ugg+0.086
lor+0.082
iflower+0.082
Neg logit contributions
 three-0.105
 guess-0.102
 One-0.100
 count-0.099
 ten-0.098
Token Brown
Token ID4373
Feature activation-1.20e+00

Pos logit contributions
actus+0.047
 therapist+0.045
 Midnight+0.042
 masse+0.040
iflower+0.040
Neg logit contributions
 guess-0.063
 count-0.063
 three-0.061
 ten-0.059
 counted-0.058
Token.
Token ID13
Feature activation-3.84e+00

Pos logit contributions
 therapist+0.055
actus+0.055
 Midnight+0.052
 masse+0.051
lor+0.050
Neg logit contributions
 guess-0.045
 count-0.045
 three-0.043
 ten-0.043
 like-0.040
Token They
Token ID1119
Feature activation-1.01e+00

Pos logit contributions
actus+0.220
eds+0.211
 therapist+0.208
iflower+0.206
web+0.205
Neg logit contributions
 One-0.121
 three-0.083
 So-0.080
 Today-0.077
 Two-0.075
Token brought
Token ID3181
Feature activation-1.47e+00

Pos logit contributions
actus+0.050
 therapist+0.049
 Midnight+0.046
lor+0.046
 masse+0.045
Neg logit contributions
 guess-0.038
 count-0.037
 three-0.035
 counted-0.034
 ten-0.034
Token the
Token ID262
Feature activation+8.33e-02

Pos logit contributions
actus+0.080
 therapist+0.079
lor+0.072
iflower+0.070
 masse+0.070
Neg logit contributions
 three-0.050
 guess-0.047
 count-0.046
 ten-0.046
 One-0.042
Token hay
Token ID27678
Feature activation-1.81e+00

Pos logit contributions
 guess+0.004
 count+0.004
 ten+0.004
 three+0.004
 counted+0.004
Neg logit contributions
 therapist-0.003
actus-0.003
 Midnight-0.003
 masse-0.002
iflower-0.002
Token back
Token ID736
Feature activation-9.39e-01

Pos logit contributions
actus+0.101
 therapist+0.099
 Midnight+0.095
 masse+0.094
lor+0.092
Neg logit contributions
 count-0.059
 guess-0.058
 three-0.053
 ten-0.051
 like-0.050
Token,
Token ID11
Feature activation+3.29e-01

Pos logit contributions
 guess+0.018
 count+0.018
 ten+0.017
 three+0.016
 imagine+0.016
Neg logit contributions
 Midnight-0.023
 therapist-0.022
actus-0.022
 Woo-0.021
 drowned-0.020
Token he
Token ID339
Feature activation+1.70e-01

Pos logit contributions
 guess+0.013
 count+0.013
 ten+0.012
 three+0.011
 imagine+0.011
Neg logit contributions
 Midnight-0.015
 therapist-0.015
actus-0.015
 Woo-0.013
 masse-0.013
Token crawled
Token ID45668
Feature activation-6.95e-01

Pos logit contributions
 guess+0.007
 count+0.007
 ten+0.006
 three+0.006
 imagine+0.006
Neg logit contributions
 therapist-0.008
actus-0.008
 Midnight-0.007
 masse-0.007
iflower-0.007
Token under
Token ID739
Feature activation+3.79e-01

Pos logit contributions
 therapist+0.033
actus+0.033
 Midnight+0.032
iflower+0.031
lor+0.030
Neg logit contributions
 guess-0.027
 count-0.026
 three-0.025
 ten-0.025
 like-0.024
Token the
Token ID262
Feature activation-2.35e+00

Pos logit contributions
 guess+0.019
 count+0.019
 imagine+0.017
ruits+0.017
 counted+0.017
Neg logit contributions
 Midnight-0.013
actus-0.013
 therapist-0.012
 drowned-0.011
 Woo-0.011
Token leather
Token ID11620
Feature activation-3.84e+00

Pos logit contributions
 therapist+0.112
lor+0.107
ugg+0.100
 Midnight+0.097
 drowned+0.094
Neg logit contributions
 three-0.092
 count-0.091
 ten-0.087
 guess-0.085
 like-0.083
Token and
Token ID290
Feature activation-6.71e-01

Pos logit contributions
 swollen+0.178
 therapist+0.176
rim+0.174
 Midnight+0.174
Kids+0.171
Neg logit contributions
 three-0.124
 ten-0.119
 guess-0.117
 count-0.115
 like-0.111
Token hid
Token ID24519
Feature activation-6.46e-02

Pos logit contributions
actus+0.033
 therapist+0.033
 Midnight+0.032
lor+0.031
 masse+0.030
Neg logit contributions
 guess-0.025
 count-0.024
 three-0.022
 counted-0.022
 ten-0.022
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.002
iflower+0.002
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.003
 ten-0.003
 imagine-0.002
Token
Token ID220
Feature activation+5.17e-01

Pos logit contributions
 guess+0.007
 count+0.007
 ten+0.007
 three+0.007
 imagine+0.007
Neg logit contributions
actus-0.007
 therapist-0.007
 Midnight-0.007
 masse-0.006
 Woo-0.006
Token
Token ID198
Feature activation+4.00e-01

Pos logit contributions
 guess+0.025
 count+0.025
 three+0.024
 ten+0.023
 imagine+0.023
Neg logit contributions
actus-0.018
 therapist-0.018
 Midnight-0.018
lor-0.015
iflower-0.015
Token help
Token ID1037
Feature activation-8.46e-01

Pos logit contributions
actus+0.071
 therapist+0.066
 Midnight+0.063
ugg+0.061
lor+0.060
Neg logit contributions
 guess-0.048
 count-0.047
 three-0.044
 ten-0.044
 imagine-0.040
Token when
Token ID618
Feature activation-1.63e+00

Pos logit contributions
actus+0.044
 therapist+0.043
 Midnight+0.040
ugg+0.039
lor+0.038
Neg logit contributions
 guess-0.031
 count-0.030
 three-0.028
 ten-0.028
 like-0.026
Token he
Token ID339
Feature activation-4.69e-01

Pos logit contributions
actus+0.071
 therapist+0.068
 Midnight+0.064
lor+0.063
ugg+0.061
Neg logit contributions
 guess-0.072
 count-0.071
 three-0.068
 ten-0.066
 imagine-0.064
Token needed
Token ID2622
Feature activation-1.52e+00

Pos logit contributions
 therapist+0.022
actus+0.022
 Midnight+0.021
 masse+0.020
lor+0.019
Neg logit contributions
 guess-0.018
 count-0.018
 three-0.017
 ten-0.016
 counted-0.016
Token it
Token ID340
Feature activation-1.04e+00

Pos logit contributions
actus+0.077
 therapist+0.071
 Midnight+0.066
ugg+0.065
lor+0.064
Neg logit contributions
 guess-0.063
 count-0.061
 three-0.058
 ten-0.057
 like-0.054
Token.
Token ID13
Feature activation-3.83e+00

Pos logit contributions
actus+0.053
 therapist+0.052
 Midnight+0.050
 masse+0.047
ugg+0.046
Neg logit contributions
 guess-0.039
 count-0.038
 ten-0.034
 three-0.034
 like-0.033
Token<|endoftext|>
Token ID50256
Feature activation-1.37e+00

Pos logit contributions
actus+0.253
 therapist+0.245
eds+0.243
 drowned+0.243
 swollen+0.241
Neg logit contributions
 One-0.091
 three-0.053
 Many-0.046
 Today-0.046
 like-0.038
TokenOnce
Token ID7454
Feature activation+2.33e-01

Pos logit contributions
 therapist+0.066
actus+0.064
 Midnight+0.062
 masse+0.062
iflower+0.059
Neg logit contributions
 guess-0.050
 three-0.048
 count-0.047
 One-0.047
 ten-0.045
Token upon
Token ID2402
Feature activation+1.93e+00

Pos logit contributions
 guess+0.005
 count+0.005
 three+0.004
 ten+0.004
 counted+0.004
Neg logit contributions
actus-0.015
 therapist-0.015
 Midnight-0.014
 masse-0.013
iflower-0.013
Token a
Token ID257
Feature activation+1.40e+00

Pos logit contributions
 count+0.077
 1+0.073
 guess+0.071
 counted+0.070
teen+0.069
Neg logit contributions
 Midnight-0.084
 therapist-0.078
actus-0.075
 Woo-0.073
iflower-0.072
Token time
Token ID640
Feature activation+2.11e+00

Pos logit contributions
 count+0.056
 guess+0.055
 counted+0.053
 three+0.051
 ten+0.051
Neg logit contributions
 Midnight-0.059
actus-0.058
 therapist-0.055
iflower-0.052
 masse-0.052
Token not
Token ID407
Feature activation+1.26e+00

Pos logit contributions
 therapist+0.010
actus+0.010
 Midnight+0.010
 masse+0.009
lor+0.009
Neg logit contributions
 guess-0.008
 count-0.008
 three-0.007
 ten-0.007
 imagine-0.007
Token make
Token ID787
Feature activation-2.87e-01

Pos logit contributions
 1+0.029
 guess+0.026
 three+0.025
 count+0.024
 6+0.023
Neg logit contributions
 Midnight-0.072
actus-0.069
 therapist-0.068
 drowned-0.067
 Woo-0.067
Token Tom
Token ID4186
Feature activation+4.04e-01

Pos logit contributions
 therapist+0.015
actus+0.014
 Midnight+0.014
 masse+0.013
lor+0.013
Neg logit contributions
 guess-0.010
 count-0.010
 three-0.009
 ten-0.009
 imagine-0.009
Token touch
Token ID3638
Feature activation+5.69e-02

Pos logit contributions
 guess+0.018
 count+0.017
 ten+0.016
 three+0.016
 counted+0.016
Neg logit contributions
 therapist-0.016
actus-0.016
 Midnight-0.016
 masse-0.014
 drowned-0.014
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
 guess+0.002
 count+0.002
 three+0.002
 ten+0.002
 counted+0.002
Neg logit contributions
 therapist-0.002
actus-0.002
 Midnight-0.002
 masse-0.002
iflower-0.002
Token c
Token ID269
Feature activation-3.82e+00

Pos logit contributions
 therapist+0.074
 Midnight+0.073
lor+0.068
ugg+0.066
 masse+0.066
Neg logit contributions
 guess-0.076
 three-0.072
 like-0.070
 count-0.069
 imagine-0.069
Tokenactus
Token ID34144
Feature activation-1.07e+00

Pos logit contributions
 Midnight+0.099
 therapist+0.097
 drowned+0.089
 masse+0.083
 hawk+0.080
Neg logit contributions
 like-0.176
iner-0.174
bell-0.173
ill-0.170
ruits-0.170
Token.
Token ID13
Feature activation+7.23e-01

Pos logit contributions
 therapist+0.052
actus+0.050
 Midnight+0.049
lor+0.046
ugg+0.046
Neg logit contributions
 guess-0.041
 count-0.040
 three-0.037
 ten-0.037
 like-0.036
Token He
Token ID679
Feature activation-1.82e-02

Pos logit contributions
 count+0.030
 guess+0.029
 imagine+0.027
 ten+0.027
 three+0.027
Neg logit contributions
 Midnight-0.030
actus-0.029
 therapist-0.029
 Woo-0.027
 masse-0.026
Token tried
Token ID3088
Feature activation-4.85e-01

Pos logit contributions
 therapist+0.001
actus+0.001
 Midnight+0.001
 masse+0.001
iflower+0.001
Neg logit contributions
 guess-0.001
 count-0.001
 three-0.001
 ten-0.001
 imagine-0.001
Token to
Token ID284
Feature activation-3.22e-01

Pos logit contributions
 therapist+0.022
actus+0.022
 Midnight+0.021
lor+0.020
 masse+0.020
Neg logit contributions
 guess-0.020
 count-0.019
 ten-0.018
 three-0.018
 counted-0.017
Token She
Token ID1375
Feature activation-1.06e+00

Pos logit contributions
actus+0.040
 therapist+0.039
iflower+0.037
 Midnight+0.036
lor+0.036
Neg logit contributions
 guess-0.026
 count-0.025
 three-0.025
 One-0.024
 ten-0.023
Token hurt
Token ID5938
Feature activation-5.30e-01

Pos logit contributions
 therapist+0.044
actus+0.043
 Midnight+0.042
lor+0.041
 masse+0.040
Neg logit contributions
 guess-0.047
 count-0.045
 counted-0.043
 three-0.043
 ten-0.042
Token herself
Token ID5223
Feature activation-9.08e-01

Pos logit contributions
actus+0.023
 therapist+0.023
 Midnight+0.021
lor+0.020
iflower+0.020
Neg logit contributions
 guess-0.023
 count-0.022
 three-0.022
 ten-0.021
 like-0.020
Token badly
Token ID11234
Feature activation-1.69e+00

Pos logit contributions
 therapist+0.043
actus+0.043
 Midnight+0.041
lor+0.038
 masse+0.038
Neg logit contributions
 guess-0.035
 count-0.035
 three-0.033
 ten-0.032
 like-0.031
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
 therapist+0.106
actus+0.104
lor+0.100
 Midnight+0.100
 masse+0.094
Neg logit contributions
 like-0.045
 guess-0.040
 count-0.038
 ten-0.038
 three-0.037
Token couldn
Token ID3521
Feature activation-3.81e+00

Pos logit contributions
actus+0.083
lor+0.079
 therapist+0.078
 masse+0.072
iflower+0.072
Neg logit contributions
 guess-0.066
 count-0.063
 like-0.062
 three-0.062
 counted-0.061
Token't
Token ID470
Feature activation-6.82e-01

Pos logit contributions
actus+0.231
 therapist+0.223
ugg+0.213
iflower+0.207
rows+0.207
Neg logit contributions
 One-0.117
 count-0.094
 guess-0.091
 three-0.085
 ten-0.081
Token get
Token ID651
Feature activation-1.73e+00

Pos logit contributions
 therapist+0.036
actus+0.035
 Midnight+0.034
lor+0.032
 masse+0.032
Neg logit contributions
 guess-0.024
 count-0.024
 imagine-0.021
 ten-0.021
 three-0.021
Token out
Token ID503
Feature activation-1.08e-01

Pos logit contributions
 therapist+0.094
actus+0.092
lor+0.091
ugg+0.081
iflower+0.081
Neg logit contributions
 ten-0.061
 count-0.061
 three-0.061
 guess-0.060
 like-0.056
Token of
Token ID286
Feature activation-1.65e+00

Pos logit contributions
 therapist+0.005
actus+0.005
 Midnight+0.005
 masse+0.004
iflower+0.004
Neg logit contributions
 guess-0.004
 count-0.004
 three-0.004
 ten-0.004
 imagine-0.004
Token the
Token ID262
Feature activation-6.84e-01

Pos logit contributions
actus+0.086
 therapist+0.084
lor+0.080
ugg+0.079
iflower+0.076
Neg logit contributions
 ten-0.060
 guess-0.060
 count-0.058
 three-0.057
 like-0.054
Token,
Token ID11
Feature activation+1.77e+00

Pos logit contributions
 guess+0.030
 count+0.029
 ten+0.027
 imagine+0.026
 counted+0.026
Neg logit contributions
actus-0.034
 therapist-0.033
 Midnight-0.033
ugg-0.030
iflower-0.030
Token there
Token ID612
Feature activation+1.02e+00

Pos logit contributions
 count+0.080
 guess+0.079
racted+0.074
 exception+0.073
 imagine+0.072
Neg logit contributions
 Midnight-0.066
 Woo-0.056
 therapist-0.055
actus-0.055
iflower-0.051
Token was
Token ID373
Feature activation+1.74e+00

Pos logit contributions
 guess+0.053
 count+0.052
 ten+0.050
 three+0.049
 1+0.049
Neg logit contributions
 Midnight-0.030
actus-0.030
 therapist-0.029
iflower-0.025
 died-0.025
Token a
Token ID257
Feature activation-4.92e-01

Pos logit contributions
 count+0.076
 guess+0.076
 imagine+0.072
ruits+0.070
ff+0.068
Neg logit contributions
 Midnight-0.070
 Woo-0.064
 therapist-0.060
actus-0.057
 Mitt-0.057
Token swing
Token ID9628
Feature activation-9.19e-01

Pos logit contributions
 therapist+0.024
actus+0.023
 Midnight+0.023
lor+0.022
 masse+0.021
Neg logit contributions
 guess-0.018
 count-0.018
 three-0.017
 ten-0.016
 imagine-0.015
Token.
Token ID13
Feature activation+2.55e+00

Pos logit contributions
 therapist+0.045
actus+0.045
 Midnight+0.044
 masse+0.041
 drowned+0.041
Neg logit contributions
 count-0.033
 guess-0.032
 three-0.031
 ten-0.030
 One-0.029
Token A
Token ID317
Feature activation-5.63e-01

Pos logit contributions
 guess+0.109
 count+0.103
 exception+0.100
 imagine+0.099
 whether+0.097
Neg logit contributions
 Midnight-0.103
 therapist-0.088
 Woo-0.084
 masse-0.082
 Velvet-0.082
Token girl
Token ID2576
Feature activation+2.83e-01

Pos logit contributions
 therapist+0.027
actus+0.027
 Midnight+0.026
lor+0.025
 masse+0.024
Neg logit contributions
 guess-0.021
 count-0.020
 three-0.019
 ten-0.019
 imagine-0.018
Token named
Token ID3706
Feature activation+1.36e+00

Pos logit contributions
 guess+0.011
 count+0.011
 ten+0.010
 three+0.010
 imagine+0.009
Neg logit contributions
 therapist-0.013
actus-0.013
 Midnight-0.013
 masse-0.011
iflower-0.011
Token Lily
Token ID20037
Feature activation-6.19e-01

Pos logit contributions
 guess+0.060
 count+0.059
 imagine+0.056
 counted+0.052
 whether+0.051
Neg logit contributions
 Midnight-0.055
 therapist-0.049
 Woo-0.048
actus-0.047
ugg-0.044
Token loved
Token ID6151
Feature activation-1.42e+00

Pos logit contributions
 therapist+0.033
actus+0.033
 Midnight+0.032
 masse+0.031
lor+0.030
Neg logit contributions
 guess-0.020
 count-0.020
 three-0.018
 counted-0.018
 ten-0.018
Token They
Token ID1119
Feature activation+9.88e-01

Pos logit contributions
 guess+0.085
 count+0.082
 ten+0.080
ptions+0.077
 imagine+0.077
Neg logit contributions
 Woo-0.044
 Midnight-0.043
ugg-0.041
actus-0.040
 Mitt-0.038
Token like
Token ID588
Feature activation-4.66e-01

Pos logit contributions
 guess+0.018
 count+0.016
 ten+0.015
 1+0.014
 three+0.014
Neg logit contributions
 Midnight-0.063
 therapist-0.061
actus-0.061
 Woo-0.058
ugg-0.058
Token to
Token ID284
Feature activation+1.88e+00

Pos logit contributions
actus+0.021
 therapist+0.021
 Midnight+0.020
lor+0.019
iflower+0.018
Neg logit contributions
 guess-0.019
 count-0.019
 three-0.018
 ten-0.018
 imagine-0.017
Token play
Token ID711
Feature activation+1.71e-01

Pos logit contributions
ptions+0.088
 1+0.086
 whether+0.083
illed+0.082
 2+0.082
Neg logit contributions
 Midnight-0.059
 therapist-0.054
 band-0.054
actus-0.051
 Woo-0.049
Token outside
Token ID2354
Feature activation+1.24e+00

Pos logit contributions
 guess+0.007
 count+0.007
 ten+0.006
 three+0.006
 counted+0.006
Neg logit contributions
 therapist-0.008
actus-0.008
 Midnight-0.008
 masse-0.007
iflower-0.007
Token.
Token ID13
Feature activation+2.75e+00

Pos logit contributions
 guess+0.060
 count+0.059
 counted+0.056
ruits+0.055
 blond+0.054
Neg logit contributions
 therapist-0.040
 Midnight-0.040
actus-0.037
 Woo-0.034
 masse-0.034
Token One
Token ID1881
Feature activation+1.32e-01

Pos logit contributions
 guess+0.075
ptions+0.074
epad+0.072
 ten+0.070
 imagined+0.067
Neg logit contributions
 Woo-0.134
 Midnight-0.133
 Mitt-0.129
 Snow-0.121
 Pink-0.120
Token day
Token ID1110
Feature activation+1.55e+00

Pos logit contributions
 guess+0.007
 count+0.007
 ten+0.006
 three+0.006
 counted+0.006
Neg logit contributions
 therapist-0.004
actus-0.004
 Midnight-0.004
 masse-0.004
iflower-0.004
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
 guess+0.069
 count+0.066
 imagine+0.063
 ten+0.061
ptions+0.061
Neg logit contributions
 Midnight-0.055
 Woo-0.054
 therapist-0.051
actus-0.049
ls-0.049
Token they
Token ID484
Feature activation+3.25e-01

Pos logit contributions
 count+0.104
 guess+0.103
ff+0.101
 imagine+0.098
 blond+0.097
Neg logit contributions
 Woo-0.024
 Midnight-0.022
 Mitt-0.019
 therapist-0.017
actus-0.014
Token see
Token ID766
Feature activation+6.47e-01

Pos logit contributions
 guess+0.008
 count+0.008
 three+0.008
 ten+0.007
 counted+0.007
Neg logit contributions
actus-0.018
 Midnight-0.018
 therapist-0.018
 masse-0.017
 drowned-0.017
Token whatever
Token ID4232
Feature activation-1.34e+00

Pos logit contributions
 therapist+0.004
actus+0.004
 Midnight+0.004
 masse+0.003
iflower+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.003
 ten-0.003
 counted-0.002
Token you
Token ID345
Feature activation+2.25e+00

Pos logit contributions
actus+0.052
 therapist+0.049
 Midnight+0.046
lor+0.045
ugg+0.045
Neg logit contributions
 count-0.065
 guess-0.065
 three-0.060
 ten-0.059
 imagine-0.058
Token like
Token ID588
Feature activation-4.55e-01

Pos logit contributions
 blond+0.052
 ten+0.050
 blonde+0.050
 whether+0.050
 guess+0.048
Neg logit contributions
 Midnight-0.113
actus-0.112
 therapist-0.107
 drowned-0.106
 died-0.106
Token,"
Token ID553
Feature activation+7.90e-01

Pos logit contributions
 therapist+0.019
actus+0.019
 Midnight+0.018
 masse+0.016
lor+0.016
Neg logit contributions
 guess-0.020
 count-0.020
 three-0.019
 ten-0.019
 imagine-0.018
Token his
Token ID465
Feature activation+4.26e-01

Pos logit contributions
 count+0.033
 guess+0.032
 ten+0.032
 counted+0.030
ruits+0.030
Neg logit contributions
 therapist-0.034
 Midnight-0.032
actus-0.029
 masse-0.029
 Woo-0.028
Token mom
Token ID1995
Feature activation+2.48e+00

Pos logit contributions
 guess+0.020
 count+0.020
 ten+0.019
 counted+0.019
 like+0.018
Neg logit contributions
 therapist-0.016
actus-0.015
 Midnight-0.015
 masse-0.014
iflower-0.013
Tokenmy
Token ID1820
Feature activation+1.70e+00

Pos logit contributions
 ten+0.105
 count+0.102
 guess+0.101
 imagine+0.101
 mate+0.098
Neg logit contributions
 therapist-0.091
 Midnight-0.083
actus-0.077
 band-0.077
iflower-0.074
Token said
Token ID531
Feature activation+1.09e+00

Pos logit contributions
 ten+0.084
 count+0.082
 guess+0.080
 imagine+0.079
ruits+0.078
Neg logit contributions
 therapist-0.059
 Midnight-0.051
actus-0.044
 band-0.043
 masse-0.042
Token.
Token ID13
Feature activation+5.93e-01

Pos logit contributions
 count+0.051
 guess+0.051
 ten+0.048
 imagine+0.047
 counted+0.045
Neg logit contributions
 therapist-0.040
 Midnight-0.038
actus-0.033
 masse-0.033
ugg-0.032
Token
Token ID198
Feature activation+6.42e-02

Pos logit contributions
 count+0.027
 guess+0.027
 imagine+0.025
 ten+0.025
 three+0.024
Neg logit contributions
 therapist-0.023
 Midnight-0.023
actus-0.022
 Woo-0.020
 masse-0.020
Token
Token ID198
Feature activation+9.16e-01

Pos logit contributions
 guess+0.003
 count+0.003
 three+0.003
 ten+0.003
 imagine+0.003
Neg logit contributions
 therapist-0.003
actus-0.002
 Midnight-0.002
 masse-0.002
lor-0.002
Token to
Token ID284
Feature activation+2.49e+00

Pos logit contributions
 guess+0.057
 count+0.056
ruits+0.054
 imagine+0.054
 blond+0.053
Neg logit contributions
 Midnight-0.032
 therapist-0.032
actus-0.030
iflower-0.028
 masse-0.027
Token count
Token ID954
Feature activation+2.98e+00

Pos logit contributions
ruits+0.070
illed+0.067
ptions+0.061
 blond+0.061
 blonde+0.060
Neg logit contributions
 band-0.107
 therapist-0.106
 drowned-0.105
 died-0.105
actus-0.103
Token!"
Token ID2474
Feature activation-8.53e-01

Pos logit contributions
 guess+0.146
 blond+0.137
illed+0.137
ruits+0.136
 count+0.134
Neg logit contributions
iflower-0.075
 therapist-0.071
 lands-0.070
actus-0.069
eds-0.069
Token So
Token ID1406
Feature activation-6.46e-01

Pos logit contributions
actus+0.047
 therapist+0.045
iflower+0.043
 Midnight+0.043
 masse+0.042
Neg logit contributions
 guess-0.028
 count-0.026
 three-0.025
 One-0.024
 counted-0.023
Token,
Token ID11
Feature activation-8.70e-01

Pos logit contributions
actus+0.035
 therapist+0.034
 Midnight+0.032
lor+0.032
iflower+0.031
Neg logit contributions
 guess-0.021
 count-0.021
 three-0.020
 ten-0.019
 imagine-0.018
Token they
Token ID484
Feature activation+2.72e+00

Pos logit contributions
actus+0.037
 therapist+0.034
lor+0.032
iflower+0.032
 Midnight+0.031
Neg logit contributions
 guess-0.040
 count-0.039
 three-0.038
 ten-0.036
 imagine-0.035
Token started
Token ID2067
Feature activation+1.91e+00

Pos logit contributions
ruits+0.100
 whether+0.099
 guess+0.096
 Today+0.095
 blond+0.095
Neg logit contributions
 Midnight-0.093
 therapist-0.092
 died-0.092
 drowned-0.090
 dri-0.087
Token counting
Token ID14143
Feature activation+2.78e+00

Pos logit contributions
 guess+0.072
ruits+0.071
 yes+0.069
adies+0.065
 blond+0.064
Neg logit contributions
 therapist-0.068
 Midnight-0.063
 drowned-0.061
 ble-0.061
 band-0.061
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
 guess+0.140
ruits+0.136
 count+0.132
 imagine+0.132
 mate+0.131
Neg logit contributions
ls-0.066
 therapist-0.060
 lands-0.059
iflower-0.056
actus-0.055
Token days
Token ID1528
Feature activation+1.62e+00

Pos logit contributions
 guess+0.079
 counted+0.075
ruits+0.074
 whether+0.073
 imagine+0.073
Neg logit contributions
 therapist-0.070
actus-0.069
 masse-0.065
iflower-0.061
 Midnight-0.061
Token on
Token ID319
Feature activation+1.55e-01

Pos logit contributions
 guess+0.069
 count+0.062
 imagine+0.062
 blond+0.060
 One+0.059
Neg logit contributions
 therapist-0.060
actus-0.055
 Midnight-0.054
iflower-0.052
 masse-0.050
Token golf
Token ID13126
Feature activation+4.74e-01

Pos logit contributions
 guess+0.047
 count+0.046
ruits+0.045
 counted+0.044
ff+0.044
Neg logit contributions
 Midnight-0.039
 therapist-0.037
actus-0.036
 Woo-0.035
 band-0.034
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 guess+0.019
 count+0.019
 three+0.018
 imagine+0.017
 ten+0.017
Neg logit contributions
 therapist-0.020
actus-0.020
 Midnight-0.020
 masse-0.018
 Woo-0.017
Token He
Token ID679
Feature activation+1.02e+00

Pos logit contributions
 count+0.062
 imagine+0.060
 guess+0.059
 ten+0.056
 mate+0.055
Neg logit contributions
 therapist-0.047
actus-0.046
 Midnight-0.045
 Woo-0.042
 masse-0.041
Token asked
Token ID1965
Feature activation+2.02e+00

Pos logit contributions
 count+0.040
 guess+0.039
 ten+0.037
ruits+0.036
 imagine+0.035
Neg logit contributions
 therapist-0.045
actus-0.043
 Midnight-0.043
 drowned-0.041
 died-0.039
Token his
Token ID465
Feature activation+8.06e-01

Pos logit contributions
 guess+0.110
 blonde+0.109
 imagine+0.108
 count+0.104
 blond+0.103
Neg logit contributions
 Midnight-0.046
 therapist-0.043
 Woo-0.041
 band-0.037
 Mitt-0.036
Token dad
Token ID9955
Feature activation+2.45e+00

Pos logit contributions
 count+0.038
 guess+0.038
 counted+0.036
 ten+0.036
 One+0.035
Neg logit contributions
actus-0.028
 therapist-0.028
 Midnight-0.025
 masse-0.024
iflower-0.022
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
 blonde+0.103
 blond+0.100
 guess+0.094
 turrets+0.092
 imagine+0.089
Neg logit contributions
 therapist-0.094
 Midnight-0.093
actus-0.081
 Woo-0.079
 Lucky-0.077
Token "
Token ID366
Feature activation+1.80e+00

Pos logit contributions
 count+0.050
 guess+0.050
 imagine+0.048
 counted+0.048
ff+0.046
Neg logit contributions
 Midnight-0.041
 therapist-0.040
 Woo-0.035
 masse-0.034
actus-0.032
TokenWhy
Token ID5195
Feature activation+9.28e-01

Pos logit contributions
 count+0.093
 blond+0.089
 blonde+0.089
 guess+0.088
 imagine+0.088
Neg logit contributions
 therapist-0.057
actus-0.050
 Midnight-0.047
 masse-0.045
lor-0.043
Token do
Token ID466
Feature activation-7.06e-01

Pos logit contributions
 guess+0.037
 three+0.035
 count+0.035
 ten+0.035
 1+0.034
Neg logit contributions
 Midnight-0.038
 therapist-0.037
actus-0.037
 Woo-0.033
iflower-0.033
Token we
Token ID356
Feature activation+2.73e-01

Pos logit contributions
actus+0.024
 therapist+0.024
 Midnight+0.022
ugg+0.021
lor+0.020
Neg logit contributions
 guess-0.037
 count-0.037
 three-0.034
 ten-0.034
 imagine-0.033
Token Every
Token ID3887
Feature activation+1.13e-01

Pos logit contributions
 therapist+0.012
actus+0.012
 Midnight+0.011
 masse+0.010
lor+0.010
Neg logit contributions
 guess-0.014
 count-0.014
 three-0.013
 ten-0.013
 imagine-0.012
Token day
Token ID1110
Feature activation+3.45e-01

Pos logit contributions
 guess+0.005
 count+0.005
 ten+0.005
 three+0.005
 counted+0.005
Neg logit contributions
 therapist-0.004
actus-0.004
 Midnight-0.004
 masse-0.004
iflower-0.004
Token the
Token ID262
Feature activation-2.62e-01

Pos logit contributions
 guess+0.015
 count+0.015
 ten+0.014
 three+0.014
 imagine+0.013
Neg logit contributions
 therapist-0.014
 Midnight-0.014
actus-0.014
 masse-0.012
lor-0.012
Token pin
Token ID6757
Feature activation-8.15e-01

Pos logit contributions
 therapist+0.010
actus+0.010
 Midnight+0.010
 masse+0.009
lor+0.009
Neg logit contributions
 guess-0.012
 count-0.012
 three-0.011
 ten-0.011
 imagine-0.011
Token would
Token ID561
Feature activation+1.50e-01

Pos logit contributions
actus+0.036
 therapist+0.035
 Midnight+0.035
 masse+0.032
iflower+0.032
Neg logit contributions
 guess-0.034
 count-0.033
 three-0.031
 ten-0.030
 counted-0.030
Token slide
Token ID10649
Feature activation+9.60e-01

Pos logit contributions
 guess+0.005
 count+0.005
 three+0.005
 ten+0.005
 counted+0.005
Neg logit contributions
 therapist-0.007
actus-0.007
 Midnight-0.007
 masse-0.007
iflower-0.006
Token down
Token ID866
Feature activation-8.13e-02

Pos logit contributions
 guess+0.041
 count+0.040
 three+0.036
 counted+0.036
 imagine+0.036
Neg logit contributions
 therapist-0.040
actus-0.039
 Midnight-0.038
 Woo-0.035
 masse-0.034
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.003
iflower+0.003
Neg logit contributions
 guess-0.004
 count-0.004
 three-0.003
 ten-0.003
 imagine-0.003
Token leaves
Token ID5667
Feature activation-6.91e-02

Pos logit contributions
 therapist+0.051
 Midnight+0.051
actus+0.049
lor+0.048
ugg+0.046
Neg logit contributions
 guess-0.061
 count-0.061
 three-0.059
 ten-0.057
 imagine-0.054
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.003
lor+0.003
Neg logit contributions
 guess-0.003
 count-0.002
 three-0.002
 ten-0.002
 counted-0.002
Token It
Token ID632
Feature activation+2.26e-01

Pos logit contributions
 count+0.030
 guess+0.029
 ten+0.027
 imagine+0.027
 counted+0.026
Neg logit contributions
 therapist-0.035
 Midnight-0.035
actus-0.033
 masse-0.031
 Woo-0.031
Token planting
Token ID26630
Feature activation-5.62e-01

Pos logit contributions
 therapist+0.064
actus+0.064
lor+0.061
 Midnight+0.060
 masse+0.056
Neg logit contributions
 count-0.042
 three-0.042
 guess-0.041
 ten-0.040
 counted-0.037
Token seeds
Token ID11904
Feature activation-8.80e-01

Pos logit contributions
 therapist+0.021
actus+0.021
 Midnight+0.020
 masse+0.018
lor+0.018
Neg logit contributions
 guess-0.027
 count-0.027
 three-0.025
 ten-0.025
 counted-0.024
Token in
Token ID287
Feature activation+6.05e-01

Pos logit contributions
 therapist+0.045
actus+0.044
 Midnight+0.044
 masse+0.041
 drowned+0.041
Neg logit contributions
 guess-0.031
 count-0.030
 three-0.028
 like-0.027
 ten-0.027
Token the
Token ID262
Feature activation-1.63e+00

Pos logit contributions
 guess+0.031
 count+0.031
 counted+0.029
 imagine+0.028
 three+0.028
Neg logit contributions
 Midnight-0.018
actus-0.018
 therapist-0.018
iflower-0.017
 masse-0.016
Token ground
Token ID2323
Feature activation+7.25e-01

Pos logit contributions
 therapist+0.071
 Midnight+0.067
actus+0.064
lor+0.064
ugg+0.064
Neg logit contributions
 three-0.070
 guess-0.069
 count-0.067
 ten-0.067
 One-0.064
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
 guess+0.032
 count+0.031
 three+0.029
 ten+0.029
 counted+0.029
Neg logit contributions
actus-0.028
 therapist-0.028
 Midnight-0.027
iflower-0.025
lor-0.024
Token One
Token ID1881
Feature activation-6.85e-02

Pos logit contributions
 guess+0.045
 count+0.040
 imagine+0.036
 counted+0.036
 ten+0.035
Neg logit contributions
 Midnight-0.089
actus-0.085
 therapist-0.084
 Woo-0.082
 masse-0.080
Token day
Token ID1110
Feature activation+9.40e-01

Pos logit contributions
 therapist+0.002
actus+0.002
 Midnight+0.002
 masse+0.002
iflower+0.002
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.003
 ten-0.003
 counted-0.003
Token,
Token ID11
Feature activation+6.08e-01

Pos logit contributions
 guess+0.039
 count+0.038
 imagine+0.034
 ten+0.034
 1+0.033
Neg logit contributions
 Midnight-0.040
 therapist-0.039
actus-0.038
 Woo-0.036
ugg-0.034
Token he
Token ID339
Feature activation+5.66e-02

Pos logit contributions
 guess+0.031
 count+0.031
 counted+0.028
 ten+0.028
ruits+0.028
Neg logit contributions
 therapist-0.020
 Midnight-0.020
actus-0.019
 Woo-0.017
 masse-0.017
Token found
Token ID1043
Feature activation-4.83e-01

Pos logit contributions
 guess+0.002
 count+0.002
 three+0.002
 ten+0.002
 imagine+0.002
Neg logit contributions
 therapist-0.003
actus-0.003
 Midnight-0.003
 masse-0.003
iflower-0.003
Token Lily
Token ID20037
Feature activation+1.18e+00

Pos logit contributions
actus+0.023
 therapist+0.023
 Midnight+0.021
lor+0.021
iflower+0.021
Neg logit contributions
 guess-0.022
 count-0.022
 three-0.021
 ten-0.020
 counted-0.020
Token nodded
Token ID14464
Feature activation+1.47e+00

Pos logit contributions
 count+0.044
 ten+0.043
 guess+0.042
 imagine+0.039
 three+0.039
Neg logit contributions
 therapist-0.054
actus-0.052
 Midnight-0.051
 drowned-0.047
ugg-0.046
Token and
Token ID290
Feature activation-5.16e-01

Pos logit contributions
 count+0.075
 guess+0.073
ptions+0.072
 ten+0.071
ruits+0.070
Neg logit contributions
 therapist-0.046
 Midnight-0.044
 band-0.038
 Woo-0.037
 died-0.037
Token said
Token ID531
Feature activation+6.58e-01

Pos logit contributions
actus+0.026
 therapist+0.026
 Midnight+0.025
lor+0.024
iflower+0.024
Neg logit contributions
 guess-0.019
 count-0.018
 three-0.017
 counted-0.016
 ten-0.016
Token,
Token ID11
Feature activation+6.28e-02

Pos logit contributions
 guess+0.024
 count+0.023
 ten+0.021
 imagine+0.020
 three+0.020
Neg logit contributions
 therapist-0.032
 Midnight-0.031
actus-0.030
ugg-0.028
 masse-0.028
Token "
Token ID366
Feature activation+3.19e-01

Pos logit contributions
 guess+0.003
 count+0.002
 ten+0.002
 three+0.002
 imagine+0.002
Neg logit contributions
 therapist-0.003
actus-0.003
 Midnight-0.003
 masse-0.003
iflower-0.002
TokenYes
Token ID5297
Feature activation+3.98e-01

Pos logit contributions
 count+0.014
 guess+0.014
 ten+0.013
 counted+0.013
 three+0.013
Neg logit contributions
 therapist-0.013
actus-0.013
 Midnight-0.012
 masse-0.012
ugg-0.011
Token,
Token ID11
Feature activation+9.80e-01

Pos logit contributions
 guess+0.014
 count+0.013
 counted+0.012
 imagine+0.012
 ten+0.012
Neg logit contributions
 therapist-0.020
 Midnight-0.020
actus-0.019
 masse-0.018
 Woo-0.018
Token Mom
Token ID11254
Feature activation+8.94e-01

Pos logit contributions
 count+0.042
 guess+0.042
 counted+0.039
 ten+0.039
 imagine+0.038
Neg logit contributions
 Midnight-0.039
 therapist-0.039
actus-0.036
 Woo-0.034
 masse-0.034
Token."
Token ID526
Feature activation-3.04e-01

Pos logit contributions
 guess+0.040
 count+0.037
 imagine+0.036
 ten+0.035
 three+0.035
Neg logit contributions
actus-0.034
 therapist-0.033
 Midnight-0.033
iflower-0.029
ugg-0.029
Token
Token ID198
Feature activation-5.42e-01

Pos logit contributions
 therapist+0.015
actus+0.015
 Midnight+0.014
 masse+0.013
iflower+0.013
Neg logit contributions
 guess-0.011
 count-0.011
 three-0.010
 ten-0.010
 counted-0.010
Token and
Token ID290
Feature activation+2.39e-02

Pos logit contributions
 therapist+0.029
actus+0.029
 Midnight+0.029
 masse+0.027
lor+0.026
Neg logit contributions
 guess-0.020
 count-0.019
 three-0.018
 ten-0.017
 counted-0.017
Token Jill
Token ID22772
Feature activation-1.75e-02

Pos logit contributions
 guess+0.001
 count+0.001
 ten+0.001
 three+0.001
 counted+0.001
Neg logit contributions
 therapist-0.001
actus-0.001
 Midnight-0.001
 masse-0.001
iflower-0.001
Token wanted
Token ID2227
Feature activation+8.06e-01

Pos logit contributions
 therapist+0.001
actus+0.001
 Midnight+0.001
 masse+0.001
iflower+0.001
Neg logit contributions
 guess-0.001
 count-0.001
 three-0.001
 ten-0.001
 imagine-0.001
Token to
Token ID284
Feature activation+8.91e-01

Pos logit contributions
 count+0.040
 guess+0.040
 counted+0.037
 imagine+0.037
ruits+0.036
Neg logit contributions
 therapist-0.027
 Midnight-0.026
actus-0.024
 masse-0.022
 Woo-0.022
Token buy
Token ID2822
Feature activation+4.18e-01

Pos logit contributions
 guess+0.030
 count+0.029
 ten+0.029
 three+0.028
ruits+0.028
Neg logit contributions
 therapist-0.041
 Midnight-0.040
actus-0.038
 masse-0.037
 Woo-0.035
Token a
Token ID257
Feature activation+5.05e-01

Pos logit contributions
 guess+0.014
 count+0.013
 counted+0.012
 imagine+0.012
 ten+0.012
Neg logit contributions
 therapist-0.022
actus-0.021
 Midnight-0.021
 masse-0.020
 Woo-0.019
Token present
Token ID1944
Feature activation+4.99e-01

Pos logit contributions
 guess+0.024
 count+0.024
 counted+0.023
 ten+0.023
 three+0.023
Neg logit contributions
actus-0.018
 therapist-0.018
 Midnight-0.017
 masse-0.016
iflower-0.015
Token for
Token ID329
Feature activation+9.40e-01

Pos logit contributions
 guess+0.022
 count+0.021
 three+0.020
 ten+0.020
 counted+0.020
Neg logit contributions
 therapist-0.020
 Midnight-0.019
actus-0.019
 masse-0.017
lor-0.017
Token their
Token ID511
Feature activation+1.01e+00

Pos logit contributions
 count+0.045
 guess+0.044
 counted+0.042
 imagine+0.041
 want+0.041
Neg logit contributions
 Midnight-0.031
 therapist-0.031
actus-0.027
iflower-0.026
 Woo-0.025
Token mom
Token ID1995
Feature activation+1.35e+00

Pos logit contributions
 guess+0.041
 count+0.040
 counted+0.039
 ten+0.038
 like+0.037
Neg logit contributions
 therapist-0.043
actus-0.042
 Midnight-0.041
 masse-0.038
iflower-0.036
Token.
Token ID13
Feature activation+2.21e+00

Pos logit contributions
 guess+0.058
 count+0.056
 ten+0.054
 imagine+0.053
 three+0.052
Neg logit contributions
 Midnight-0.052
 therapist-0.051
actus-0.049
iflower-0.043
 masse-0.043
Token mom
Token ID1995
Feature activation+9.23e-01

Pos logit contributions
 therapist+0.002
actus+0.002
 Midnight+0.002
 masse+0.001
iflower+0.001
Neg logit contributions
 guess-0.002
 count-0.002
 three-0.002
 ten-0.002
 counted-0.002
Token said
Token ID531
Feature activation+2.49e-01

Pos logit contributions
 guess+0.042
 count+0.042
 ten+0.042
 three+0.039
 imagine+0.039
Neg logit contributions
 Midnight-0.033
actus-0.033
 therapist-0.033
 died-0.028
iflower-0.028
Token,
Token ID11
Feature activation-6.83e-02

Pos logit contributions
 guess+0.008
 count+0.008
 ten+0.007
 three+0.007
 imagine+0.007
Neg logit contributions
 therapist-0.013
 Midnight-0.013
actus-0.013
iflower-0.011
 masse-0.011
Token "
Token ID366
Feature activation-9.15e-02

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.003
iflower+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 ten-0.002
 three-0.002
 imagine-0.002
TokenI
Token ID40
Feature activation+8.28e-01

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.002
lor+0.002
Neg logit contributions
 guess-0.005
 count-0.005
 three-0.005
 ten-0.005
 imagine-0.005
Token am
Token ID716
Feature activation+7.10e-01

Pos logit contributions
 count+0.031
 ten+0.031
 three+0.030
 guess+0.030
 1+0.030
Neg logit contributions
actus-0.035
 therapist-0.034
 Midnight-0.034
 drowned-0.031
 died-0.031
Token measuring
Token ID15964
Feature activation+1.18e+00

Pos logit contributions
 guess+0.030
 count+0.029
 imagine+0.027
 counted+0.027
 ten+0.027
Neg logit contributions
 therapist-0.027
 Midnight-0.027
actus-0.025
iflower-0.024
 masse-0.024
Token this
Token ID428
Feature activation+9.46e-01

Pos logit contributions
 guess+0.059
 count+0.057
 counted+0.054
 imagine+0.054
ruits+0.052
Neg logit contributions
 therapist-0.037
 Midnight-0.037
actus-0.037
iflower-0.036
 masse-0.033
Token box
Token ID3091
Feature activation+1.01e+00

Pos logit contributions
 guess+0.050
 count+0.049
 counted+0.047
 ten+0.047
 three+0.047
Neg logit contributions
actus-0.029
 therapist-0.027
 Midnight-0.026
iflower-0.026
 masse-0.022
Token for
Token ID329
Feature activation+1.79e-01

Pos logit contributions
 guess+0.045
 count+0.043
 three+0.040
 ten+0.040
 blond+0.040
Neg logit contributions
 therapist-0.038
actus-0.037
 Midnight-0.037
iflower-0.034
 masse-0.032
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
 guess+0.009
 count+0.009
 three+0.008
 imagine+0.008
 counted+0.008
Neg logit contributions
 therapist-0.006
 Midnight-0.006
actus-0.006
 masse-0.005
iflower-0.005
Token red
Token ID2266
Feature activation-8.16e-01

Pos logit contributions
actus+0.015
 therapist+0.015
 Midnight+0.015
lor+0.014
 masse+0.013
Neg logit contributions
 guess-0.016
 count-0.016
 three-0.015
 ten-0.015
 imagine-0.014
Token balloon
Token ID21190
Feature activation-6.83e-01

Pos logit contributions
 Midnight+0.030
actus+0.029
 therapist+0.029
lor+0.026
iflower+0.026
Neg logit contributions
 guess-0.039
 count-0.038
 three-0.038
 ten-0.036
 imagine-0.035
Token shrunk
Token ID38900
Feature activation+9.15e-03

Pos logit contributions
actus+0.030
 therapist+0.029
 Midnight+0.029
 masse+0.027
iflower+0.026
Neg logit contributions
 guess-0.029
 count-0.028
 three-0.026
 ten-0.025
 counted-0.025
Token itself
Token ID2346
Feature activation+6.47e-02

Pos logit contributions
 guess+0.000
 count+0.000
 three+0.000
 ten+0.000
 imagine+0.000
Neg logit contributions
 therapist-0.000
actus-0.000
 Midnight-0.000
 masse-0.000
iflower-0.000
Token down
Token ID866
Feature activation-4.25e-01

Pos logit contributions
 guess+0.003
 count+0.003
 three+0.002
 ten+0.002
 imagine+0.002
Neg logit contributions
 therapist-0.003
actus-0.003
 Midnight-0.003
 masse-0.003
iflower-0.003
Token to
Token ID284
Feature activation-1.41e-01

Pos logit contributions
 therapist+0.020
actus+0.020
 Midnight+0.019
 masse+0.018
iflower+0.017
Neg logit contributions
 guess-0.017
 count-0.016
 three-0.015
 ten-0.015
 counted-0.014
Token join
Token ID4654
Feature activation+1.35e-01

Pos logit contributions
 therapist+0.006
actus+0.006
 Midnight+0.006
 masse+0.005
lor+0.005
Neg logit contributions
 guess-0.006
 count-0.006
 three-0.006
 ten-0.006
 counted-0.006
Token the
Token ID262
Feature activation-4.61e-01

Pos logit contributions
 guess+0.006
 count+0.006
 ten+0.006
 three+0.006
 counted+0.006
Neg logit contributions
 therapist-0.005
actus-0.005
 Midnight-0.005
 masse-0.005
iflower-0.004
Token jog
Token ID48342
Feature activation-1.08e+00

Pos logit contributions
 therapist+0.020
actus+0.020
 Midnight+0.020
lor+0.018
iflower+0.018
Neg logit contributions
 guess-0.019
 count-0.019
 three-0.018
 ten-0.017
 imagine-0.017
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
actus+0.051
 Midnight+0.050
 therapist+0.050
iflower+0.046
 masse+0.046
Neg logit contributions
 guess-0.041
 count-0.040
 three-0.038
 ten-0.037
 like-0.036
Token All
Token ID1439
Feature activation-8.26e-01

Pos logit contributions
 therapist+0.001
actus+0.001
 Midnight+0.001
 masse+0.001
lor+0.001
Neg logit contributions
 guess-0.001
 count-0.001
 three-0.001
 ten-0.001
 imagine-0.001
Token He
Token ID679
Feature activation+7.97e-01

Pos logit contributions
 guess+0.011
 count+0.011
 ten+0.010
 imagine+0.010
 three+0.009
Neg logit contributions
 therapist-0.016
 Midnight-0.015
actus-0.015
 masse-0.014
 Woo-0.014
Token started
Token ID2067
Feature activation-3.32e-01

Pos logit contributions
 count+0.030
 guess+0.030
 ten+0.028
 imagine+0.027
 three+0.027
Neg logit contributions
 therapist-0.036
actus-0.035
 Midnight-0.034
 drowned-0.031
iflower-0.031
Token picking
Token ID10868
Feature activation+2.22e-01

Pos logit contributions
actus+0.015
 therapist+0.015
 Midnight+0.014
 masse+0.013
lor+0.013
Neg logit contributions
 guess-0.014
 count-0.014
 three-0.013
 ten-0.012
 counted-0.012
Token some
Token ID617
Feature activation+1.27e+00

Pos logit contributions
 guess+0.010
 count+0.010
 three+0.009
 imagine+0.009
 counted+0.009
Neg logit contributions
 therapist-0.009
actus-0.009
 Midnight-0.009
 masse-0.008
iflower-0.008
Token special
Token ID2041
Feature activation+1.47e+00

Pos logit contributions
 guess+0.061
 count+0.059
 counted+0.057
 imagine+0.056
 ten+0.055
Neg logit contributions
actus-0.044
 therapist-0.042
 Midnight-0.041
iflower-0.039
 masse-0.038
Token oats
Token ID47009
Feature activation-8.22e-01

Pos logit contributions
 guess+0.075
 count+0.072
 counted+0.069
 imagine+0.068
 ten+0.067
Neg logit contributions
 therapist-0.045
actus-0.045
 Midnight-0.040
iflower-0.039
 masse-0.038
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
actus+0.042
 therapist+0.042
 Midnight+0.041
 masse+0.039
lor+0.038
Neg logit contributions
 guess-0.029
 count-0.029
 ten-0.026
 three-0.026
 counted-0.025
Token He
Token ID679
Feature activation+4.77e-01

Pos logit contributions
 guess+0.010
 count+0.010
 ten+0.009
 imagine+0.009
 three+0.009
Neg logit contributions
 therapist-0.014
 Midnight-0.014
actus-0.014
 masse-0.013
 Woo-0.013
Token picked
Token ID6497
Feature activation+3.02e-01

Pos logit contributions
 guess+0.019
 count+0.019
 ten+0.017
 three+0.017
 imagine+0.017
Neg logit contributions
 therapist-0.021
actus-0.021
 Midnight-0.020
iflower-0.018
 drowned-0.018
Token and
Token ID290
Feature activation-6.81e-02

Pos logit contributions
 guess+0.012
 count+0.012
 imagine+0.010
 three+0.010
 counted+0.010
Neg logit contributions
 therapist-0.014
actus-0.014
 Midnight-0.013
 masse-0.012
iflower-0.012
Token picked
Token ID6497
Feature activation+5.79e-01

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.003
iflower+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.002
 ten-0.002
 imagine-0.002
Token day
Token ID1110
Feature activation+8.50e-01

Pos logit contributions
 guess+0.033
 count+0.033
 ten+0.031
 counted+0.031
 1+0.031
Neg logit contributions
actus-0.016
 therapist-0.015
 Midnight-0.014
iflower-0.013
 masse-0.013
Token,
Token ID11
Feature activation+6.75e-01

Pos logit contributions
 count+0.032
 guess+0.032
 ten+0.028
 imagine+0.028
 counted+0.028
Neg logit contributions
 therapist-0.039
 Midnight-0.039
actus-0.037
 Woo-0.035
 masse-0.033
Token they
Token ID484
Feature activation+1.39e+00

Pos logit contributions
 count+0.034
 guess+0.033
 counted+0.031
 imagine+0.031
 ten+0.031
Neg logit contributions
 therapist-0.023
 Midnight-0.023
actus-0.021
 Woo-0.020
 masse-0.018
Token saw
Token ID2497
Feature activation+3.22e-01

Pos logit contributions
 count+0.059
 guess+0.055
 blond+0.052
 whether+0.051
 exception+0.050
Neg logit contributions
actus-0.055
 Midnight-0.054
 therapist-0.052
 died-0.051
 drowned-0.051
Token something
Token ID1223
Feature activation-1.94e-01

Pos logit contributions
 count+0.010
 guess+0.010
 imagine+0.009
 counted+0.009
 ten+0.009
Neg logit contributions
 therapist-0.017
 Midnight-0.017
actus-0.017
 Woo-0.015
iflower-0.015
Token shiny
Token ID22441
Feature activation-2.29e-02

Pos logit contributions
 therapist+0.009
actus+0.009
 Midnight+0.009
 masse+0.008
lor+0.008
Neg logit contributions
 guess-0.007
 count-0.007
 three-0.007
 ten-0.007
 imagine-0.006
Token in
Token ID287
Feature activation-1.54e-02

Pos logit contributions
 therapist+0.001
actus+0.001
 Midnight+0.001
 masse+0.001
iflower+0.001
Neg logit contributions
 guess-0.001
 count-0.001
 three-0.001
 ten-0.001
 counted-0.001
Token the
Token ID262
Feature activation-6.37e-01

Pos logit contributions
 therapist+0.001
actus+0.001
 Midnight+0.001
 masse+0.000
iflower+0.000
Neg logit contributions
 guess-0.001
 count-0.001
 three-0.001
 ten-0.001
 counted-0.001
Token hole
Token ID7604
Feature activation+3.53e-01

Pos logit contributions
 therapist+0.023
 Midnight+0.022
actus+0.021
lor+0.019
 masse+0.019
Neg logit contributions
 count-0.032
 guess-0.032
 three-0.031
 ten-0.031
 imagine-0.029
Token.
Token ID13
Feature activation+8.89e-01

Pos logit contributions
 guess+0.016
 count+0.015
 three+0.014
 ten+0.014
 imagine+0.014
Neg logit contributions
 therapist-0.014
actus-0.014
 Midnight-0.013
iflower-0.012
lor-0.012
Token They
Token ID1119
Feature activation+2.73e+00

Pos logit contributions
 count+0.037
 guess+0.036
 imagine+0.033
 ten+0.033
 counted+0.032
Neg logit contributions
 Midnight-0.039
 therapist-0.038
actus-0.037
 Woo-0.035
 masse-0.034
Token do
Token ID466
Feature activation-7.05e-02

Pos logit contributions
 guess+0.027
 count+0.027
 ten+0.026
 counted+0.025
 three+0.025
Neg logit contributions
 therapist-0.037
 Midnight-0.036
actus-0.035
 masse-0.034
 drowned-0.033
Token things
Token ID1243
Feature activation-7.01e-01

Pos logit contributions
 therapist+0.003
actus+0.003
 Midnight+0.003
 masse+0.003
iflower+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.002
 ten-0.002
 counted-0.002
Token we
Token ID356
Feature activation+8.05e-01

Pos logit contributions
 therapist+0.034
actus+0.034
 Midnight+0.032
 masse+0.030
lor+0.030
Neg logit contributions
 guess-0.026
 count-0.026
 three-0.024
 ten-0.024
 imagine-0.023
Token don
Token ID836
Feature activation-1.79e+00

Pos logit contributions
 guess+0.024
 count+0.023
 ten+0.022
 three+0.022
 counted+0.021
Neg logit contributions
 Midnight-0.041
 therapist-0.041
actus-0.041
 drowned-0.038
iflower-0.037
Token't
Token ID470
Feature activation+3.48e-01

Pos logit contributions
actus+0.080
 therapist+0.079
 Midnight+0.072
ugg+0.069
lor+0.067
Neg logit contributions
 count-0.075
 guess-0.075
 One-0.070
 three-0.069
 like-0.067
Token want
Token ID765
Feature activation+1.80e-01

Pos logit contributions
 guess+0.009
 count+0.008
 ten+0.008
 three+0.007
 counted+0.007
Neg logit contributions
 Midnight-0.020
 therapist-0.020
actus-0.020
 masse-0.019
 Woo-0.018
Token to
Token ID284
Feature activation+4.33e-01

Pos logit contributions
 guess+0.008
 count+0.008
 three+0.007
 ten+0.007
 imagine+0.007
Neg logit contributions
 therapist-0.007
 Midnight-0.007
actus-0.007
 masse-0.006
iflower-0.006
Token do
Token ID466
Feature activation-1.36e-01

Pos logit contributions
 guess+0.016
 count+0.015
 three+0.014
 counted+0.014
 ten+0.014
Neg logit contributions
 Midnight-0.020
 therapist-0.020
actus-0.020
 masse-0.018
 drowned-0.018
Token in
Token ID287
Feature activation-1.17e-01

Pos logit contributions
 therapist+0.006
actus+0.006
 Midnight+0.006
 masse+0.005
iflower+0.005
Neg logit contributions
 guess-0.006
 count-0.006
 three-0.005
 ten-0.005
 counted-0.005
Token order
Token ID1502
Feature activation+5.01e-01

Pos logit contributions
 therapist+0.005
actus+0.005
 Midnight+0.005
 masse+0.004
iflower+0.004
Neg logit contributions
 guess-0.005
 count-0.005
 three-0.005
 ten-0.005
 counted-0.004
Token to
Token ID284
Feature activation+4.45e-01

Pos logit contributions
 guess+0.022
 count+0.021
 three+0.020
 imagine+0.020
 ten+0.020
Neg logit contributions
 therapist-0.020
actus-0.020
 Midnight-0.020
 masse-0.018
iflower-0.017
Token outside
Token ID2354
Feature activation+1.67e-01

Pos logit contributions
 therapist+0.000
actus+0.000
 Midnight+0.000
 masse+0.000
iflower+0.000
Neg logit contributions
 guess-0.000
 count-0.000
 three-0.000
 ten-0.000
 counted-0.000
Token his
Token ID465
Feature activation-8.90e-01

Pos logit contributions
 guess+0.006
 count+0.006
 ten+0.006
 three+0.006
 counted+0.006
Neg logit contributions
 therapist-0.008
actus-0.008
 Midnight-0.007
 masse-0.007
 drowned-0.007
Token room
Token ID2119
Feature activation-2.54e-01

Pos logit contributions
 therapist+0.037
 Midnight+0.035
actus+0.034
lor+0.031
 masse+0.031
Neg logit contributions
 guess-0.042
 count-0.041
 three-0.039
 ten-0.038
 imagine-0.038
Token.
Token ID13
Feature activation+8.70e-01

Pos logit contributions
 therapist+0.012
actus+0.012
 Midnight+0.012
 masse+0.011
iflower+0.011
Neg logit contributions
 guess-0.009
 count-0.009
 three-0.009
 ten-0.009
 counted-0.008
Token He
Token ID679
Feature activation+1.06e+00

Pos logit contributions
 count+0.034
 guess+0.034
 imagine+0.032
 ten+0.032
 three+0.030
Neg logit contributions
 therapist-0.038
 Midnight-0.037
actus-0.037
 masse-0.034
 Woo-0.033
Token got
Token ID1392
Feature activation-7.84e-02

Pos logit contributions
 count+0.042
 guess+0.040
ruits+0.040
 ten+0.039
 imagine+0.037
Neg logit contributions
actus-0.045
 therapist-0.045
 Midnight-0.043
 drowned-0.041
 died-0.041
Token out
Token ID503
Feature activation-6.25e-01

Pos logit contributions
 therapist+0.004
actus+0.004
 Midnight+0.003
 masse+0.003
iflower+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.003
 ten-0.003
 imagine-0.003
Token of
Token ID286
Feature activation-1.53e+00

Pos logit contributions
actus+0.033
 therapist+0.033
 Midnight+0.032
lor+0.030
iflower+0.030
Neg logit contributions
 guess-0.020
 count-0.020
 three-0.020
 ten-0.018
 counted-0.017
Token bed
Token ID3996
Feature activation-1.13e+00

Pos logit contributions
actus+0.083
 therapist+0.081
lor+0.079
iflower+0.072
 Midnight+0.072
Neg logit contributions
 count-0.052
 three-0.051
 guess-0.051
 ten-0.050
 One-0.046
Token to
Token ID284
Feature activation+8.86e-02

Pos logit contributions
actus+0.064
 therapist+0.063
 Midnight+0.063
 masse+0.060
iflower+0.059
Neg logit contributions
 guess-0.032
 count-0.030
 three-0.030
 ten-0.027
 counted-0.027
Token investigate
Token ID9161
Feature activation+5.77e-01

Pos logit contributions
 guess+0.003
 count+0.003
 three+0.003
 ten+0.003
 counted+0.003
Neg logit contributions
 therapist-0.004
actus-0.004
 Midnight-0.004
 masse-0.004
iflower-0.004
Token don
Token ID836
Feature activation-2.15e+00

Pos logit contributions
 therapist+0.082
actus+0.080
 Midnight+0.078
 masse+0.076
lor+0.074
Neg logit contributions
 guess-0.040
 count-0.039
 like-0.033
 ten-0.033
 three-0.033
Token't
Token ID470
Feature activation-5.95e-01

Pos logit contributions
actus+0.109
 therapist+0.108
ugg+0.097
 Midnight+0.095
lor+0.093
Neg logit contributions
 count-0.078
 guess-0.076
 One-0.074
 like-0.072
 three-0.071
Token belong
Token ID5594
Feature activation-1.12e-01

Pos logit contributions
 therapist+0.031
actus+0.030
 Midnight+0.029
lor+0.028
 masse+0.028
Neg logit contributions
 guess-0.021
 count-0.021
 three-0.019
 ten-0.019
 imagine-0.018
Token to
Token ID284
Feature activation-6.44e-01

Pos logit contributions
 therapist+0.005
actus+0.005
 Midnight+0.005
 masse+0.004
lor+0.004
Neg logit contributions
 guess-0.005
 count-0.005
 three-0.004
 ten-0.004
 imagine-0.004
Token you
Token ID345
Feature activation+9.34e-01

Pos logit contributions
 therapist+0.023
actus+0.023
 Midnight+0.021
 masse+0.020
lor+0.019
Neg logit contributions
 guess-0.032
 count-0.032
 ten-0.030
 three-0.030
 imagine-0.029
Token!"
Token ID2474
Feature activation-2.55e+00

Pos logit contributions
 guess+0.043
 count+0.042
 three+0.039
 counted+0.039
 ten+0.039
Neg logit contributions
 Midnight-0.035
 therapist-0.035
actus-0.035
lor-0.030
 Woo-0.030
Token The
Token ID383
Feature activation-8.83e-01

Pos logit contributions
actus+0.143
iflower+0.135
eds+0.131
 therapist+0.129
lor+0.124
Neg logit contributions
 guess-0.078
 three-0.077
 One-0.076
 count-0.065
 like-0.064
Token thief
Token ID25906
Feature activation+9.74e-01

Pos logit contributions
 therapist+0.035
actus+0.035
 Midnight+0.034
lor+0.032
ugg+0.031
Neg logit contributions
 guess-0.039
 three-0.039
 count-0.038
 ten-0.037
 imagine-0.034
Token was
Token ID373
Feature activation+4.25e-01

Pos logit contributions
 count+0.039
 guess+0.039
 ten+0.038
ruits+0.036
 three+0.036
Neg logit contributions
 therapist-0.042
 Midnight-0.041
actus-0.039
 drowned-0.036
 died-0.036
Token surprised
Token ID6655
Feature activation-5.75e-03

Pos logit contributions
 count+0.017
 guess+0.017
 ten+0.015
 imagine+0.015
 three+0.015
Neg logit contributions
 therapist-0.019
 Midnight-0.019
actus-0.018
 masse-0.017
 drowned-0.017
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
 therapist+0.000
actus+0.000
 Midnight+0.000
 masse+0.000
lor+0.000
Neg logit contributions
 guess-0.000
 count-0.000
 ten-0.000
 three-0.000
 counted-0.000
Tokenpillar
Token ID41643
Feature activation-7.08e-01

Pos logit contributions
 drowned+0.088
 disappears+0.085
keeping+0.080
 died+0.079
burn+0.078
Neg logit contributions
 like-0.137
 guess-0.130
 want-0.124
 three-0.122
 count-0.119
Token for
Token ID329
Feature activation-2.08e+00

Pos logit contributions
 therapist+0.036
actus+0.036
 Midnight+0.034
 masse+0.033
lor+0.032
Neg logit contributions
 guess-0.025
 count-0.025
 three-0.023
 ten-0.022
 imagine-0.021
Token the
Token ID262
Feature activation-2.89e-01

Pos logit contributions
actus+0.130
 therapist+0.113
lor+0.102
 masse+0.102
act+0.102
Neg logit contributions
 count-0.072
 guess-0.070
 three-0.069
 ten-0.067
 yes-0.062
Token wonderful
Token ID7932
Feature activation-7.52e-01

Pos logit contributions
 therapist+0.012
actus+0.012
 Midnight+0.012
 masse+0.011
lor+0.010
Neg logit contributions
 guess-0.013
 count-0.013
 three-0.012
 ten-0.012
 imagine-0.011
Token gift
Token ID6979
Feature activation-9.40e-01

Pos logit contributions
 therapist+0.030
actus+0.030
 Midnight+0.030
lor+0.028
 masse+0.027
Neg logit contributions
 guess-0.034
 count-0.034
 three-0.032
 imagine-0.030
 ten-0.030
Token and
Token ID290
Feature activation-2.47e+00

Pos logit contributions
actus+0.048
 therapist+0.047
 Midnight+0.046
 masse+0.044
iflower+0.043
Neg logit contributions
 guess-0.033
 count-0.032
 three-0.029
 ten-0.028
 counted-0.028
Token said
Token ID531
Feature activation-5.04e-01

Pos logit contributions
actus+0.140
lor+0.127
 therapist+0.123
iflower+0.118
 Midnight+0.118
Neg logit contributions
 guess-0.087
 count-0.082
 counted-0.081
 three-0.079
 like-0.074
Token goodbye
Token ID24829
Feature activation-3.95e-01

Pos logit contributions
actus+0.027
 therapist+0.027
 Midnight+0.026
 masse+0.024
iflower+0.024
Neg logit contributions
 guess-0.016
 count-0.016
 three-0.015
 ten-0.015
 counted-0.014
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 therapist+0.018
actus+0.018
 Midnight+0.017
 masse+0.016
iflower+0.016
Neg logit contributions
 guess-0.016
 count-0.015
 three-0.015
 ten-0.014
 counted-0.014
Token
Token ID220
Feature activation-1.18e+00

Pos logit contributions
actus+0.080
 therapist+0.079
 Midnight+0.073
 severe+0.073
iflower+0.073
Neg logit contributions
 One-0.042
 guess-0.041
 count-0.040
 three-0.040
 counted-0.037
Token\n
Token ID198
Feature activation-1.22e+00

Pos logit contributions
 therapist+0.057
actus+0.056
 Midnight+0.053
 masse+0.052
iflower+0.049
Neg logit contributions
 guess-0.046
 count-0.044
 three-0.041
 ten-0.040
 One-0.040
Token honest
Token ID5508
Feature activation-1.10e-01

Pos logit contributions
 guess+0.024
 count+0.023
 counted+0.021
 ten+0.021
 imagine+0.020
Neg logit contributions
 Midnight-0.029
 therapist-0.029
actus-0.027
 masse-0.027
 Woo-0.027
Token and
Token ID290
Feature activation-3.42e-01

Pos logit contributions
 therapist+0.006
actus+0.006
 Midnight+0.005
 masse+0.005
iflower+0.005
Neg logit contributions
 guess-0.004
 count-0.004
 three-0.003
 ten-0.003
 counted-0.003
Token very
Token ID845
Feature activation+4.21e-01

Pos logit contributions
actus+0.018
 therapist+0.018
 Midnight+0.018
lor+0.017
 masse+0.017
Neg logit contributions
 guess-0.011
 count-0.011
 three-0.010
 ten-0.009
 counted-0.009
Token excited
Token ID6568
Feature activation+2.57e-01

Pos logit contributions
 guess+0.017
 count+0.017
 ten+0.015
 three+0.015
 counted+0.015
Neg logit contributions
 therapist-0.019
actus-0.019
 Midnight-0.018
 masse-0.017
iflower-0.016
Token because
Token ID780
Feature activation+5.63e-01

Pos logit contributions
 guess+0.011
 count+0.011
 ten+0.010
 three+0.010
 counted+0.010
Neg logit contributions
 therapist-0.011
actus-0.011
 Midnight-0.010
 masse-0.009
iflower-0.009
Token it
Token ID340
Feature activation-2.22e+00

Pos logit contributions
 guess+0.029
 count+0.029
 ten+0.026
 counted+0.026
 three+0.026
Neg logit contributions
 therapist-0.019
 Midnight-0.018
actus-0.018
 masse-0.016
 Woo-0.016
Token was
Token ID373
Feature activation-1.63e-01

Pos logit contributions
actus+0.125
lor+0.118
 therapist+0.118
eds+0.112
iflower+0.111
Neg logit contributions
 count-0.066
 counted-0.065
 guess-0.064
 three-0.059
 like-0.058
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
 therapist+0.008
actus+0.008
 Midnight+0.007
 masse+0.007
lor+0.007
Neg logit contributions
 guess-0.006
 count-0.006
 three-0.006
 ten-0.006
 counted-0.005
Token special
Token ID2041
Feature activation-8.01e-01

Pos logit contributions
 therapist+0.060
actus+0.060
lor+0.058
 Midnight+0.057
orns+0.055
Neg logit contributions
 guess-0.045
 three-0.043
 count-0.043
 ten-0.039
 imagine-0.039
Token day
Token ID1110
Feature activation-8.01e-02

Pos logit contributions
actus+0.031
 therapist+0.031
 Midnight+0.031
lor+0.029
 drowned+0.028
Neg logit contributions
 guess-0.037
 count-0.036
 three-0.035
 ten-0.033
 imagine-0.033
Token.
Token ID13
Feature activation+7.25e-01

Pos logit contributions
 therapist+0.004
actus+0.004
 Midnight+0.003
 masse+0.003
iflower+0.003
Neg logit contributions
 guess-0.003
 count-0.003
 three-0.003
 ten-0.003
 imagine-0.003
Token Sam
Token ID3409
Feature activation+4.08e-02

Pos logit contributions
actus+0.022
 therapist+0.022
 Midnight+0.021
iflower+0.020
lor+0.020
Neg logit contributions
 guess-0.018
 count-0.018
 three-0.017
 ten-0.016
 counted-0.016
Token says
Token ID1139
Feature activation+2.13e-01

Pos logit contributions
 guess+0.002
 count+0.002
 three+0.002
 ten+0.002
 imagine+0.001
Neg logit contributions
 therapist-0.002
actus-0.002
 Midnight-0.002
 masse-0.002
iflower-0.002
Token,
Token ID11
Feature activation-8.00e-01

Pos logit contributions
 guess+0.007
 count+0.007
 ten+0.006
 three+0.006
 imagine+0.006
Neg logit contributions
 therapist-0.011
 Midnight-0.011
actus-0.011
 masse-0.010
iflower-0.010
Token "
Token ID366
Feature activation-1.46e+00

Pos logit contributions
actus+0.042
 therapist+0.041
 Midnight+0.040
 masse+0.038
lor+0.038
Neg logit contributions
 guess-0.026
 count-0.026
 three-0.025
 ten-0.023
 counted-0.023
TokenI
Token ID40
Feature activation+1.73e-01

Pos logit contributions
actus+0.057
 therapist+0.055
 Midnight+0.055
lor+0.053
 masse+0.050
Neg logit contributions
 guess-0.064
 three-0.059
 like-0.058
 count-0.058
 One-0.057
Token don
Token ID836
Feature activation-2.44e+00

Pos logit contributions
 count+0.006
 guess+0.006
 three+0.006
 ten+0.006
 counted+0.006
Neg logit contributions
 therapist-0.008
actus-0.008
 Midnight-0.008
 drowned-0.007
 masse-0.007
Token't
Token ID470
Feature activation+4.61e-01

Pos logit contributions
actus+0.125
 therapist+0.123
ugg+0.112
lor+0.110
 Midnight+0.109
Neg logit contributions
 count-0.083
 guess-0.082
 One-0.079
 three-0.078
 like-0.073
Token know
Token ID760
Feature activation+6.68e-01

Pos logit contributions
 count+0.008
 guess+0.008
 1+0.006
 counted+0.006
 ten+0.006
Neg logit contributions
 Midnight-0.030
 therapist-0.030
actus-0.029
 Woo-0.028
 masse-0.028
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 guess+0.027
 count+0.026
 three+0.024
 imagine+0.023
 counted+0.023
Neg logit contributions
 Midnight-0.030
 therapist-0.029
actus-0.029
iflower-0.027
 masse-0.026
Token Maybe
Token ID6674
Feature activation+1.94e-01

Pos logit contributions
 guess+0.007
 count+0.007
 ten+0.006
 three+0.006
 imagine+0.006
Neg logit contributions
 therapist-0.008
 Midnight-0.007
actus-0.007
 masse-0.007
lor-0.006
Token it
Token ID340
Feature activation-1.18e+00

Pos logit contributions
 guess+0.010
 count+0.009
 three+0.009
 ten+0.009
 counted+0.009
Neg logit contributions
 therapist-0.007
 Midnight-0.007
actus-0.007
 masse-0.006
iflower-0.006
Token a
Token ID257
Feature activation-1.24e+00

Pos logit contributions
 guess+0.008
 count+0.008
 ten+0.007
 three+0.007
 counted+0.007
Neg logit contributions
 therapist-0.009
actus-0.009
 Midnight-0.009
 masse-0.008
 Woo-0.008
Token bee
Token ID20697
Feature activation-1.59e+00

Pos logit contributions
 therapist+0.055
 Midnight+0.053
actus+0.051
lor+0.051
ugg+0.050
Neg logit contributions
 guess-0.053
 count-0.051
 three-0.050
 ten-0.048
 imagine-0.046
Token flying
Token ID7348
Feature activation-1.53e-01

Pos logit contributions
 therapist+0.076
actus+0.073
 Midnight+0.070
 masse+0.070
ugg+0.066
Neg logit contributions
 count-0.063
 guess-0.062
 like-0.059
 imagine-0.057
 three-0.056
Token near
Token ID1474
Feature activation+2.41e-01

Pos logit contributions
 therapist+0.007
actus+0.007
 Midnight+0.007
 masse+0.006
lor+0.006
Neg logit contributions
 guess-0.006
 count-0.006
 three-0.005
 ten-0.005
 counted-0.005
Token a
Token ID257
Feature activation-2.32e+00

Pos logit contributions
 guess+0.011
 count+0.011
 counted+0.010
 three+0.010
 imagine+0.010
Neg logit contributions
 therapist-0.009
 Midnight-0.009
actus-0.009
 masse-0.008
iflower-0.008
Token da
Token ID12379
Feature activation-2.66e+00

Pos logit contributions
 therapist+0.090
ugg+0.088
 Midnight+0.085
 masse+0.080
lor+0.078
Neg logit contributions
 guess-0.111
 ten-0.107
 three-0.105
 count-0.105
 like-0.100
Tokenisy
Token ID13560
Feature activation-1.08e+00

Pos logit contributions
 therapist+0.110
keeping+0.109
 Midnight+0.106
 drowned+0.105
 Woo+0.101
Neg logit contributions
 count-0.108
 guess-0.107
 like-0.105
 imagine-0.099
 ten-0.098
Token.
Token ID13
Feature activation+3.39e-01

Pos logit contributions
 therapist+0.052
actus+0.050
 Midnight+0.050
 masse+0.047
 Woo+0.046
Neg logit contributions
 guess-0.041
 count-0.040
 three-0.038
 ten-0.037
 imagine-0.036
Token He
Token ID679
Feature activation+5.24e-01

Pos logit contributions
 count+0.013
 guess+0.013
 ten+0.012
 imagine+0.012
 three+0.012
Neg logit contributions
 Midnight-0.015
 therapist-0.015
actus-0.015
 masse-0.014
 Woo-0.014
Token wanted
Token ID2227
Feature activation+3.21e-01

Pos logit contributions
 count+0.018
 guess+0.018
 ten+0.017
 three+0.016
 imagine+0.016
Neg logit contributions
 therapist-0.025
actus-0.025
 Midnight-0.025
iflower-0.022
 drowned-0.022
Token to
Token ID284
Feature activation-1.08e-01

Pos logit contributions
 guess+0.015
 count+0.015
 imagine+0.014
 three+0.014
 counted+0.014
Neg logit contributions
 therapist-0.012
 Midnight-0.012
actus-0.012
 masse-0.010
 Woo-0.010