TOP ACTIVATIONS
MAX = 7.697

She opened her mouth in amaz ement . She had never
. He watched them in amaz ement and even laughed a
The girl stepped back in amaz ement as the dark started
all . She blinked in amaz ement as the pet als
ice . They stared in amaz ement . They wanted to
but they ran back in amaz ement ! The post had
gazed at the butterfly in amaz ement . But
the people cl apped in amaz ement . The singing rock
looked at the bug in amaz ement . She was so
. The deer stared in amaz ement . He had never
shaking . Mom watched in amaz ement from a distance ,
things , but they flew everywhere . The other children in
just stared at him in amaz ement . So , he
fingertips . She gasped in amaz ement and smiled .
bird looked at her in amaz ement , and said :
line . Billy watched in amaz ement as the ro oster
. She looked up in amaz ement at all of the
at the red blocks in amaz ement . He couldn 't
away . Lucy watched in amaz ement as it flew away
uddle !" she said in amaz ement .

MIN ACTIVATIONS
MIN = -7.492

. She had a bad tum my ac he and her
<|endoftext|> Once there was a sne ez ing boy . He
<|endoftext|> Once there was an ice cream cone . It sat
different parts and explained what each one did . Her mom
apple . With a full tum my , he marched back
did not have a messy tum my anymore . <|endoftext|> Ben
hungry little girl . Her tum my was so big and
many carrots and got a tum my ac he . His
or you will get a tum my ac he . Stay
What if I get a tum my ac he ?" But
time there lived a little chip m unk named Spike .
y ears and a round tum my . Where ver he
their snack . With happy tum m ies , they went
eating too much because his tum my hurt . The bear
time , there was a chip m unk who lived in
the jungle and see what each new day and night brought
With his tum my satisfied , the fox
was feeling uncomfortable because his tum my hurt . He walked
too many and got a tum my ac he .
ista became sleepy and her tum my began to g urg

INTERVAL 3.900 - 7.697
CONTAINS 0.643%

to be able to gaze upon the stadium and now the
understood . He said , " That 's okay . I
how much they loved spending time together in the garden .
M agg ie shouted , " Be careful brother !" But
mom and dad and say " thank you for taking us

INTERVAL 0.102 - 3.900
CONTAINS 45.665%

she went back to gently washing her delicate laundry . Everyone
'm sorry , mom ." Mom sigh s .
He says he is on the sun . He
!" Tim my exclaimed . " Don 't
very sick . Mary felt very helpless and realized

INTERVAL -3.695 - 0.102
CONTAINS 53.352%

as they walked . That made Sandy happy . They paused
stranger said , " It 's okay , I just want
it was wet . She said , " It looks like
everywhere but he couldn 't find it . He was so
friends . And they promised to play together again soon .

INTERVAL -7.492 - -3.695
CONTAINS 0.340%

mom , " What is it ?" She said , "
care about their aunt or each other . Soon
what was on the TV too . Tim laughed and said
, you can have some too ." They sit
it was and how great it looked on her .
Token She
Token ID1375
Feature activation+8.45e-01

Pos logit contributions
bek+0.005
 rect+0.005
heit+0.004
vered+0.004
 angles+0.004
Neg logit contributions
 doctor-0.003
 sick-0.003
 hospital-0.003
 sleep-0.003
 dentist-0.003
Token opened
Token ID4721
Feature activation+2.27e+00

Pos logit contributions
bek+0.037
 rect+0.036
heit+0.034
vered+0.034
 angles+0.033
Neg logit contributions
 sick-0.025
 hospital-0.024
 doctor-0.024
 sleep-0.024
 dentist-0.023
Token her
Token ID607
Feature activation+2.53e+00

Pos logit contributions
bek+0.150
 rect+0.133
vered+0.131
osaurs+0.131
anish+0.130
Neg logit contributions
 sleep-0.028
.-0.027
 doctor-0.026
 sick-0.025
?"-0.025
Token mouth
Token ID5422
Feature activation+2.86e-01

Pos logit contributions
bek+0.109
 rect+0.102
vered+0.098
heit+0.095
Wow+0.094
Neg logit contributions
 sick-0.084
 hospital-0.083
 doctor-0.083
 sleep-0.083
 tum-0.081
Token in
Token ID287
Feature activation+2.86e+00

Pos logit contributions
bek+0.015
 rect+0.014
heit+0.014
vered+0.014
 angles+0.013
Neg logit contributions
 sick-0.006
 doctor-0.006
 hospital-0.006
 dentist-0.006
 sleep-0.006
Token amaz
Token ID40642
Feature activation+7.70e+00

Pos logit contributions
bek+0.159
 rect+0.150
heit+0.150
vered+0.146
osaurs+0.139
Neg logit contributions
 sleep-0.058
 sick-0.057
 doctor-0.052
 hospital-0.051
.-0.050
Tokenement
Token ID972
Feature activation+4.43e+00

Pos logit contributions
bek+0.222
 rect+0.197
heit+0.184
vered+0.179
osaurs+0.176
Neg logit contributions
 doctor-0.348
 dentist-0.346
 hospital-0.344
 sick-0.333
?"-0.331
Token.
Token ID13
Feature activation+4.92e-01

Pos logit contributions
bek+0.252
 rect+0.245
heit+0.239
vered+0.237
 angles+0.229
Neg logit contributions
 sick-0.076
 doctor-0.069
 hospital-0.069
 sleep-0.068
 dentist-0.064
Token She
Token ID1375
Feature activation+1.13e+00

Pos logit contributions
bek+0.022
 rect+0.021
heit+0.021
vered+0.021
 angles+0.020
Neg logit contributions
 doctor-0.014
 sleep-0.014
 sick-0.014
 hospital-0.014
 dentist-0.014
Token had
Token ID550
Feature activation+2.04e+00

Pos logit contributions
bek+0.051
 rect+0.049
heit+0.047
vered+0.046
 angles+0.045
Neg logit contributions
 sick-0.032
 sleep-0.031
 doctor-0.031
 hospital-0.030
 dentist-0.029
Token never
Token ID1239
Feature activation+1.36e+00

Pos logit contributions
bek+0.096
 rect+0.092
heit+0.091
vered+0.090
 angles+0.085
Neg logit contributions
 sick-0.055
 hospital-0.053
 doctor-0.052
 dentist-0.052
 sleep-0.051
Token.
Token ID13
Feature activation+9.07e-02

Pos logit contributions
 sick+0.004
 hospital+0.004
 doctor+0.004
 dentist+0.004
 sleep+0.003
Neg logit contributions
bek-0.020
 rect-0.020
vered-0.020
heit-0.019
 angles-0.019
Token He
Token ID679
Feature activation+1.07e+00

Pos logit contributions
bek+0.004
 rect+0.004
vered+0.004
heit+0.004
 angles+0.004
Neg logit contributions
 doctor-0.002
 sick-0.002
 hospital-0.002
 dentist-0.002
 sleep-0.002
Token watched
Token ID7342
Feature activation+5.98e-01

Pos logit contributions
bek+0.048
 rect+0.047
vered+0.045
heit+0.045
 angles+0.044
Neg logit contributions
 sick-0.030
 hospital-0.029
 doctor-0.029
 dentist-0.028
 sleep-0.028
Token them
Token ID606
Feature activation-6.31e-01

Pos logit contributions
bek+0.029
 rect+0.027
vered+0.026
heit+0.026
 angles+0.026
Neg logit contributions
 sick-0.016
 doctor-0.016
 hospital-0.015
 sleep-0.015
 dentist-0.014
Token in
Token ID287
Feature activation+2.23e+00

Pos logit contributions
 sick+0.014
 hospital+0.013
 doctor+0.013
 dentist+0.013
 sleep+0.013
Neg logit contributions
bek-0.032
 rect-0.031
vered-0.030
heit-0.030
 angles-0.029
Token amaz
Token ID40642
Feature activation+7.42e+00

Pos logit contributions
bek+0.095
 rect+0.089
heit+0.087
vered+0.085
ically+0.079
Neg logit contributions
 sleep-0.074
 sick-0.073
 hospital-0.070
 doctor-0.070
 bat-0.069
Tokenement
Token ID972
Feature activation+4.02e+00

Pos logit contributions
bek+0.230
 rect+0.197
heit+0.182
vered+0.181
 magn+0.172
Neg logit contributions
 doctor-0.317
 dentist-0.316
?"-0.315
 hospital-0.315
 ambulance-0.304
Token and
Token ID290
Feature activation+5.05e-01

Pos logit contributions
bek+0.218
 rect+0.212
heit+0.207
vered+0.207
 angles+0.199
Neg logit contributions
 sick-0.078
 doctor-0.073
 hospital-0.073
 sleep-0.071
 dentist-0.069
Token even
Token ID772
Feature activation-1.22e+00

Pos logit contributions
bek+0.024
 rect+0.023
heit+0.022
vered+0.022
anish+0.021
Neg logit contributions
 sick-0.014
 hospital-0.013
 sleep-0.013
 doctor-0.013
 cook-0.012
Token laughed
Token ID13818
Feature activation+8.64e-01

Pos logit contributions
 sick+0.031
 hospital+0.029
 doctor+0.029
 sleep+0.028
 dentist+0.027
Neg logit contributions
bek-0.059
 rect-0.058
heit-0.056
vered-0.056
 angles-0.053
Token a
Token ID257
Feature activation+1.50e+00

Pos logit contributions
bek+0.048
 rect+0.048
vered+0.046
heit+0.046
 angles+0.045
Neg logit contributions
 sick-0.015
 hospital-0.014
 doctor-0.014
 dentist-0.013
 sleep-0.013
Token The
Token ID383
Feature activation-7.16e-01

Pos logit contributions
bek+0.007
 rect+0.007
vered+0.007
heit+0.006
 angles+0.006
Neg logit contributions
 sick-0.005
 hospital-0.004
 doctor-0.004
 dentist-0.004
 sleep-0.004
Token girl
Token ID2576
Feature activation+1.39e+00

Pos logit contributions
 sick+0.019
 hospital+0.018
 doctor+0.018
 dentist+0.017
 sleep+0.017
Neg logit contributions
bek-0.034
 rect-0.034
heit-0.033
vered-0.033
 angles-0.031
Token stepped
Token ID10764
Feature activation-6.73e-01

Pos logit contributions
bek+0.067
 rect+0.066
heit+0.065
 angles+0.061
ifying+0.060
Neg logit contributions
 sick-0.036
 hospital-0.035
 doctor-0.035
 sleep-0.033
 hurt-0.033
Token back
Token ID736
Feature activation-1.11e+00

Pos logit contributions
 sick+0.021
 hospital+0.020
 doctor+0.020
 dentist+0.020
 sleep+0.020
Neg logit contributions
bek-0.029
 rect-0.028
heit-0.027
vered-0.027
 angles-0.026
Token in
Token ID287
Feature activation+2.54e+00

Pos logit contributions
 sleep+0.020
 sick+0.020
 doctor+0.019
 hospital+0.019
.+0.016
Neg logit contributions
 rect-0.063
bek-0.063
heit-0.061
vered-0.058
 angles-0.056
Token amaz
Token ID40642
Feature activation+7.38e+00

Pos logit contributions
bek+0.137
 rect+0.133
heit+0.131
vered+0.126
osaurs+0.122
Neg logit contributions
 sleep-0.057
 sick-0.055
 doctor-0.053
 hospital-0.052
 bat-0.051
Tokenement
Token ID972
Feature activation+4.25e+00

Pos logit contributions
bek+0.210
 rect+0.185
vered+0.171
heit+0.170
osaurs+0.164
Neg logit contributions
 doctor-0.341
 hospital-0.340
 dentist-0.340
 sick-0.326
 doctors-0.322
Token as
Token ID355
Feature activation-9.17e-01

Pos logit contributions
bek+0.234
 rect+0.229
heit+0.222
vered+0.222
 angles+0.214
Neg logit contributions
 sick-0.079
 hospital-0.074
 doctor-0.073
 sleep-0.071
 dentist-0.069
Token the
Token ID262
Feature activation-1.84e+00

Pos logit contributions
 sick+0.023
 doctor+0.022
 hospital+0.022
 dentist+0.021
 sleep+0.021
Neg logit contributions
bek-0.045
 rect-0.045
heit-0.043
vered-0.043
 angles-0.041
Token dark
Token ID3223
Feature activation+5.00e-01

Pos logit contributions
 sick+0.046
 hospital+0.044
 doctor+0.044
 dentist+0.042
 sleep+0.042
Neg logit contributions
bek-0.091
 rect-0.089
vered-0.087
heit-0.087
 angles-0.082
Token started
Token ID2067
Feature activation-2.47e+00

Pos logit contributions
 rect+0.020
bek+0.020
vered+0.020
 angles+0.019
heit+0.019
Neg logit contributions
 sick-0.016
 hospital-0.015
 doctor-0.015
 dentist-0.015
 sleep-0.014
Token all
Token ID477
Feature activation-2.31e+00

Pos logit contributions
 sick+0.037
 doctor+0.035
 hospital+0.035
 sleep+0.034
 dentist+0.033
Neg logit contributions
bek-0.079
 rect-0.076
vered-0.075
heit-0.074
 angles-0.071
Token.
Token ID13
Feature activation-4.39e-02

Pos logit contributions
 sick+0.031
 hospital+0.028
 doctor+0.027
 sleep+0.026
 dentist+0.025
Neg logit contributions
bek-0.139
 rect-0.137
heit-0.134
vered-0.134
 angles-0.128
Token She
Token ID1375
Feature activation+6.94e-01

Pos logit contributions
 doctor+0.001
 sick+0.001
 hospital+0.001
 sleep+0.001
 dentist+0.001
Neg logit contributions
bek-0.002
 rect-0.002
heit-0.002
vered-0.002
 angles-0.002
Token blinked
Token ID40360
Feature activation+7.49e-01

Pos logit contributions
bek+0.029
 rect+0.028
vered+0.027
heit+0.027
 angles+0.026
Neg logit contributions
 sick-0.022
 hospital-0.021
 doctor-0.021
 dentist-0.020
 sleep-0.020
Token in
Token ID287
Feature activation+2.25e+00

Pos logit contributions
bek+0.042
heit+0.038
 rect+0.038
vered+0.036
osaurs+0.036
Neg logit contributions
 sick-0.015
 sleep-0.015
 doctor-0.014
 hospital-0.013
.-0.013
Token amaz
Token ID40642
Feature activation+7.27e+00

Pos logit contributions
bek+0.096
 rect+0.083
heit+0.082
x+0.080
ogram+0.080
Neg logit contributions
 sleep-0.083
 sick-0.076
 hot-0.073
 bat-0.073
 hospital-0.073
Tokenement
Token ID972
Feature activation+4.04e+00

Pos logit contributions
bek+0.252
osaurs+0.208
 rect+0.207
heit+0.203
vered+0.192
Neg logit contributions
 doctor-0.292
 dentist-0.286
 hospital-0.284
?"-0.277
.-0.274
Token as
Token ID355
Feature activation-6.95e-01

Pos logit contributions
bek+0.213
 rect+0.203
heit+0.200
vered+0.197
 angles+0.189
Neg logit contributions
 sick-0.088
 doctor-0.082
 sleep-0.082
 hospital-0.081
 dentist-0.075
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
 sick+0.017
 doctor+0.016
 hospital+0.015
 sleep+0.015
 dentist+0.015
Neg logit contributions
bek-0.035
 rect-0.034
vered-0.033
heit-0.033
 angles-0.031
Token pet
Token ID4273
Feature activation-1.39e+00

Pos logit contributions
 sick+0.051
 hospital+0.048
 doctor+0.048
 sleep+0.047
 itch+0.045
Neg logit contributions
bek-0.100
 rect-0.097
vered-0.095
heit-0.094
 angles-0.087
Tokenals
Token ID874
Feature activation-1.49e+00

Pos logit contributions
 doctor+0.069
 sick+0.068
 hospital+0.068
 sleep+0.066
 dentist+0.065
Neg logit contributions
bek-0.033
heit-0.032
 rect-0.031
vered-0.028
 angles-0.027
Token ice
Token ID4771
Feature activation-3.77e+00

Pos logit contributions
 rect+0.006
 angles+0.006
vered+0.006
bek+0.006
Wow+0.006
Neg logit contributions
 sleep-0.002
iver-0.002
 sick-0.002
 dentist-0.002
 doctor-0.002
Token.
Token ID13
Feature activation+6.59e-01

Pos logit contributions
 sick+0.093
 hospital+0.091
 doctor+0.090
 dentist+0.090
 sleep+0.087
Neg logit contributions
 rect-0.177
bek-0.176
vered-0.168
heit-0.166
 angles-0.166
Token They
Token ID1119
Feature activation+9.66e-01

Pos logit contributions
bek+0.031
heit+0.029
vered+0.028
 rect+0.027
 angles+0.026
Neg logit contributions
 hospital-0.018
 doctor-0.018
 sick-0.018
 sleep-0.018
 bat-0.017
Token stared
Token ID18484
Feature activation+8.73e-01

Pos logit contributions
bek+0.041
 rect+0.041
vered+0.039
heit+0.039
 angles+0.038
Neg logit contributions
 sick-0.029
 hospital-0.028
 doctor-0.028
 dentist-0.027
 sleep-0.027
Token in
Token ID287
Feature activation+2.31e+00

Pos logit contributions
bek+0.046
 rect+0.044
heit+0.044
vered+0.043
 angles+0.041
Neg logit contributions
 sick-0.019
 doctor-0.018
 hospital-0.017
 sleep-0.017
 dentist-0.016
Token amaz
Token ID40642
Feature activation+7.26e+00

Pos logit contributions
bek+0.114
 rect+0.104
heit+0.103
vered+0.100
x+0.100
Neg logit contributions
 sleep-0.062
 sick-0.060
 bat-0.060
.-0.058
 hot-0.057
Tokenement
Token ID972
Feature activation+4.47e+00

Pos logit contributions
bek+0.234
 rect+0.198
heit+0.191
vered+0.185
osaurs+0.184
Neg logit contributions
 doctor-0.303
 dentist-0.302
 hospital-0.300
 ambulance-0.292
?"-0.290
Token.
Token ID13
Feature activation+7.83e-01

Pos logit contributions
bek+0.276
 rect+0.269
heit+0.265
vered+0.263
 angles+0.254
Neg logit contributions
 sick-0.054
 doctor-0.047
 hospital-0.047
 sleep-0.046
 dentist-0.042
Token They
Token ID1119
Feature activation+8.63e-01

Pos logit contributions
bek+0.035
heit+0.032
vered+0.032
 rect+0.032
 angles+0.030
Neg logit contributions
 sick-0.023
 doctor-0.023
 hospital-0.023
 sleep-0.022
 bat-0.022
Token wanted
Token ID2227
Feature activation+6.25e-01

Pos logit contributions
bek+0.036
 rect+0.036
vered+0.034
heit+0.034
 angles+0.033
Neg logit contributions
 sick-0.027
 hospital-0.026
 doctor-0.026
 dentist-0.025
 sleep-0.025
Token to
Token ID284
Feature activation-8.53e-01

Pos logit contributions
bek+0.037
heit+0.035
 rect+0.034
vered+0.034
 angles+0.032
Neg logit contributions
 sick-0.011
 doctor-0.011
 hospital-0.011
 sleep-0.010
 dentist-0.010
Token but
Token ID475
Feature activation+2.24e+00

Pos logit contributions
 sick+0.007
 doctor+0.006
 hospital+0.006
 sleep+0.006
 dentist+0.005
Neg logit contributions
bek-0.039
 rect-0.038
heit-0.037
vered-0.036
 angles-0.036
Token they
Token ID484
Feature activation-4.19e-01

Pos logit contributions
bek+0.117
 rect+0.115
heit+0.113
x+0.108
vered+0.107
Neg logit contributions
 sick-0.055
 hospital-0.051
 doctor-0.048
 sleep-0.048
 left-0.047
Token ran
Token ID4966
Feature activation-4.22e-01

Pos logit contributions
 sick+0.014
 hospital+0.013
 doctor+0.013
 sleep+0.013
 dentist+0.013
Neg logit contributions
bek-0.017
 rect-0.017
heit-0.016
vered-0.016
 angles-0.015
Token back
Token ID736
Feature activation-7.42e-01

Pos logit contributions
 sick+0.012
 hospital+0.012
 sleep+0.012
 doctor+0.011
 bat+0.011
Neg logit contributions
 rect-0.020
bek-0.020
heit-0.019
vered-0.018
 angles-0.017
Token in
Token ID287
Feature activation+2.89e+00

Pos logit contributions
 sick+0.012
 sleep+0.011
 hospital+0.011
 doctor+0.011
 bat+0.009
Neg logit contributions
 rect-0.044
bek-0.043
heit-0.042
vered-0.040
anish-0.039
Token amaz
Token ID40642
Feature activation+7.26e+00

Pos logit contributions
bek+0.161
 rect+0.160
heit+0.156
vered+0.151
x+0.145
Neg logit contributions
 sick-0.053
 sleep-0.050
 hospital-0.049
 doctor-0.049
 bat-0.047
Tokenement
Token ID972
Feature activation+4.18e+00

Pos logit contributions
bek+0.203
 rect+0.182
heit+0.164
vered+0.163
osaurs+0.153
Neg logit contributions
 hospital-0.334
 doctor-0.332
 dentist-0.329
?"-0.324
 sick-0.321
Token!
Token ID0
Feature activation+7.17e-02

Pos logit contributions
bek+0.268
 rect+0.265
vered+0.258
heit+0.258
 angles+0.251
Neg logit contributions
 sick-0.037
 hospital-0.033
 doctor-0.033
 dentist-0.030
 sleep-0.028
Token The
Token ID383
Feature activation-8.77e-01

Pos logit contributions
bek+0.004
 rect+0.004
vered+0.004
heit+0.004
 angles+0.004
Neg logit contributions
 doctor-0.001
 hospital-0.001
 sleep-0.001
 sick-0.001
 bat-0.001
Token post
Token ID1281
Feature activation-5.53e-01

Pos logit contributions
 sick+0.026
 sleep+0.025
 hospital+0.024
 dentist+0.024
 doctor+0.024
Neg logit contributions
bek-0.037
 rect-0.036
vered-0.035
heit-0.034
 angles-0.034
Token had
Token ID550
Feature activation+1.71e+00

Pos logit contributions
 doctor+0.015
 hospital+0.014
 sick+0.014
 dentist+0.014
 doctors+0.013
Neg logit contributions
bek-0.026
 rect-0.026
heit-0.026
vered-0.024
 angles-0.024
Token gazed
Token ID50255
Feature activation+1.00e+00

Pos logit contributions
 doctor+0.011
 sick+0.011
 hospital+0.011
 dentist+0.011
 sleep+0.010
Neg logit contributions
bek-0.012
 rect-0.011
vered-0.011
heit-0.011
 angles-0.010
Token at
Token ID379
Feature activation+2.17e+00

Pos logit contributions
bek+0.053
 rect+0.051
heit+0.050
vered+0.049
 angles+0.048
Neg logit contributions
 sick-0.022
 doctor-0.019
 hospital-0.019
 sleep-0.019
 bat-0.018
Token the
Token ID262
Feature activation+3.00e-02

Pos logit contributions
bek+0.139
 rect+0.129
heit+0.124
ically+0.119
vin+0.118
Neg logit contributions
 sick-0.039
.-0.035
 sleep-0.034
 hospital-0.034
 hurt-0.031
Token butterfly
Token ID35113
Feature activation+1.95e+00

Pos logit contributions
bek+0.001
 rect+0.001
vered+0.001
heit+0.001
 angles+0.001
Neg logit contributions
 sick-0.001
 hospital-0.001
 doctor-0.001
 dentist-0.001
 sleep-0.001
Token in
Token ID287
Feature activation+2.39e+00

Pos logit contributions
bek+0.104
heit+0.100
 rect+0.099
vered+0.097
 angles+0.094
Neg logit contributions
 sick-0.042
 sleep-0.040
 hospital-0.039
 doctor-0.038
 itch-0.035
Token amaz
Token ID40642
Feature activation+7.20e+00

Pos logit contributions
bek+0.111
heit+0.105
 rect+0.101
vered+0.100
x+0.095
Neg logit contributions
 sick-0.071
 sleep-0.069
 bat-0.067
 doctor-0.064
 hospital-0.064
Tokenement
Token ID972
Feature activation+4.01e+00

Pos logit contributions
bek+0.201
 rect+0.177
heit+0.170
vered+0.165
osaurs+0.155
Neg logit contributions
 hospital-0.329
 doctor-0.328
 dentist-0.327
 sick-0.318
?"-0.314
Token.
Token ID13
Feature activation+3.21e-01

Pos logit contributions
bek+0.252
 rect+0.244
heit+0.241
vered+0.238
 angles+0.231
Neg logit contributions
 sick-0.045
 hospital-0.039
 doctor-0.038
 sleep-0.038
 dentist-0.033
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
bek+0.017
 rect+0.016
heit+0.016
vered+0.016
 angles+0.016
Neg logit contributions
 doctor-0.007
 sick-0.007
 hospital-0.007
 sleep-0.006
 dentist-0.006
Token
Token ID198
Feature activation-2.96e-01

Pos logit contributions
bek+0.008
 rect+0.008
heit+0.008
vered+0.008
 angles+0.007
Neg logit contributions
 sick-0.005
 hospital-0.005
 dentist-0.005
 doctor-0.005
 itch-0.005
TokenBut
Token ID1537
Feature activation+2.08e+00

Pos logit contributions
 sick+0.010
 doctor+0.010
 hospital+0.010
 dentist+0.009
 sleep+0.009
Neg logit contributions
bek-0.012
 rect-0.011
vered-0.011
heit-0.011
 angles-0.011
Token the
Token ID262
Feature activation-1.41e+00

Pos logit contributions
bek+0.027
 rect+0.026
heit+0.026
vered+0.026
 angles+0.025
Neg logit contributions
 sick-0.013
 hospital-0.013
 doctor-0.012
 sleep-0.012
 dentist-0.012
Token people
Token ID661
Feature activation+1.70e+00

Pos logit contributions
 sick+0.035
 hospital+0.033
 doctor+0.033
 dentist+0.032
 sleep+0.032
Neg logit contributions
bek-0.068
 rect-0.067
vered-0.065
heit-0.065
 angles-0.062
Token cl
Token ID537
Feature activation+3.81e-01

Pos logit contributions
bek+0.073
 rect+0.072
heit+0.069
vered+0.068
 angles+0.065
Neg logit contributions
 sick-0.052
 hospital-0.049
 doctor-0.049
 sleep-0.048
 dentist-0.047
Tokenapped
Token ID6320
Feature activation+9.45e-01

Pos logit contributions
bek+0.012
 rect+0.012
heit+0.011
vered+0.011
 angles+0.011
Neg logit contributions
 sick-0.016
 hospital-0.015
 doctor-0.015
 sleep-0.015
 dentist-0.015
Token in
Token ID287
Feature activation+3.64e+00

Pos logit contributions
bek+0.050
heit+0.047
 rect+0.046
vered+0.045
vin+0.043
Neg logit contributions
 sick-0.022
 sleep-0.022
.-0.020
 bat-0.020
 doctor-0.019
Token amaz
Token ID40642
Feature activation+7.17e+00

Pos logit contributions
bek+0.181
x+0.164
ogram+0.161
heit+0.161
osaurs+0.159
Neg logit contributions
 sleep-0.102
.-0.098
 bat-0.096
 sick-0.095
 hot-0.090
Tokenement
Token ID972
Feature activation+4.27e+00

Pos logit contributions
bek+0.192
 rect+0.172
heit+0.160
osaurs+0.154
vered+0.150
Neg logit contributions
 hospital-0.330
 doctor-0.329
 sick-0.323
 dentist-0.321
 sleep-0.317
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
bek+0.259
 rect+0.256
vered+0.249
heit+0.248
 angles+0.242
Neg logit contributions
 sick-0.052
 hospital-0.049
 doctor-0.048
 dentist-0.045
 sleep-0.042
Token The
Token ID383
Feature activation-3.41e-01

Pos logit contributions
bek+0.012
 rect+0.011
heit+0.011
vered+0.011
 angles+0.011
Neg logit contributions
 hospital-0.005
 doctor-0.005
 sleep-0.005
 sick-0.005
 dentist-0.004
Token singing
Token ID13777
Feature activation-7.16e-01

Pos logit contributions
 sick+0.008
 hospital+0.008
 doctor+0.008
 dentist+0.008
 sleep+0.007
Neg logit contributions
bek-0.017
 rect-0.017
vered-0.016
heit-0.016
 angles-0.015
Token rock
Token ID3881
Feature activation-1.02e+00

Pos logit contributions
 sick+0.016
 doctor+0.015
 hospital+0.015
 sleep+0.014
 dentist+0.014
Neg logit contributions
bek-0.037
 rect-0.036
heit-0.035
vered-0.034
 angles-0.034
Token looked
Token ID3114
Feature activation-5.84e-01

Pos logit contributions
bek+0.093
 rect+0.091
heit+0.089
 angles+0.085
vered+0.083
Neg logit contributions
 sick-0.054
 doctor-0.053
 hospital-0.053
 sleep-0.051
 bat-0.049
Token at
Token ID379
Feature activation+1.90e+00

Pos logit contributions
 sick+0.014
 hospital+0.013
 sleep+0.013
 doctor+0.012
 left+0.012
Neg logit contributions
bek-0.030
 rect-0.029
vered-0.028
heit-0.028
anish-0.027
Token the
Token ID262
Feature activation-4.70e-02

Pos logit contributions
bek+0.126
 rect+0.123
heit+0.114
osaurs+0.112
ults+0.111
Neg logit contributions
 hospital-0.024
 sleep-0.023
 doctor-0.022
 sick-0.022
.-0.018
Token bug
Token ID5434
Feature activation+9.20e-01

Pos logit contributions
 sick+0.002
 hospital+0.002
 doctor+0.002
 dentist+0.002
 sleep+0.001
Neg logit contributions
bek-0.002
 rect-0.002
vered-0.002
heit-0.002
 angles-0.002
Token in
Token ID287
Feature activation+2.22e+00

Pos logit contributions
bek+0.051
 rect+0.049
heit+0.048
vered+0.047
 angles+0.046
Neg logit contributions
 sick-0.018
 hospital-0.018
 doctor-0.017
 sleep-0.017
 itch-0.015
Token amaz
Token ID40642
Feature activation+7.16e+00

Pos logit contributions
bek+0.102
 rect+0.097
heit+0.095
x+0.090
vered+0.089
Neg logit contributions
 sick-0.066
 sleep-0.065
 doctor-0.063
 bat-0.062
 hospital-0.061
Tokenement
Token ID972
Feature activation+3.81e+00

Pos logit contributions
bek+0.223
 rect+0.196
heit+0.192
osaurs+0.185
vered+0.180
Neg logit contributions
 doctor-0.307
 dentist-0.306
 hospital-0.304
?"-0.288
 ambulance-0.287
Token.
Token ID13
Feature activation+4.75e-01

Pos logit contributions
bek+0.221
 rect+0.213
heit+0.210
vered+0.206
 angles+0.199
Neg logit contributions
 sick-0.063
 doctor-0.057
 hospital-0.057
 sleep-0.056
 dentist-0.051
Token She
Token ID1375
Feature activation+1.21e+00

Pos logit contributions
bek+0.025
heit+0.024
 rect+0.023
vered+0.023
 angles+0.022
Neg logit contributions
 doctor-0.011
 hospital-0.010
 sleep-0.010
 dentist-0.010
 sick-0.010
Token was
Token ID373
Feature activation+7.94e-01

Pos logit contributions
bek+0.056
 rect+0.055
heit+0.053
 angles+0.051
vered+0.051
Neg logit contributions
 sick-0.032
 doctor-0.031
 hospital-0.030
 sleep-0.030
 hurt-0.029
Token so
Token ID523
Feature activation+6.01e-02

Pos logit contributions
bek+0.031
 rect+0.031
vered+0.030
heit+0.029
 angles+0.028
Neg logit contributions
 sick-0.027
 hospital-0.026
 doctor-0.026
 dentist-0.025
 sleep-0.025
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
 rect+0.053
bek+0.053
vered+0.051
heit+0.051
 angles+0.050
Neg logit contributions
 sick-0.013
 hospital-0.013
 doctor-0.012
 dentist-0.012
 sleep-0.011
Token The
Token ID383
Feature activation-6.05e-01

Pos logit contributions
bek+0.014
 rect+0.014
vered+0.014
heit+0.014
 angles+0.013
Neg logit contributions
 sick-0.008
 hospital-0.008
 doctor-0.008
 dentist-0.008
 sleep-0.008
Token deer
Token ID20096
Feature activation+1.02e+00

Pos logit contributions
 sick+0.020
 hospital+0.019
 doctor+0.019
 sleep+0.018
 dentist+0.018
Neg logit contributions
bek-0.026
 rect-0.025
vered-0.024
heit-0.023
 angles-0.023
Token stared
Token ID18484
Feature activation+2.45e-01

Pos logit contributions
bek+0.049
 rect+0.047
heit+0.046
vered+0.044
 angles+0.044
Neg logit contributions
 sick-0.027
 doctor-0.025
 hospital-0.025
 sleep-0.025
 hurt-0.024
Token in
Token ID287
Feature activation+2.13e+00

Pos logit contributions
bek+0.012
 rect+0.012
vered+0.011
heit+0.011
 angles+0.011
Neg logit contributions
 sick-0.006
 hospital-0.006
 doctor-0.006
 dentist-0.006
 sleep-0.006
Token amaz
Token ID40642
Feature activation+7.15e+00

Pos logit contributions
bek+0.075
 rect+0.069
heit+0.067
vered+0.066
x+0.062
Neg logit contributions
 sleep-0.086
 sick-0.085
 hospital-0.082
 doctor-0.082
 bat-0.081
Tokenement
Token ID972
Feature activation+4.12e+00

Pos logit contributions
bek+0.204
 rect+0.180
heit+0.169
vered+0.165
osaurs+0.157
Neg logit contributions
 doctor-0.322
 dentist-0.320
 hospital-0.319
 sick-0.311
 sleep-0.308
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
bek+0.226
 rect+0.221
heit+0.215
vered+0.215
 angles+0.207
Neg logit contributions
 sick-0.077
 hospital-0.072
 doctor-0.072
 sleep-0.069
 dentist-0.068
Token He
Token ID679
Feature activation+1.05e+00

Pos logit contributions
bek+0.010
 rect+0.010
vered+0.009
heit+0.009
 angles+0.009
Neg logit contributions
 sick-0.006
 doctor-0.006
 hospital-0.006
 dentist-0.006
 sleep-0.006
Token had
Token ID550
Feature activation+2.08e+00

Pos logit contributions
bek+0.049
 rect+0.048
heit+0.046
vered+0.046
 angles+0.044
Neg logit contributions
 sick-0.028
 doctor-0.027
 hospital-0.027
 sleep-0.026
 dentist-0.026
Token never
Token ID1239
Feature activation+1.29e+00

Pos logit contributions
bek+0.091
 rect+0.089
heit+0.086
vered+0.086
 angles+0.082
Neg logit contributions
 sick-0.062
 hospital-0.060
 doctor-0.060
 dentist-0.058
 sleep-0.058
Token shaking
Token ID17275
Feature activation-1.07e+00

Pos logit contributions
 rect+0.036
bek+0.035
vered+0.034
heit+0.034
 angles+0.033
Neg logit contributions
 hospital-0.024
 sick-0.024
 doctor-0.024
 dentist-0.024
 sleep-0.022
Token.
Token ID13
Feature activation+8.86e-02

Pos logit contributions
 sick+0.016
 doctor+0.015
 hospital+0.015
 sleep+0.014
 bat+0.013
Neg logit contributions
bek-0.064
 rect-0.061
heit-0.060
vered-0.059
 angles-0.058
Token Mom
Token ID11254
Feature activation-1.38e+00

Pos logit contributions
bek+0.004
 rect+0.004
heit+0.004
vered+0.004
 angles+0.003
Neg logit contributions
 sick-0.003
 doctor-0.003
 hospital-0.003
 sleep-0.003
 dentist-0.003
Token watched
Token ID7342
Feature activation+6.35e-01

Pos logit contributions
 sick+0.037
 doctor+0.036
 hospital+0.035
 dentist+0.035
 sleep+0.035
Neg logit contributions
bek-0.064
 rect-0.063
heit-0.061
vered-0.060
 angles-0.058
Token in
Token ID287
Feature activation+2.30e+00

Pos logit contributions
bek+0.034
 rect+0.032
heit+0.032
vered+0.031
 angles+0.031
Neg logit contributions
 sick-0.014
 sleep-0.014
 doctor-0.014
 hospital-0.013
 bat-0.012
Token amaz
Token ID40642
Feature activation+7.14e+00

Pos logit contributions
bek+0.089
 rect+0.085
heit+0.083
vered+0.078
x+0.076
Neg logit contributions
 sleep-0.085
 sick-0.082
 hospital-0.082
 doctor-0.081
 bat-0.080
Tokenement
Token ID972
Feature activation+4.08e+00

Pos logit contributions
bek+0.222
 rect+0.191
osaurs+0.181
heit+0.175
vered+0.170
Neg logit contributions
 hospital-0.312
 doctor-0.312
 dentist-0.303
 sick-0.298
?"-0.295
Token from
Token ID422
Feature activation+1.08e+00

Pos logit contributions
bek+0.201
 rect+0.197
heit+0.191
vered+0.190
 angles+0.184
Neg logit contributions
 sick-0.098
 hospital-0.094
 doctor-0.093
 sleep-0.089
 dentist-0.089
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
bek+0.063
 rect+0.062
heit+0.059
vered+0.057
osaurs+0.056
Neg logit contributions
 sick-0.019
 hospital-0.019
 doctor-0.018
 sleep-0.018
 dentist-0.016
Token distance
Token ID5253
Feature activation+2.34e-01

Pos logit contributions
bek+0.056
 rect+0.056
vered+0.055
heit+0.054
 angles+0.052
Neg logit contributions
 sick-0.031
 hospital-0.030
 doctor-0.030
 sleep-0.029
 dentist-0.028
Token,
Token ID11
Feature activation+1.95e-01

Pos logit contributions
bek+0.013
 rect+0.013
heit+0.013
vered+0.012
 angles+0.012
Neg logit contributions
 sick-0.004
 hospital-0.004
 doctor-0.004
 sleep-0.004
 dentist-0.003
Token things
Token ID1243
Feature activation+3.51e+00

Pos logit contributions
bek+0.068
 rect+0.068
vered+0.065
heit+0.065
 angles+0.063
Neg logit contributions
 sick-0.041
 hospital-0.041
 doctor-0.040
 dentist-0.040
 sleep-0.039
Token,
Token ID11
Feature activation+8.85e-01

Pos logit contributions
bek+0.199
 rect+0.195
heit+0.190
vered+0.190
 angles+0.184
Neg logit contributions
 sick-0.059
 hospital-0.056
 doctor-0.055
 dentist-0.051
 sleep-0.051
Token but
Token ID475
Feature activation+2.50e+00

Pos logit contributions
bek+0.045
heit+0.042
vered+0.042
 rect+0.040
 angles+0.039
Neg logit contributions
 sick-0.026
 hospital-0.026
 doctor-0.025
 sleep-0.024
 bat-0.023
Token they
Token ID484
Feature activation+2.87e-01

Pos logit contributions
bek+0.130
 rect+0.126
vered+0.124
heit+0.124
 angles+0.118
Neg logit contributions
 sick-0.058
 doctor-0.057
 hospital-0.056
 dentist-0.051
 sleep-0.050
Token flew
Token ID13112
Feature activation-9.35e-02

Pos logit contributions
bek+0.012
 rect+0.012
heit+0.011
vered+0.011
 angles+0.011
Neg logit contributions
 sick-0.009
 hospital-0.009
 doctor-0.009
 hurt-0.009
 sleep-0.008
Token everywhere
Token ID8347
Feature activation+7.14e+00

Pos logit contributions
 sick+0.003
 doctor+0.003
 hospital+0.003
 sleep+0.003
.+0.003
Neg logit contributions
bek-0.004
 rect-0.004
heit-0.004
vered-0.004
 angles-0.004
Token.
Token ID13
Feature activation+7.87e-01

Pos logit contributions
 rect+0.430
bek+0.425
vered+0.413
heit+0.406
 angles+0.403
Neg logit contributions
 sick-0.082
 hospital-0.081
 dentist-0.077
 doctor-0.074
 sleep-0.067
Token The
Token ID383
Feature activation-2.45e-01

Pos logit contributions
 rect+0.034
bek+0.033
vered+0.032
heit+0.031
 angles+0.030
Neg logit contributions
 sick-0.023
 hospital-0.022
 dentist-0.022
 doctor-0.021
 sleep-0.021
Token other
Token ID584
Feature activation+1.08e+00

Pos logit contributions
 sick+0.004
 hospital+0.004
 doctor+0.004
 dentist+0.004
 sleep+0.004
Neg logit contributions
bek-0.013
 rect-0.013
heit-0.013
vered-0.013
 angles-0.013
Token children
Token ID1751
Feature activation+1.80e+00

Pos logit contributions
 rect+0.053
bek+0.052
vered+0.050
heit+0.050
 angles+0.050
Neg logit contributions
 sick-0.026
 hospital-0.024
 doctor-0.024
 sleep-0.023
 dentist-0.023
Token in
Token ID287
Feature activation+2.01e+00

Pos logit contributions
bek+0.081
 rect+0.080
heit+0.077
vered+0.076
 angles+0.074
Neg logit contributions
 sick-0.051
 hospital-0.049
 doctor-0.048
 sleep-0.047
 dentist-0.046
Token just
Token ID655
Feature activation+8.74e-01

Pos logit contributions
 sick+0.003
 hospital+0.003
 doctor+0.003
 dentist+0.003
 sleep+0.003
Neg logit contributions
bek-0.004
 rect-0.004
vered-0.003
heit-0.003
 angles-0.003
Token stared
Token ID18484
Feature activation+3.84e-01

Pos logit contributions
bek+0.035
 rect+0.034
heit+0.032
vered+0.032
 angles+0.030
Neg logit contributions
 sick-0.029
 sleep-0.028
 hospital-0.028
 doctor-0.028
 dentist-0.027
Token at
Token ID379
Feature activation+2.09e+00

Pos logit contributions
bek+0.019
 rect+0.019
vered+0.018
heit+0.018
 angles+0.017
Neg logit contributions
 sick-0.009
 hospital-0.009
 doctor-0.009
 dentist-0.008
 sleep-0.008
Token him
Token ID683
Feature activation+7.43e-02

Pos logit contributions
bek+0.121
 rect+0.113
heit+0.109
vered+0.105
anish+0.102
Neg logit contributions
 sick-0.043
 doctor-0.042
 hospital-0.042
 sleep-0.038
 dentist-0.038
Token in
Token ID287
Feature activation+2.21e+00

Pos logit contributions
bek+0.004
 rect+0.004
heit+0.004
vered+0.004
 angles+0.004
Neg logit contributions
 sick-0.001
 doctor-0.001
 hospital-0.001
 sleep-0.001
 dentist-0.001
Token amaz
Token ID40642
Feature activation+7.13e+00

Pos logit contributions
bek+0.088
 rect+0.082
heit+0.081
vered+0.077
osaurs+0.071
Neg logit contributions
 sick-0.080
 sleep-0.078
 doctor-0.076
 hospital-0.076
 bat-0.074
Tokenement
Token ID972
Feature activation+4.07e+00

Pos logit contributions
bek+0.187
 rect+0.167
heit+0.154
vered+0.153
osaurs+0.140
Neg logit contributions
 doctor-0.339
 hospital-0.337
 dentist-0.336
 sick-0.334
?"-0.324
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
bek+0.253
 rect+0.252
vered+0.245
heit+0.243
 angles+0.238
Neg logit contributions
 sick-0.042
 hospital-0.039
 doctor-0.038
 dentist-0.036
 sleep-0.032
Token So
Token ID1406
Feature activation+2.16e+00

Pos logit contributions
bek+0.012
heit+0.011
vered+0.011
 rect+0.011
 angles+0.011
Neg logit contributions
 doctor-0.006
 hospital-0.006
 sick-0.006
 dentist-0.006
 sleep-0.006
Token,
Token ID11
Feature activation-4.89e-01

Pos logit contributions
bek+0.129
 rect+0.125
heit+0.125
ogram+0.121
 angles+0.121
Neg logit contributions
 doctor-0.035
 hospital-0.030
 sick-0.030
 sleep-0.028
 left-0.024
Token he
Token ID339
Feature activation-6.85e-01

Pos logit contributions
 doctor+0.009
 sick+0.009
 hospital+0.009
 sleep+0.008
 dentist+0.008
Neg logit contributions
bek-0.028
heit-0.027
 rect-0.027
vered-0.026
 angles-0.026
Token fingertips
Token ID40142
Feature activation+1.10e+00

Pos logit contributions
bek+0.044
 rect+0.044
vered+0.042
heit+0.042
 angles+0.040
Neg logit contributions
 sick-0.031
 hospital-0.030
 doctor-0.030
 dentist-0.030
 sleep-0.029
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
bek+0.066
 rect+0.066
vered+0.064
 angles+0.063
heit+0.063
Neg logit contributions
 hospital-0.013
 sick-0.013
 doctor-0.012
 dentist-0.012
 sleep-0.010
Token She
Token ID1375
Feature activation+1.19e+00

Pos logit contributions
bek+0.008
 rect+0.008
heit+0.008
vered+0.008
 angles+0.008
Neg logit contributions
 sick-0.005
 doctor-0.005
 hospital-0.005
 sleep-0.005
 dentist-0.005
Token gasped
Token ID45236
Feature activation+2.57e+00

Pos logit contributions
bek+0.052
 rect+0.051
vered+0.049
heit+0.049
 angles+0.047
Neg logit contributions
 sick-0.035
 hospital-0.034
 doctor-0.034
 dentist-0.033
 sleep-0.033
Token in
Token ID287
Feature activation+1.77e+00

Pos logit contributions
bek+0.138
 rect+0.133
heit+0.132
vered+0.128
 angles+0.125
Neg logit contributions
 sick-0.055
 doctor-0.052
 hospital-0.050
 sleep-0.049
 dentist-0.047
Token amaz
Token ID40642
Feature activation+7.11e+00

Pos logit contributions
bek+0.061
 rect+0.057
vered+0.052
heit+0.051
x+0.049
Neg logit contributions
 sleep-0.074
 sick-0.073
 hospital-0.070
 doctor-0.069
 bat-0.069
Tokenement
Token ID972
Feature activation+3.76e+00

Pos logit contributions
bek+0.207
 rect+0.182
heit+0.169
vered+0.166
osaurs+0.162
Neg logit contributions
 doctor-0.318
 dentist-0.315
 hospital-0.315
 sick-0.307
?"-0.303
Token and
Token ID290
Feature activation+3.57e-01

Pos logit contributions
bek+0.204
 rect+0.203
vered+0.197
heit+0.196
 angles+0.191
Neg logit contributions
 sick-0.068
 hospital-0.065
 doctor-0.065
 dentist-0.063
 sleep-0.059
Token smiled
Token ID13541
Feature activation+6.78e-01

Pos logit contributions
bek+0.019
 rect+0.018
vered+0.018
heit+0.018
 angles+0.017
Neg logit contributions
 sick-0.008
 hospital-0.007
 doctor-0.007
 sleep-0.007
 dentist-0.007
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
 rect+0.037
bek+0.036
vered+0.036
 angles+0.035
 perfectly+0.035
Neg logit contributions
 hospital-0.012
 dentist-0.012
 sick-0.011
 doctor-0.011
 doctors-0.009
Token
Token ID220
Feature activation+1.14e+00

Pos logit contributions
bek+0.010
 rect+0.009
heit+0.009
vered+0.009
 angles+0.009
Neg logit contributions
 doctor-0.003
 sick-0.003
 hospital-0.003
 sleep-0.003
 dentist-0.003
Token bird
Token ID6512
Feature activation-1.41e+00

Pos logit contributions
 sick+0.022
 hospital+0.022
 doctor+0.022
 dentist+0.021
 sleep+0.021
Neg logit contributions
bek-0.024
 rect-0.023
vered-0.022
heit-0.022
 angles-0.021
Token looked
Token ID3114
Feature activation-7.38e-01

Pos logit contributions
 sick+0.039
 hospital+0.037
 sleep+0.036
 doctor+0.036
 hurt+0.035
Neg logit contributions
bek-0.066
 rect-0.063
heit-0.062
vered-0.059
 angles-0.058
Token at
Token ID379
Feature activation+1.87e+00

Pos logit contributions
 sick+0.019
 sleep+0.017
 hospital+0.016
 cold+0.016
 hot+0.016
Neg logit contributions
bek-0.039
 rect-0.036
vered-0.036
heit-0.035
anish-0.034
Token her
Token ID607
Feature activation+1.38e+00

Pos logit contributions
bek+0.122
 rect+0.116
heit+0.110
vered+0.105
ults+0.105
Neg logit contributions
 hospital-0.026
 sleep-0.026
 doctor-0.025
 sick-0.025
.-0.022
Token in
Token ID287
Feature activation+2.05e+00

Pos logit contributions
bek+0.070
 rect+0.068
heit+0.066
vered+0.065
 angles+0.063
Neg logit contributions
 sick-0.033
 doctor-0.031
 hospital-0.031
 sleep-0.031
 dentist-0.029
Token amaz
Token ID40642
Feature activation+7.10e+00

Pos logit contributions
bek+0.087
 rect+0.078
heit+0.078
vered+0.075
x+0.073
Neg logit contributions
 sick-0.071
 sleep-0.070
 doctor-0.067
 bat-0.067
 hospital-0.066
Tokenement
Token ID972
Feature activation+3.66e+00

Pos logit contributions
bek+0.181
 rect+0.163
heit+0.154
vered+0.149
osaurs+0.136
Neg logit contributions
 doctor-0.343
 hospital-0.340
 dentist-0.340
 sick-0.334
 sleep-0.327
Token,
Token ID11
Feature activation+2.78e-01

Pos logit contributions
bek+0.206
 rect+0.199
heit+0.195
vered+0.193
 angles+0.187
Neg logit contributions
 sick-0.066
 doctor-0.061
 hospital-0.061
 sleep-0.059
 dentist-0.056
Token and
Token ID290
Feature activation-2.47e-02

Pos logit contributions
bek+0.014
 rect+0.013
vered+0.013
heit+0.013
 angles+0.012
Neg logit contributions
 sick-0.007
 doctor-0.007
 hospital-0.007
 sleep-0.006
 dentist-0.006
Token said
Token ID531
Feature activation-1.31e+00

Pos logit contributions
 sick+0.001
 doctor+0.001
 hospital+0.001
 sleep+0.001
 dentist+0.001
Neg logit contributions
bek-0.001
 rect-0.001
heit-0.001
vered-0.001
 angles-0.001
Token:
Token ID25
Feature activation+4.79e-01

Pos logit contributions
 sick+0.029
 doctor+0.029
 hospital+0.028
 sleep+0.026
 dentist+0.026
Neg logit contributions
bek-0.068
 rect-0.065
heit-0.063
vered-0.063
 angles-0.062
Token line
Token ID1627
Feature activation+8.49e-03

Pos logit contributions
bek+0.112
 rect+0.110
heit+0.106
vered+0.105
 angles+0.102
Neg logit contributions
 sick-0.044
 hospital-0.042
 doctor-0.042
 sleep-0.040
 dentist-0.039
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
bek+0.000
 rect+0.000
vered+0.000
heit+0.000
 angles+0.000
Neg logit contributions
 sick-0.000
 hospital-0.000
 doctor-0.000
 dentist-0.000
 sleep-0.000
Token Billy
Token ID15890
Feature activation+1.83e+00

Pos logit contributions
bek+0.018
 rect+0.017
vered+0.017
heit+0.017
 angles+0.017
Neg logit contributions
 doctor-0.008
 hospital-0.008
 sick-0.007
 dentist-0.007
 veterinarian-0.007
Token watched
Token ID7342
Feature activation+4.83e-01

Pos logit contributions
bek+0.083
 rect+0.081
heit+0.078
vered+0.078
 angles+0.075
Neg logit contributions
 sick-0.051
 doctor-0.049
 hospital-0.049
 dentist-0.047
 sleep-0.047
Token in
Token ID287
Feature activation+2.02e+00

Pos logit contributions
bek+0.023
 rect+0.022
heit+0.022
vered+0.022
 angles+0.021
Neg logit contributions
 sick-0.013
 doctor-0.012
 hospital-0.012
 sleep-0.012
 dentist-0.011
Token amaz
Token ID40642
Feature activation+7.10e+00

Pos logit contributions
bek+0.071
 rect+0.064
heit+0.064
vered+0.060
x+0.057
Neg logit contributions
 sleep-0.081
 sick-0.081
 hospital-0.080
 doctor-0.079
 bat-0.078
Tokenement
Token ID972
Feature activation+4.09e+00

Pos logit contributions
bek+0.216
 rect+0.180
vered+0.169
heit+0.168
osaurs+0.165
Neg logit contributions
 doctor-0.316
 hospital-0.313
 dentist-0.312
?"-0.302
 sick-0.299
Token as
Token ID355
Feature activation-1.05e+00

Pos logit contributions
bek+0.202
 rect+0.200
vered+0.193
heit+0.192
 angles+0.186
Neg logit contributions
 sick-0.096
 hospital-0.092
 doctor-0.091
 dentist-0.089
 sleep-0.086
Token the
Token ID262
Feature activation-1.96e+00

Pos logit contributions
 sick+0.023
 doctor+0.023
 hospital+0.022
 dentist+0.021
 sleep+0.021
Neg logit contributions
bek-0.054
 rect-0.053
vered-0.051
heit-0.051
 angles-0.050
Token ro
Token ID686
Feature activation-1.74e+00

Pos logit contributions
 sleep+0.050
 hospital+0.050
 sick+0.049
 dentist+0.049
iver+0.047
Neg logit contributions
 rect-0.085
heit-0.081
bek-0.080
 angles-0.078
vered-0.078
Tokenoster
Token ID6197
Feature activation-3.51e-01

Pos logit contributions
 sick+0.065
 dentist+0.063
 hospital+0.063
 sleep+0.063
 doctor+0.062
Neg logit contributions
bek-0.064
 rect-0.061
vered-0.061
heit-0.059
ically-0.055
Token.
Token ID13
Feature activation+3.16e-01

Pos logit contributions
bek+0.055
 rect+0.054
 angles+0.053
vered+0.053
 perfectly+0.052
Neg logit contributions
 sick-0.020
 hospital-0.020
 dentist-0.020
 doctor-0.019
 itch-0.019
Token She
Token ID1375
Feature activation+1.17e+00

Pos logit contributions
bek+0.014
 rect+0.014
vered+0.013
heit+0.013
 angles+0.013
Neg logit contributions
 sick-0.009
 doctor-0.009
 hospital-0.009
 dentist-0.009
 sleep-0.009
Token looked
Token ID3114
Feature activation-6.93e-01

Pos logit contributions
bek+0.052
 rect+0.051
vered+0.049
heit+0.049
 angles+0.047
Neg logit contributions
 sick-0.033
 hospital-0.032
 doctor-0.031
 dentist-0.031
 sleep-0.030
Token up
Token ID510
Feature activation+5.53e-01

Pos logit contributions
 sick+0.018
 hospital+0.016
 sleep+0.016
 doctor+0.016
 dentist+0.015
Neg logit contributions
bek-0.035
 rect-0.034
vered-0.033
heit-0.032
anish-0.031
Token in
Token ID287
Feature activation+1.64e+00

Pos logit contributions
bek+0.026
 rect+0.026
vered+0.025
heit+0.025
 angles+0.024
Neg logit contributions
 sick-0.014
 hospital-0.014
 doctor-0.013
 dentist-0.013
 sleep-0.013
Token amaz
Token ID40642
Feature activation+7.09e+00

Pos logit contributions
bek+0.092
 rect+0.089
heit+0.087
vered+0.086
osaurs+0.081
Neg logit contributions
 sick-0.030
 sleep-0.030
 doctor-0.030
 hospital-0.029
 dentist-0.028
Tokenement
Token ID972
Feature activation+3.71e+00

Pos logit contributions
bek+0.227
 rect+0.197
heit+0.191
vered+0.184
osaurs+0.183
Neg logit contributions
 dentist-0.304
 doctor-0.302
 hospital-0.294
?"-0.288
 doctors-0.283
Token at
Token ID379
Feature activation+5.94e-01

Pos logit contributions
bek+0.207
 rect+0.203
heit+0.199
vered+0.197
 angles+0.190
Neg logit contributions
 sick-0.066
 doctor-0.061
 hospital-0.061
 sleep-0.059
 dentist-0.058
Token all
Token ID477
Feature activation-8.58e-01

Pos logit contributions
bek+0.036
 rect+0.035
heit+0.034
vered+0.033
ically+0.032
Neg logit contributions
 hospital-0.009
 sick-0.009
 doctor-0.009
 sleep-0.009
 dentist-0.009
Token of
Token ID286
Feature activation-4.19e-01

Pos logit contributions
 hospital+0.012
 sick+0.012
 doctor+0.011
 sleep+0.011
 dentist+0.010
Neg logit contributions
bek-0.053
 rect-0.051
heit-0.050
vered-0.050
ically-0.047
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
 sick+0.008
 doctor+0.007
 hospital+0.007
 dentist+0.007
 sleep+0.007
Neg logit contributions
bek-0.024
 rect-0.023
heit-0.023
vered-0.022
ically-0.022
Token at
Token ID379
Feature activation+1.87e+00

Pos logit contributions
 sick+0.015
 hospital+0.014
 doctor+0.013
 sleep+0.013
 hot+0.012
Neg logit contributions
bek-0.037
 rect-0.035
vered-0.035
heit-0.034
anish-0.033
Token the
Token ID262
Feature activation-2.81e-01

Pos logit contributions
bek+0.103
 rect+0.100
vered+0.095
heit+0.095
ically+0.090
Neg logit contributions
 sick-0.037
 doctor-0.037
 hospital-0.037
 sleep-0.034
 dentist-0.034
Token red
Token ID2266
Feature activation+1.38e+00

Pos logit contributions
 sick+0.009
 doctor+0.009
 hospital+0.009
 dentist+0.008
 sleep+0.008
Neg logit contributions
bek-0.012
heit-0.012
vered-0.011
 rect-0.011
 angles-0.010
Token blocks
Token ID7021
Feature activation+2.10e+00

Pos logit contributions
bek+0.052
heit+0.048
vered+0.047
 rect+0.046
 angles+0.043
Neg logit contributions
 doctor-0.051
 sick-0.050
 hospital-0.050
 dentist-0.049
 sleep-0.048
Token in
Token ID287
Feature activation+2.12e+00

Pos logit contributions
bek+0.120
heit+0.117
vered+0.113
 rect+0.112
 angles+0.107
Neg logit contributions
 sick-0.037
 doctor-0.034
 hospital-0.034
 sleep-0.031
 dentist-0.031
Token amaz
Token ID40642
Feature activation+7.07e+00

Pos logit contributions
bek+0.119
heit+0.113
 rect+0.112
vered+0.112
osaurs+0.106
Neg logit contributions
 sick-0.039
 doctor-0.038
 hospital-0.035
 bat-0.035
 sleep-0.035
Tokenement
Token ID972
Feature activation+3.76e+00

Pos logit contributions
bek+0.194
 rect+0.170
heit+0.165
vered+0.161
osaurs+0.149
Neg logit contributions
 dentist-0.329
 doctor-0.329
 hospital-0.325
?"-0.314
 sick-0.313
Token.
Token ID13
Feature activation+3.51e-01

Pos logit contributions
bek+0.233
 rect+0.225
heit+0.221
vered+0.219
 angles+0.212
Neg logit contributions
 sick-0.045
 doctor-0.040
 hospital-0.039
 sleep-0.038
 dentist-0.035
Token He
Token ID679
Feature activation+1.15e+00

Pos logit contributions
bek+0.016
heit+0.015
 rect+0.015
vered+0.015
 angles+0.014
Neg logit contributions
 doctor-0.010
 hospital-0.010
 sick-0.010
 dentist-0.010
 sleep-0.009
Token couldn
Token ID3521
Feature activation+2.28e+00

Pos logit contributions
bek+0.054
 rect+0.051
heit+0.050
vered+0.050
 angles+0.048
Neg logit contributions
 sick-0.031
 doctor-0.030
 hospital-0.029
 dentist-0.028
 sleep-0.028
Token't
Token ID470
Feature activation-7.79e-01

Pos logit contributions
 rect+0.073
vered+0.068
bek+0.067
heit+0.066
 perfectly+0.065
Neg logit contributions
 sick-0.090
 dentist-0.087
 hospital-0.085
 doctors-0.084
 sleep-0.083
Token away
Token ID1497
Feature activation+3.03e+00

Pos logit contributions
 sick+0.016
 doctor+0.015
 sleep+0.015
 hospital+0.015
 bat+0.014
Neg logit contributions
 rect-0.022
bek-0.022
heit-0.021
vered-0.021
 angles-0.019
Token.
Token ID13
Feature activation+8.34e-02

Pos logit contributions
bek+0.162
 rect+0.162
vered+0.157
heit+0.153
 angles+0.153
Neg logit contributions
 dentist-0.054
 sick-0.054
 hospital-0.054
 doctor-0.053
 sleep-0.047
Token Lucy
Token ID22162
Feature activation+1.94e+00

Pos logit contributions
bek+0.003
 rect+0.003
vered+0.003
heit+0.003
 angles+0.003
Neg logit contributions
 sick-0.003
 hospital-0.003
 doctor-0.003
 dentist-0.003
 sleep-0.003
Token watched
Token ID7342
Feature activation+4.25e-01

Pos logit contributions
bek+0.088
 rect+0.087
heit+0.084
vered+0.081
 angles+0.080
Neg logit contributions
 sick-0.055
 doctor-0.053
 hospital-0.052
 sleep-0.052
 dentist-0.049
Token in
Token ID287
Feature activation+2.00e+00

Pos logit contributions
bek+0.022
 rect+0.021
heit+0.021
vered+0.020
 angles+0.020
Neg logit contributions
 sick-0.010
 doctor-0.010
 sleep-0.010
 hospital-0.010
 dentist-0.009
Token amaz
Token ID40642
Feature activation+7.05e+00

Pos logit contributions
bek+0.067
 rect+0.063
heit+0.062
vered+0.056
x+0.054
Neg logit contributions
 sleep-0.086
 sick-0.085
 hospital-0.083
 doctor-0.083
 bat-0.080
Tokenement
Token ID972
Feature activation+3.87e+00

Pos logit contributions
bek+0.227
 rect+0.197
osaurs+0.185
heit+0.182
vered+0.175
Neg logit contributions
 doctor-0.302
 hospital-0.301
 dentist-0.297
 sick-0.284
?"-0.283
Token as
Token ID355
Feature activation-9.87e-01

Pos logit contributions
bek+0.191
 rect+0.189
vered+0.182
heit+0.182
 angles+0.176
Neg logit contributions
 sick-0.090
 hospital-0.087
 doctor-0.087
 dentist-0.084
 sleep-0.082
Token it
Token ID340
Feature activation-2.47e+00

Pos logit contributions
 sick+0.023
 doctor+0.022
 hospital+0.022
 sleep+0.021
 dentist+0.021
Neg logit contributions
bek-0.050
 rect-0.050
heit-0.048
vered-0.048
 angles-0.046
Token flew
Token ID13112
Feature activation-1.23e+00

Pos logit contributions
 sick+0.081
 hospital+0.079
 doctor+0.078
 dentist+0.078
 sleep+0.075
Neg logit contributions
bek-0.098
 rect-0.097
vered-0.094
heit-0.092
 angles-0.089
Token away
Token ID1497
Feature activation+2.70e+00

Pos logit contributions
 sick+0.039
 sleep+0.037
 doctor+0.037
 hospital+0.036
.+0.036
Neg logit contributions
bek-0.053
 rect-0.052
heit-0.050
vered-0.049
 angles-0.045
Tokenuddle
Token ID24500
Feature activation-1.44e+00

Pos logit contributions
 sick+0.022
 doctor+0.022
 dentist+0.022
 hospital+0.022
 sleep+0.021
Neg logit contributions
bek-0.031
 rect-0.031
heit-0.030
vered-0.029
 angles-0.029
Token!"
Token ID2474
Feature activation-1.32e+00

Pos logit contributions
 sick+0.036
 hospital+0.035
 dentist+0.035
 doctor+0.035
 sleep+0.034
Neg logit contributions
bek-0.066
 rect-0.066
heit-0.064
vered-0.064
 angles-0.062
Token she
Token ID673
Feature activation-1.96e-01

Pos logit contributions
 sick+0.037
 hospital+0.036
 doctor+0.036
 sleep+0.035
 dentist+0.034
Neg logit contributions
bek-0.061
 rect-0.059
vered-0.056
heit-0.056
 angles-0.055
Token said
Token ID531
Feature activation-8.74e-01

Pos logit contributions
 sick+0.005
 hospital+0.005
 doctor+0.005
 dentist+0.005
 sleep+0.005
Neg logit contributions
bek-0.009
 rect-0.009
vered-0.008
heit-0.008
 angles-0.008
Token in
Token ID287
Feature activation+2.30e+00

Pos logit contributions
 dentist+0.015
 hospital+0.015
 sick+0.015
 doctor+0.014
 sleep+0.012
Neg logit contributions
bek-0.048
 rect-0.047
vered-0.047
heit-0.045
 angles-0.044
Token amaz
Token ID40642
Feature activation+7.05e+00

Pos logit contributions
bek+0.117
 rect+0.113
heit+0.110
vered+0.109
 angles+0.106
Neg logit contributions
 sick-0.055
 sleep-0.053
 hospital-0.052
 doctor-0.052
 dentist-0.050
Tokenement
Token ID972
Feature activation+4.07e+00

Pos logit contributions
bek+0.189
 rect+0.172
heit+0.155
vered+0.154
osaurs+0.147
Neg logit contributions
 doctor-0.331
 hospital-0.327
 dentist-0.327
 sick-0.322
?"-0.316
Token.
Token ID13
Feature activation+2.67e-01

Pos logit contributions
 rect+0.234
bek+0.233
vered+0.227
heit+0.225
 angles+0.222
Neg logit contributions
 hospital-0.057
 sick-0.057
 dentist-0.055
 doctor-0.054
 sleep-0.046
Token
Token ID220
Feature activation+1.48e+00

Pos logit contributions
bek+0.012
 rect+0.011
vered+0.011
heit+0.011
 angles+0.011
Neg logit contributions
 doctor-0.008
 hospital-0.008
 sick-0.008
 sleep-0.007
 dentist-0.007
Token
Token ID220
Feature activation+1.46e+00

Pos logit contributions
vered+0.064
 rect+0.064
bek+0.064
heit+0.062
 angles+0.061
Neg logit contributions
 sick-0.039
 hospital-0.038
 dentist-0.038
 doctor-0.036
 itch-0.035
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
vered+0.064
 rect+0.063
bek+0.063
heit+0.062
 angles+0.060
Neg logit contributions
 sick-0.038
 hospital-0.038
 dentist-0.037
 doctor-0.036
 sleep-0.035
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 hospital+0.006
 doctor+0.006
 dentist+0.006
 sick+0.006
 paramedics+0.006
Neg logit contributions
bek-0.017
 rect-0.017
vered-0.017
 angles-0.016
 globe-0.016
Token She
Token ID1375
Feature activation-5.02e-01

Pos logit contributions
 sick+0.036
 hospital+0.035
 doctor+0.035
 dentist+0.034
 sleep+0.033
Neg logit contributions
bek-0.058
 rect-0.058
vered-0.055
heit-0.054
 angles-0.053
Token had
Token ID550
Feature activation+1.20e+00

Pos logit contributions
 hospital+0.017
 dentist+0.017
 doctor+0.017
 sick+0.017
 ambulance+0.016
Neg logit contributions
 rect-0.018
bek-0.018
vered-0.017
heit-0.016
 perfectly-0.016
Token a
Token ID257
Feature activation+5.96e-01

Pos logit contributions
bek+0.066
 rect+0.065
heit+0.063
vered+0.063
 angles+0.061
Neg logit contributions
 sick-0.023
 hospital-0.022
 doctor-0.022
 dentist-0.021
 sleep-0.020
Token bad
Token ID2089
Feature activation-1.22e-01

Pos logit contributions
bek+0.031
 rect+0.030
vered+0.030
heit+0.030
 angles+0.029
Neg logit contributions
 sick-0.014
 doctor-0.013
 hospital-0.013
 dentist-0.012
 sleep-0.012
Token tum
Token ID11814
Feature activation-7.38e+00

Pos logit contributions
 sick+0.003
 dentist+0.003
 hospital+0.003
 doctor+0.003
 doctors+0.003
Neg logit contributions
 rect-0.005
bek-0.005
heit-0.005
vered-0.005
 perfectly-0.005
Tokenmy
Token ID1820
Feature activation-6.78e-01

Pos logit contributions
 hospital+0.249
 sick+0.247
 doctor+0.240
 sleep+0.235
 dentist+0.232
Neg logit contributions
bek-0.267
 rect-0.263
 angles-0.258
vered-0.250
heit-0.240
Token ac
Token ID936
Feature activation-5.10e+00

Pos logit contributions
 doctor+0.014
 hospital+0.014
 dentist+0.014
 veterinarian+0.013
 paramedics+0.013
Neg logit contributions
vered-0.031
bek-0.031
 rect-0.030
 perfectly-0.029
heit-0.028
Tokenhe
Token ID258
Feature activation+4.21e-01

Pos logit contributions
 hospital+0.153
 paramedics+0.150
 sleep+0.150
 sick+0.149
 doctor+0.148
Neg logit contributions
bek-0.197
 globe-0.188
x-0.181
vered-0.181
 rect-0.173
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
bek+0.015
 rect+0.015
vered+0.014
heit+0.014
 perfectly+0.014
Neg logit contributions
 sick-0.014
 hospital-0.013
 dentist-0.013
 doctor-0.013
 sleep-0.013
Token her
Token ID607
Feature activation-6.70e-01

Pos logit contributions
 doctor+0.035
 hospital+0.034
 dentist+0.034
 sick+0.034
 sleep+0.033
Neg logit contributions
bek-0.040
 rect-0.038
heit-0.037
vered-0.037
 perfectly-0.035
Token<|endoftext|>
Token ID50256
Feature activation-5.95e-01

Pos logit contributions
bek+0.009
 rect+0.009
vered+0.009
heit+0.009
 angles+0.009
Neg logit contributions
 sick-0.004
 hospital-0.004
 doctor-0.004
 dentist-0.004
 sleep-0.004
TokenOnce
Token ID7454
Feature activation+2.68e+00

Pos logit contributions
 sick+0.017
 hospital+0.016
 dentist+0.015
 doctor+0.015
 itch+0.015
Neg logit contributions
bek-0.026
 rect-0.026
vered-0.026
heit-0.025
 angles-0.025
Token there
Token ID612
Feature activation+1.65e+00

Pos logit contributions
bek+0.096
 rect+0.094
heit+0.090
vered+0.090
 angles+0.088
Neg logit contributions
 sick-0.097
 hospital-0.095
 doctor-0.093
 dentist-0.093
 sleep-0.091
Token was
Token ID373
Feature activation-1.96e+00

Pos logit contributions
bek+0.083
 rect+0.081
heit+0.079
vered+0.076
 angles+0.073
Neg logit contributions
 sick-0.039
 doctor-0.039
 hospital-0.038
 sleep-0.037
 dentist-0.036
Token a
Token ID257
Feature activation-9.37e-01

Pos logit contributions
 sick+0.041
 hospital+0.040
 doctor+0.040
 dentist+0.039
 sleep+0.036
Neg logit contributions
 rect-0.099
bek-0.099
vered-0.096
heit-0.094
 angles-0.092
Token sne
Token ID10505
Feature activation-7.23e+00

Pos logit contributions
 hospital+0.027
 sick+0.026
 dentist+0.026
iver+0.025
 sleep+0.025
Neg logit contributions
bek-0.039
 rect-0.038
 globe-0.034
heit-0.033
Wow-0.033
Tokenez
Token ID8471
Feature activation-3.15e+00

Pos logit contributions
 dentist+0.300
 doctor+0.299
 sick+0.296
 hospital+0.291
 bat+0.280
Neg logit contributions
vered-0.206
bek-0.204
 rect-0.190
heit-0.188
 angles-0.181
Tokening
Token ID278
Feature activation-1.29e+00

Pos logit contributions
 dentist+0.109
 hospital+0.107
 sick+0.106
 sleep+0.106
 itch+0.100
Neg logit contributions
vered-0.108
bek-0.106
 rect-0.101
ically-0.097
 perfectly-0.097
Token boy
Token ID2933
Feature activation+3.73e-01

Pos logit contributions
 dentist+0.033
 sick+0.033
 hospital+0.033
 doctor+0.032
 sleep+0.031
Neg logit contributions
 rect-0.057
bek-0.056
vered-0.055
heit-0.054
 angles-0.052
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
bek+0.020
 rect+0.019
vered+0.019
heit+0.018
 angles+0.018
Neg logit contributions
 sick-0.007
 hospital-0.007
 doctor-0.007
 dentist-0.007
 sleep-0.006
Token He
Token ID679
Feature activation-2.10e-01

Pos logit contributions
 sick+0.030
 hospital+0.030
 dentist+0.029
 doctor+0.029
 doctors+0.027
Neg logit contributions
 rect-0.042
bek-0.041
vered-0.039
heit-0.038
 angles-0.038
Token<|endoftext|>
Token ID50256
Feature activation-1.97e+00

Pos logit contributions
 sick+0.014
 hospital+0.013
 doctor+0.012
 dentist+0.012
 sleep+0.011
Neg logit contributions
bek-0.053
 rect-0.052
vered-0.050
heit-0.050
 angles-0.049
TokenOnce
Token ID7454
Feature activation+1.76e+00

Pos logit contributions
 sick+0.049
 hospital+0.047
 doctor+0.046
 dentist+0.045
 sleep+0.044
Neg logit contributions
bek-0.094
 rect-0.093
vered-0.091
heit-0.088
 angles-0.087
Token there
Token ID612
Feature activation+6.72e-01

Pos logit contributions
bek+0.062
 rect+0.061
vered+0.058
heit+0.057
 angles+0.056
Neg logit contributions
 sick-0.065
 hospital-0.064
 doctor-0.063
 dentist-0.062
 sleep-0.061
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
bek+0.037
heit+0.036
 rect+0.036
vered+0.034
 angles+0.033
Neg logit contributions
 doctor-0.014
 hospital-0.014
 sick-0.012
 sleep-0.012
 dentist-0.012
Token an
Token ID281
Feature activation-5.47e-01

Pos logit contributions
 sick+0.037
 hospital+0.035
 doctor+0.034
 dentist+0.033
 sleep+0.033
Neg logit contributions
bek-0.097
 rect-0.096
vered-0.093
heit-0.093
 angles-0.090
Token ice
Token ID4771
Feature activation-7.07e+00

Pos logit contributions
 sick+0.014
 doctor+0.013
 dentist+0.013
 hospital+0.013
 hurt+0.012
Neg logit contributions
bek-0.025
 rect-0.025
heit-0.023
vered-0.023
 angles-0.023
Token cream
Token ID8566
Feature activation-3.48e+00

Pos logit contributions
 sick+0.203
 dentist+0.200
 sleep+0.197
 hospital+0.196
 doctor+0.188
Neg logit contributions
bek-0.288
 rect-0.276
 globe-0.266
 perfectly-0.257
vered-0.253
Token cone
Token ID27763
Feature activation-4.63e+00

Pos logit contributions
 sick+0.088
 sleep+0.081
 hospital+0.081
 dentist+0.077
 doctor+0.076
Neg logit contributions
bek-0.163
 rect-0.162
vered-0.149
heit-0.146
 angles-0.145
Token.
Token ID13
Feature activation-8.29e-01

Pos logit contributions
 sick+0.110
 dentist+0.106
 hospital+0.104
 sleep+0.104
 doctor+0.099
Neg logit contributions
bek-0.215
 rect-0.209
vered-0.199
 perfectly-0.198
 globe-0.190
Token It
Token ID632
Feature activation-1.23e+00

Pos logit contributions
 doctor+0.024
 sick+0.024
 sleep+0.023
 hospital+0.023
 dentist+0.023
Neg logit contributions
bek-0.036
heit-0.036
 rect-0.036
vered-0.035
 angles-0.034
Token sat
Token ID3332
Feature activation-4.54e+00

Pos logit contributions
 sick+0.044
 hospital+0.043
 doctor+0.042
 dentist+0.042
 doctors+0.041
Neg logit contributions
 rect-0.045
bek-0.044
vered-0.041
heit-0.040
 perfectly-0.038
Token different
Token ID1180
Feature activation+2.51e+00

Pos logit contributions
 doctor+0.003
 sick+0.003
 hospital+0.003
 dentist+0.003
 sleep+0.003
Neg logit contributions
bek-0.004
heit-0.003
vered-0.003
 rect-0.003
x-0.003
Token parts
Token ID3354
Feature activation-1.23e-01

Pos logit contributions
bek+0.104
heit+0.089
vered+0.084
 rect+0.084
ifying+0.078
Neg logit contributions
 sleep-0.091
 doctor-0.089
 sick-0.087
 hospital-0.087
?"-0.086
Token and
Token ID290
Feature activation+1.42e-01

Pos logit contributions
 sick+0.002
 doctor+0.002
 hospital+0.002
 sleep+0.002
 dentist+0.001
Neg logit contributions
bek-0.007
heit-0.007
 rect-0.007
vered-0.007
 angles-0.007
Token explained
Token ID4893
Feature activation+1.71e-01

Pos logit contributions
bek+0.007
vered+0.007
heit+0.007
 rect+0.007
ifying+0.007
Neg logit contributions
 doctor-0.003
 sick-0.003
 hospital-0.003
 dentist-0.003
 doctors-0.003
Token what
Token ID644
Feature activation+8.37e-01

Pos logit contributions
bek+0.009
vered+0.008
 rect+0.008
heit+0.008
osaurs+0.007
Neg logit contributions
 doctor-0.004
 sick-0.004
 hospital-0.004
 dentist-0.004
 doctors-0.004
Token each
Token ID1123
Feature activation-7.05e+00

Pos logit contributions
heit+0.043
bek+0.042
vered+0.041
 rect+0.038
 angles+0.038
Neg logit contributions
 doctor-0.021
 sick-0.021
 dentist-0.020
 hospital-0.019
?"-0.019
Token one
Token ID530
Feature activation-4.19e+00

Pos logit contributions
 doctor+0.140
 sick+0.139
 hospital+0.130
?"+0.125
 sleep+0.124
Neg logit contributions
bek-0.404
heit-0.388
vered-0.375
 rect-0.372
 wrists-0.351
Token did
Token ID750
Feature activation-6.38e-01

Pos logit contributions
 sick+0.115
 doctor+0.111
 hospital+0.109
 dentist+0.106
 sleep+0.104
Neg logit contributions
bek-0.198
heit-0.195
 rect-0.194
vered-0.187
 angles-0.177
Token.
Token ID13
Feature activation+9.72e-02

Pos logit contributions
 sick+0.007
 hospital+0.006
 doctor+0.006
 dentist+0.006
 sleep+0.005
Neg logit contributions
bek-0.040
 rect-0.039
vered-0.038
heit-0.038
 angles-0.037
Token Her
Token ID2332
Feature activation+5.63e-01

Pos logit contributions
bek+0.005
heit+0.004
 rect+0.004
vered+0.004
 angles+0.004
Neg logit contributions
 doctor-0.003
 sick-0.003
 hospital-0.003
 sleep-0.002
 dentist-0.002
Token mom
Token ID1995
Feature activation-2.05e+00

Pos logit contributions
bek+0.034
 rect+0.033
heit+0.032
vered+0.032
 angles+0.031
Neg logit contributions
 sick-0.008
 doctor-0.008
 hospital-0.007
 dentist-0.007
 sleep-0.007
Token apple
Token ID17180
Feature activation-2.12e+00

Pos logit contributions
bek+0.085
 rect+0.085
vered+0.082
heit+0.081
 angles+0.079
Neg logit contributions
 sick-0.052
 hospital-0.051
 doctor-0.049
 dentist-0.047
 sleep-0.046
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
 sick+0.036
 hospital+0.035
 doctor+0.034
 dentist+0.033
 sleep+0.032
Neg logit contributions
bek-0.117
 rect-0.116
vered-0.112
heit-0.111
 angles-0.109
Token With
Token ID2080
Feature activation+2.80e+00

Pos logit contributions
bek+0.010
 rect+0.010
vered+0.010
heit+0.010
 angles+0.009
Neg logit contributions
 sick-0.005
 doctor-0.005
 hospital-0.005
 dentist-0.005
 sleep-0.005
Token a
Token ID257
Feature activation+4.23e-01

Pos logit contributions
bek+0.157
 rect+0.151
vered+0.145
heit+0.144
 angles+0.143
Neg logit contributions
 sick-0.056
 doctor-0.051
 hospital-0.051
 sleep-0.050
 dentist-0.047
Token full
Token ID1336
Feature activation-1.42e+00

Pos logit contributions
bek+0.023
 rect+0.023
vered+0.022
heit+0.022
 angles+0.021
Neg logit contributions
 sick-0.008
 doctor-0.008
 hospital-0.008
 dentist-0.007
 sleep-0.007
Token tum
Token ID11814
Feature activation-7.02e+00

Pos logit contributions
 sick+0.030
 hospital+0.029
 dentist+0.028
 doctor+0.027
 doctors+0.026
Neg logit contributions
bek-0.071
 rect-0.070
vered-0.067
heit-0.067
 angles-0.066
Tokenmy
Token ID1820
Feature activation-3.40e-01

Pos logit contributions
 sick+0.203
 hospital+0.197
 doctor+0.195
 dentist+0.190
 sleep+0.188
Neg logit contributions
bek-0.309
 rect-0.304
vered-0.293
heit-0.291
 angles-0.283
Token,
Token ID11
Feature activation-8.15e-01

Pos logit contributions
 sick+0.007
 doctor+0.007
 hospital+0.007
 sleep+0.007
 dentist+0.007
Neg logit contributions
bek-0.018
 rect-0.017
heit-0.017
vered-0.017
 angles-0.016
Token he
Token ID339
Feature activation-6.80e-01

Pos logit contributions
 sick+0.016
 doctor+0.016
 hospital+0.015
 dentist+0.014
 sleep+0.014
Neg logit contributions
bek-0.045
 rect-0.044
heit-0.043
vered-0.043
 angles-0.041
Token marched
Token ID23558
Feature activation+9.36e-01

Pos logit contributions
 sick+0.021
 hospital+0.021
 doctor+0.021
 dentist+0.020
 sleep+0.020
Neg logit contributions
bek-0.027
 rect-0.027
vered-0.026
heit-0.026
 angles-0.025
Token back
Token ID736
Feature activation-1.39e+00

Pos logit contributions
bek+0.042
 rect+0.041
vered+0.039
heit+0.039
 angles+0.038
Neg logit contributions
 sick-0.027
 hospital-0.026
 doctor-0.026
 dentist-0.025
 sleep-0.024
Token did
Token ID750
Feature activation+1.07e+00

Pos logit contributions
 sick+0.056
 hospital+0.054
 doctor+0.054
 dentist+0.053
 sleep+0.052
Neg logit contributions
bek-0.066
 rect-0.065
vered-0.062
heit-0.062
 angles-0.060
Token not
Token ID407
Feature activation-8.57e-01

Pos logit contributions
bek+0.050
 rect+0.049
vered+0.048
heit+0.048
 angles+0.046
Neg logit contributions
 sick-0.027
 hospital-0.027
 doctor-0.026
 dentist-0.026
 sleep-0.025
Token have
Token ID423
Feature activation-1.33e-01

Pos logit contributions
 dentist+0.021
 hospital+0.021
 doctor+0.021
 sick+0.021
 doctors+0.018
Neg logit contributions
bek-0.038
 rect-0.038
vered-0.037
 angles-0.036
heit-0.036
Token a
Token ID257
Feature activation+7.01e-01

Pos logit contributions
 sick+0.003
 hospital+0.003
 doctor+0.003
 dentist+0.003
 sleep+0.003
Neg logit contributions
bek-0.006
 rect-0.006
vered-0.006
heit-0.006
 angles-0.006
Token messy
Token ID24097
Feature activation-2.14e-01

Pos logit contributions
 rect+0.035
bek+0.035
vered+0.034
heit+0.034
 angles+0.033
Neg logit contributions
 sick-0.018
 doctor-0.017
 hospital-0.017
 sleep-0.016
 dentist-0.016
Token tum
Token ID11814
Feature activation-7.00e+00

Pos logit contributions
 sick+0.006
 hospital+0.005
 dentist+0.005
 doctor+0.005
 sleep+0.005
Neg logit contributions
 rect-0.009
bek-0.009
vered-0.009
heit-0.009
 angles-0.009
Tokenmy
Token ID1820
Feature activation-3.72e-01

Pos logit contributions
 sick+0.214
 hospital+0.212
 doctor+0.207
 sleep+0.203
 dentist+0.202
Neg logit contributions
bek-0.289
 rect-0.284
vered-0.273
 angles-0.268
heit-0.266
Token anymore
Token ID7471
Feature activation-9.10e-01

Pos logit contributions
 hospital+0.007
 sick+0.007
 doctor+0.007
 dentist+0.007
 sleep+0.007
Neg logit contributions
bek-0.019
 rect-0.019
vered-0.018
 perfectly-0.017
heit-0.017
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
 sick+0.011
 hospital+0.010
 doctor+0.010
 dentist+0.009
 sleep+0.008
Neg logit contributions
bek-0.055
 rect-0.054
vered-0.053
heit-0.053
 angles-0.052
Token<|endoftext|>
Token ID50256
Feature activation-9.31e-01

Pos logit contributions
 doctor+0.003
 hospital+0.003
 dentist+0.003
 sleep+0.003
 sick+0.003
Neg logit contributions
bek-0.006
 rect-0.006
heit-0.006
vered-0.006
 angles-0.006
TokenBen
Token ID11696
Feature activation-1.44e+00

Pos logit contributions
 sick+0.025
 hospital+0.024
 doctor+0.023
 dentist+0.023
 itch+0.023
Neg logit contributions
bek-0.041
 rect-0.041
vered-0.041
heit-0.039
 angles-0.039
Token hungry
Token ID14720
Feature activation-4.40e+00

Pos logit contributions
 hospital+0.027
 sick+0.026
 dentist+0.026
 sleep+0.026
 doctor+0.025
Neg logit contributions
bek-0.045
 rect-0.044
heit-0.040
vered-0.039
 globe-0.039
Token little
Token ID1310
Feature activation+3.21e-01

Pos logit contributions
iver+0.125
 sick+0.124
 hospital+0.117
 rested+0.111
 doctors+0.110
Neg logit contributions
 rect-0.165
bek-0.164
 globe-0.158
heit-0.146
Wow-0.143
Token girl
Token ID2576
Feature activation-4.63e-01

Pos logit contributions
bek+0.013
 rect+0.013
heit+0.012
vered+0.012
 angles+0.011
Neg logit contributions
 hospital-0.010
 sick-0.009
 dentist-0.009
 doctor-0.009
 sleep-0.009
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
 hospital+0.012
 dentist+0.012
 sick+0.011
 doctor+0.011
 itch+0.011
Neg logit contributions
bek-0.020
 rect-0.020
vered-0.019
 angles-0.018
ra-0.018
Token Her
Token ID2332
Feature activation-5.08e-01

Pos logit contributions
 sick+0.040
 hospital+0.039
 doctor+0.038
 dentist+0.038
 sleep+0.036
Neg logit contributions
 rect-0.053
bek-0.052
vered-0.049
heit-0.048
 angles-0.048
Token tum
Token ID11814
Feature activation-6.98e+00

Pos logit contributions
aster+0.018
 hospital+0.018
 paramedics+0.017
iver+0.017
 dentist+0.017
Neg logit contributions
bek-0.015
 magn-0.014
 meanings-0.014
 perfectly-0.014
ra-0.014
Tokenmy
Token ID1820
Feature activation-6.07e-01

Pos logit contributions
 hospital+0.230
 sick+0.230
 doctor+0.224
 sleep+0.218
 dentist+0.217
Neg logit contributions
bek-0.264
 rect-0.256
vered-0.249
 angles-0.248
heit-0.243
Token was
Token ID373
Feature activation-6.44e-01

Pos logit contributions
 hospital+0.023
 doctor+0.022
 ambulance+0.022
 dentist+0.022
 paramedics+0.022
Neg logit contributions
vered-0.018
bek-0.017
 perfectly-0.017
 rect-0.016
heit-0.016
Token so
Token ID523
Feature activation-1.29e+00

Pos logit contributions
 hospital+0.019
 dentist+0.019
 doctor+0.019
 paramedics+0.019
 veterinarian+0.019
Neg logit contributions
 perfectly-0.022
 rect-0.021
ifying-0.020
bek-0.020
heit-0.019
Token big
Token ID1263
Feature activation-1.79e+00

Pos logit contributions
 hospital+0.036
 dentist+0.035
 veterinarian+0.035
 doctor+0.034
 ambulance+0.034
Neg logit contributions
heit-0.051
bek-0.050
 rect-0.048
 perfectly-0.048
ifying-0.048
Token and
Token ID290
Feature activation-2.24e+00

Pos logit contributions
 sick+0.046
 hospital+0.043
 doctor+0.042
 dentist+0.041
 sleep+0.040
Neg logit contributions
 rect-0.083
bek-0.081
vered-0.078
heit-0.078
 angles-0.076
Token many
Token ID867
Feature activation+1.44e+00

Pos logit contributions
 hospital+0.127
 dentist+0.125
 doctor+0.125
 ambulance+0.122
 doctors+0.121
Neg logit contributions
 rect-0.109
bek-0.108
 perfectly-0.105
heit-0.105
vered-0.104
Token carrots
Token ID34397
Feature activation-1.05e+00

Pos logit contributions
 rect+0.061
bek+0.061
vered+0.059
heit+0.058
 angles+0.057
Neg logit contributions
 sick-0.043
 hospital-0.042
 doctor-0.042
 dentist-0.041
 sleep-0.039
Token and
Token ID290
Feature activation+3.66e-02

Pos logit contributions
 hospital+0.023
 dentist+0.023
 sick+0.023
 doctor+0.023
 doctors+0.021
Neg logit contributions
 rect-0.051
bek-0.051
vered-0.049
heit-0.048
 perfectly-0.047
Token got
Token ID1392
Feature activation-2.04e+00

Pos logit contributions
bek+0.002
 rect+0.002
vered+0.002
heit+0.002
 angles+0.002
Neg logit contributions
 sick-0.001
 hospital-0.001
 doctor-0.001
 dentist-0.001
 sleep-0.001
Token a
Token ID257
Feature activation+1.64e+00

Pos logit contributions
 hospital+0.052
 dentist+0.052
 doctor+0.051
iver+0.049
 sleep+0.049
Neg logit contributions
 rect-0.090
bek-0.086
vered-0.085
heit-0.084
 angles-0.084
Token tum
Token ID11814
Feature activation-6.93e+00

Pos logit contributions
bek+0.083
 rect+0.082
heit+0.078
vered+0.078
 angles+0.076
Neg logit contributions
 sick-0.034
 hospital-0.034
 doctor-0.033
 dentist-0.033
 sleep-0.031
Tokenmy
Token ID1820
Feature activation-6.20e-02

Pos logit contributions
 hospital+0.267
 sick+0.253
 sleep+0.252
 doctor+0.251
 veterinarian+0.243
Neg logit contributions
bek-0.220
 angles-0.215
 rect-0.212
ogram-0.207
vered-0.201
Token ac
Token ID936
Feature activation-4.55e+00

Pos logit contributions
 hospital+0.001
 doctor+0.001
 dentist+0.001
 veterinarian+0.001
 paramedics+0.001
Neg logit contributions
vered-0.003
 rect-0.003
 perfectly-0.003
bek-0.003
 angles-0.003
Tokenhe
Token ID258
Feature activation+1.12e+00

Pos logit contributions
 doctors+0.153
 sleep+0.151
 hospital+0.150
 lecture+0.149
 paramedics+0.149
Neg logit contributions
bek-0.160
x-0.150
 globe-0.147
vered-0.145
 perfectly-0.138
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
bek+0.057
 rect+0.057
vered+0.056
 perfectly+0.054
heit+0.054
Neg logit contributions
 sick-0.020
 hospital-0.019
 dentist-0.019
 doctor-0.019
 sleep-0.018
Token His
Token ID2399
Feature activation+7.33e-01

Pos logit contributions
 sick+0.013
 doctor+0.013
 hospital+0.013
 dentist+0.012
 sleep+0.012
Neg logit contributions
bek-0.018
 rect-0.017
vered-0.017
heit-0.017
 angles-0.016
Token or
Token ID393
Feature activation+4.84e-01

Pos logit contributions
 dentist+0.000
 hospital+0.000
 doctor+0.000
 veterinarian+0.000
iver+0.000
Neg logit contributions
 rect-0.000
heit-0.000
 angles-0.000
vered-0.000
 topp-0.000
Token you
Token ID345
Feature activation+2.58e-01

Pos logit contributions
 rect+0.021
bek+0.021
heit+0.021
vered+0.020
 angles+0.020
Neg logit contributions
 dentist-0.012
 hospital-0.012
 doctor-0.012
 veterinarian-0.011
iver-0.011
Token will
Token ID481
Feature activation+7.67e-02

Pos logit contributions
bek+0.009
 perfectly+0.009
vered+0.008
 rect+0.008
heit+0.008
Neg logit contributions
 dentist-0.009
 hospital-0.009
 doctors-0.009
 doctor-0.009
 ambulance-0.009
Token get
Token ID651
Feature activation-2.46e+00

Pos logit contributions
 rect+0.003
heit+0.003
 perfectly+0.003
 angles+0.003
ra+0.003
Neg logit contributions
 dentist-0.002
 doctors-0.002
 ambulance-0.002
 doctor-0.002
 elderly-0.002
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
 dentist+0.062
 hospital+0.057
 doctor+0.057
 veterinarian+0.056
iver+0.056
Neg logit contributions
 perfectly-0.106
heit-0.106
 rect-0.104
vered-0.103
bek-0.102
Token tum
Token ID11814
Feature activation-6.92e+00

Pos logit contributions
bek+0.065
 rect+0.063
heit+0.061
vered+0.058
 angles+0.057
Neg logit contributions
 dentist-0.035
 hospital-0.035
 doctors-0.034
 sick-0.034
 doctor-0.033
Tokenmy
Token ID1820
Feature activation+1.08e-01

Pos logit contributions
 hospital+0.260
 sick+0.252
 doctor+0.247
 sleep+0.245
 veterinarian+0.241
Neg logit contributions
bek-0.224
 rect-0.218
 angles-0.215
ogram-0.208
anish-0.203
Token ac
Token ID936
Feature activation-4.70e+00

Pos logit contributions
 rect+0.005
vered+0.005
bek+0.005
 perfectly+0.005
heit+0.005
Neg logit contributions
 hospital-0.002
 dentist-0.002
 doctor-0.002
 veterinarian-0.002
 doctors-0.002
Tokenhe
Token ID258
Feature activation+1.09e+00

Pos logit contributions
 doctors+0.164
 sleep+0.163
 sne+0.156
 paramedics+0.155
 dentist+0.153
Neg logit contributions
bek-0.162
 globe-0.151
izing-0.143
x-0.143
vered-0.136
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
bek+0.054
 rect+0.054
vered+0.052
heit+0.052
 angles+0.050
Neg logit contributions
 sick-0.022
 hospital-0.022
 dentist-0.022
 doctor-0.021
 sleep-0.021
Token Stay
Token ID16160
Feature activation-1.75e+00

Pos logit contributions
bek+0.001
heit+0.001
vered+0.001
 rect+0.001
 angles+0.001
Neg logit contributions
 doctor-0.001
 sick-0.001
 hospital-0.001
 dentist-0.001
 sleep-0.001
TokenWhat
Token ID2061
Feature activation+1.35e+00

Pos logit contributions
 rect+0.160
heit+0.159
bek+0.157
vered+0.151
 angles+0.149
Neg logit contributions
 sick-0.120
 doctor-0.114
 dentist-0.111
 itch-0.110
 hospital-0.109
Token if
Token ID611
Feature activation-1.77e-01

Pos logit contributions
bek+0.057
 rect+0.056
heit+0.054
vered+0.054
 angles+0.052
Neg logit contributions
 sick-0.043
 doctor-0.041
 hospital-0.041
 dentist-0.040
 sleep-0.039
Token I
Token ID314
Feature activation-2.18e+00

Pos logit contributions
 sick+0.007
 hospital+0.007
 dentist+0.006
 sleep+0.006
 doctor+0.006
Neg logit contributions
bek-0.006
 rect-0.006
heit-0.006
vered-0.006
 angles-0.005
Token get
Token ID651
Feature activation-4.03e+00

Pos logit contributions
 dentist+0.066
 hospital+0.065
 doctors+0.065
 doctor+0.063
 paramedics+0.062
Neg logit contributions
bek-0.083
vered-0.083
 rect-0.079
heit-0.078
ra-0.078
Token a
Token ID257
Feature activation+7.52e-01

Pos logit contributions
 hospital+0.105
 dentist+0.104
 doctor+0.100
 sleep+0.099
iver+0.094
Neg logit contributions
bek-0.172
 rect-0.170
heit-0.170
vered-0.169
 angles-0.164
Token tum
Token ID11814
Feature activation-6.92e+00

Pos logit contributions
bek+0.034
 rect+0.033
heit+0.032
vered+0.032
 angles+0.031
Neg logit contributions
 hospital-0.019
 sick-0.019
 dentist-0.019
 doctor-0.018
 doctors-0.018
Tokenmy
Token ID1820
Feature activation-2.53e-01

Pos logit contributions
 hospital+0.261
 sleep+0.246
 sick+0.241
 veterinarian+0.241
 doctor+0.240
Neg logit contributions
bek-0.219
 rect-0.215
 angles-0.214
ogram-0.209
anish-0.209
Token ac
Token ID936
Feature activation-4.90e+00

Pos logit contributions
 hospital+0.005
 paramedics+0.005
 dentist+0.005
 doctor+0.005
 veterinarian+0.005
Neg logit contributions
 perfectly-0.011
 rect-0.011
ra-0.011
vered-0.011
bek-0.010
Tokenhe
Token ID258
Feature activation+6.75e-01

Pos logit contributions
 sleep+0.168
 doctors+0.168
 paramedics+0.163
 hospital+0.161
 dentist+0.155
Neg logit contributions
bek-0.173
izing-0.152
 globe-0.151
x-0.148
vered-0.148
Token?"
Token ID1701
Feature activation-2.55e+00

Pos logit contributions
bek+0.036
 rect+0.036
heit+0.035
vered+0.035
 perfectly+0.034
Neg logit contributions
 hospital-0.010
 sick-0.009
 dentist-0.009
 sleep-0.009
 doctor-0.008
Token But
Token ID887
Feature activation+1.74e+00

Pos logit contributions
 sick+0.072
 doctor+0.070
 hospital+0.069
 dentist+0.067
 sleep+0.066
Neg logit contributions
bek-0.117
 rect-0.115
vered-0.110
heit-0.110
 angles-0.106
Token time
Token ID640
Feature activation+1.22e+00

Pos logit contributions
 sick+0.040
 hospital+0.038
 doctor+0.038
 dentist+0.037
 sleep+0.035
Neg logit contributions
bek-0.070
 rect-0.069
heit-0.067
vered-0.066
 angles-0.064
Token there
Token ID612
Feature activation+9.76e-01

Pos logit contributions
bek+0.078
 rect+0.073
heit+0.072
vered+0.072
 angles+0.071
Neg logit contributions
?"-0.015
 doctor-0.013
 sick-0.013
 sleep-0.013
 hospital-0.011
Token lived
Token ID5615
Feature activation-2.35e-01

Pos logit contributions
bek+0.044
 rect+0.043
heit+0.041
vered+0.041
 angles+0.039
Neg logit contributions
 sick-0.028
 hospital-0.027
 doctor-0.027
 dentist-0.026
 sleep-0.026
Token a
Token ID257
Feature activation-8.79e-01

Pos logit contributions
 hospital+0.007
 dentist+0.006
 sick+0.006
 itch+0.006
 doctor+0.006
Neg logit contributions
 rect-0.010
bek-0.010
vered-0.010
Alex-0.009
anish-0.009
Token little
Token ID1310
Feature activation+6.04e-01

Pos logit contributions
 hospital+0.023
 sick+0.023
 dentist+0.022
 doctor+0.022
 sleep+0.021
Neg logit contributions
bek-0.039
 rect-0.038
heit-0.035
vered-0.034
 globe-0.034
Token chip
Token ID11594
Feature activation-6.90e+00

Pos logit contributions
bek+0.026
 rect+0.025
vered+0.024
heit+0.024
 angles+0.023
Neg logit contributions
 sick-0.018
 hospital-0.017
 dentist-0.017
 doctor-0.017
 sleep-0.016
Tokenm
Token ID76
Feature activation-4.54e+00

Pos logit contributions
 dentist+0.203
 sleep+0.200
 hospital+0.193
 doctor+0.191
 sick+0.191
Neg logit contributions
bek-0.265
 rect-0.260
vered-0.254
 angles-0.235
heit-0.233
Tokenunk
Token ID2954
Feature activation-2.73e+00

Pos logit contributions
 hospital+0.210
 dentist+0.208
 sleep+0.203
 doctor+0.201
 veterinarian+0.197
Neg logit contributions
 rect-0.106
vered-0.101
bek-0.099
ra-0.095
ogram-0.089
Token named
Token ID3706
Feature activation-2.30e+00

Pos logit contributions
 dentist+0.067
 sick+0.066
 hospital+0.065
 sleep+0.063
 doctor+0.062
Neg logit contributions
bek-0.123
 rect-0.122
vered-0.118
 angles-0.111
heit-0.111
Token Spike
Token ID26309
Feature activation+7.14e-01

Pos logit contributions
 hospital+0.069
 sick+0.067
 doctor+0.067
 dentist+0.067
 doctors+0.064
Neg logit contributions
bek-0.090
 rect-0.090
heit-0.085
vered-0.082
 angles-0.080
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
vered+0.035
 rect+0.035
bek+0.034
 perfectly+0.033
heit+0.032
Neg logit contributions
 dentist-0.015
 hospital-0.014
 itch-0.014
 doctor-0.014
 sick-0.014
Tokeny
Token ID88
Feature activation+1.21e+00

Pos logit contributions
bek+0.018
 rect+0.017
heit+0.017
vered+0.016
 angles+0.015
Neg logit contributions
 sick-0.018
 doctor-0.017
 hospital-0.016
 bat-0.016
?"-0.016
Token ears
Token ID11368
Feature activation-2.19e-01

Pos logit contributions
bek+0.053
heit+0.047
 rect+0.046
vered+0.045
vin+0.043
Neg logit contributions
 sick-0.040
 sleep-0.038
 hospital-0.037
 doctor-0.037
 itch-0.036
Token and
Token ID290
Feature activation+1.92e-01

Pos logit contributions
 sick+0.004
 doctor+0.004
 hospital+0.004
 sleep+0.003
 dentist+0.003
Neg logit contributions
bek-0.013
heit-0.012
 rect-0.012
vered-0.012
 angles-0.011
Token a
Token ID257
Feature activation+3.31e-01

Pos logit contributions
bek+0.009
 rect+0.008
heit+0.008
vered+0.008
vin+0.007
Neg logit contributions
 sick-0.006
 sleep-0.006
 hospital-0.005
 doctor-0.005
 bat-0.005
Token round
Token ID2835
Feature activation-9.05e-01

Pos logit contributions
bek+0.016
 rect+0.015
vered+0.015
heit+0.015
 angles+0.015
Neg logit contributions
 sick-0.009
 doctor-0.008
 hospital-0.008
 sleep-0.008
 dentist-0.008
Token tum
Token ID11814
Feature activation-6.89e+00

Pos logit contributions
 sick+0.026
 hospital+0.025
 doctor+0.025
 dentist+0.025
 sleep+0.024
Neg logit contributions
bek-0.039
 rect-0.039
vered-0.037
heit-0.037
 angles-0.036
Tokenmy
Token ID1820
Feature activation-1.53e-01

Pos logit contributions
 hospital+0.213
 sick+0.213
 doctor+0.204
 sleep+0.202
 dentist+0.199
Neg logit contributions
 rect-0.279
bek-0.278
 angles-0.270
vered-0.263
heit-0.259
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
 sick+0.004
 hospital+0.003
 doctor+0.003
 dentist+0.003
 sleep+0.003
Neg logit contributions
 rect-0.007
bek-0.007
vered-0.007
heit-0.007
 angles-0.007
Token Where
Token ID6350
Feature activation+1.65e+00

Pos logit contributions
 sick+0.021
 hospital+0.020
 doctor+0.020
 dentist+0.020
 sleep+0.020
Neg logit contributions
 rect-0.029
bek-0.028
vered-0.027
heit-0.027
 angles-0.026
Tokenver
Token ID332
Feature activation+1.99e+00

Pos logit contributions
bek+0.064
 rect+0.063
vered+0.061
heit+0.060
 angles+0.058
Neg logit contributions
 sick-0.056
 hospital-0.055
 doctor-0.054
 dentist-0.053
 sleep-0.053
Token he
Token ID339
Feature activation-2.82e+00

Pos logit contributions
bek+0.093
 rect+0.091
vered+0.088
heit+0.088
 angles+0.085
Neg logit contributions
 sick-0.052
 hospital-0.050
 doctor-0.050
 dentist-0.049
 sleep-0.048
Token their
Token ID511
Feature activation+2.39e-01

Pos logit contributions
 sick+0.034
 hospital+0.033
 doctor+0.033
 dentist+0.032
 sleep+0.031
Neg logit contributions
bek-0.058
 rect-0.057
vered-0.055
heit-0.055
 angles-0.053
Token snack
Token ID26906
Feature activation-3.97e-01

Pos logit contributions
 rect+0.012
bek+0.012
heit+0.011
vered+0.011
 angles+0.011
Neg logit contributions
 sick-0.005
 hospital-0.005
 doctor-0.005
 dentist-0.005
 doctors-0.005
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
 sick+0.007
 hospital+0.007
 doctor+0.007
 dentist+0.007
 sleep+0.006
Neg logit contributions
 rect-0.021
bek-0.021
vered-0.021
heit-0.020
 perfectly-0.020
Token With
Token ID2080
Feature activation+2.78e+00

Pos logit contributions
 doctor+0.005
 hospital+0.004
 sick+0.004
 sleep+0.004
 dentist+0.004
Neg logit contributions
bek-0.011
 rect-0.011
vered-0.011
heit-0.011
 angles-0.010
Token happy
Token ID3772
Feature activation+1.44e+00

Pos logit contributions
bek+0.152
 rect+0.145
vered+0.140
heit+0.139
 angles+0.138
Neg logit contributions
 sick-0.059
 sleep-0.056
 doctor-0.055
 hospital-0.055
 dentist-0.050
Token tum
Token ID11814
Feature activation-6.87e+00

Pos logit contributions
bek+0.060
 rect+0.059
vered+0.056
heit+0.056
 angles+0.054
Neg logit contributions
 sick-0.046
 hospital-0.045
 doctor-0.044
 dentist-0.043
 sleep-0.043
Tokenm
Token ID76
Feature activation-2.56e+00

Pos logit contributions
 sick+0.210
 doctor+0.203
 hospital+0.201
 dentist+0.199
 sleep+0.194
Neg logit contributions
bek-0.295
 rect-0.290
heit-0.277
vered-0.277
 angles-0.260
Tokenies
Token ID444
Feature activation-1.32e+00

Pos logit contributions
 hospital+0.085
 sick+0.084
 doctor+0.084
 dentist+0.084
 sleep+0.080
Neg logit contributions
bek-0.098
 rect-0.098
vered-0.094
heit-0.094
 angles-0.090
Token,
Token ID11
Feature activation-8.74e-01

Pos logit contributions
 sick+0.030
 doctor+0.029
 hospital+0.029
 sleep+0.028
 dentist+0.027
Neg logit contributions
bek-0.067
 rect-0.065
heit-0.064
vered-0.063
 angles-0.061
Token they
Token ID484
Feature activation-4.94e-01

Pos logit contributions
 sick+0.024
 doctor+0.024
 hospital+0.023
 sleep+0.023
 doctors+0.021
Neg logit contributions
bek-0.042
 rect-0.041
heit-0.041
vered-0.040
 angles-0.040
Token went
Token ID1816
Feature activation-1.48e+00

Pos logit contributions
 hospital+0.016
 sick+0.016
 doctor+0.016
 dentist+0.016
 doctors+0.015
Neg logit contributions
bek-0.020
 rect-0.020
heit-0.019
vered-0.019
 angles-0.018
Token eating
Token ID6600
Feature activation-2.38e+00

Pos logit contributions
bek+0.022
 rect+0.022
vered+0.021
heit+0.021
 angles+0.020
Neg logit contributions
 sick-0.013
 hospital-0.013
 doctor-0.013
 dentist-0.012
 sleep-0.012
Token too
Token ID1165
Feature activation-3.40e+00

Pos logit contributions
 sick+0.057
 hospital+0.054
 doctor+0.054
 dentist+0.052
 sleep+0.051
Neg logit contributions
bek-0.118
 rect-0.116
vered-0.112
heit-0.112
 angles-0.108
Token much
Token ID881
Feature activation+3.23e-01

Pos logit contributions
 hospital+0.100
 dentist+0.099
 doctor+0.099
 ambulance+0.094
 sick+0.094
Neg logit contributions
bek-0.136
vered-0.133
heit-0.131
 angles-0.130
 rect-0.130
Token because
Token ID780
Feature activation-7.25e-01

Pos logit contributions
 rect+0.017
vered+0.016
bek+0.016
 angles+0.016
heit+0.016
Neg logit contributions
 hospital-0.006
 sick-0.006
 doctor-0.006
 dentist-0.006
 bat-0.005
Token his
Token ID465
Feature activation-7.70e-01

Pos logit contributions
 sick+0.019
 hospital+0.018
 doctor+0.018
 dentist+0.018
 sleep+0.017
Neg logit contributions
bek-0.034
 rect-0.033
vered-0.032
heit-0.032
 angles-0.031
Token tum
Token ID11814
Feature activation-6.87e+00

Pos logit contributions
 sick+0.021
 hospital+0.020
 doctor+0.020
 dentist+0.020
 sleep+0.018
Neg logit contributions
bek-0.034
 rect-0.033
vered-0.033
heit-0.032
 angles-0.032
Tokenmy
Token ID1820
Feature activation-1.85e-01

Pos logit contributions
 sick+0.220
 hospital+0.219
 doctor+0.212
 sleep+0.207
 dentist+0.204
Neg logit contributions
bek-0.274
 rect-0.267
vered-0.259
 angles-0.255
heit-0.252
Token hurt
Token ID5938
Feature activation-4.21e+00

Pos logit contributions
 sick+0.005
 hospital+0.005
 doctor+0.005
 dentist+0.005
 sleep+0.005
Neg logit contributions
bek-0.008
 rect-0.008
vered-0.008
heit-0.007
 angles-0.007
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
 sick+0.055
 hospital+0.052
 doctor+0.051
 dentist+0.048
 sleep+0.046
Neg logit contributions
bek-0.250
 rect-0.248
vered-0.242
heit-0.240
 angles-0.234
Token The
Token ID383
Feature activation-1.05e+00

Pos logit contributions
 sick+0.015
 hospital+0.014
 doctor+0.014
 dentist+0.014
 sleep+0.014
Neg logit contributions
bek-0.024
 rect-0.024
vered-0.023
heit-0.023
 angles-0.022
Token bear
Token ID6842
Feature activation-4.97e-01

Pos logit contributions
 sick+0.024
iver+0.024
 hospital+0.023
 sleep+0.023
 bat+0.023
Neg logit contributions
bek-0.045
 rect-0.044
heit-0.043
 angles-0.043
vered-0.043
Token time
Token ID640
Feature activation+1.00e+00

Pos logit contributions
 sick+0.055
 hospital+0.054
 doctor+0.054
 dentist+0.053
 doctors+0.050
Neg logit contributions
 rect-0.083
bek-0.081
heit-0.077
vered-0.076
 angles-0.072
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
bek+0.058
 rect+0.056
heit+0.055
vered+0.055
 angles+0.054
Neg logit contributions
 sick-0.016
 doctor-0.015
?"-0.015
 sleep-0.015
 hospital-0.014
Token there
Token ID612
Feature activation+5.74e-01

Pos logit contributions
 sick+0.040
?"+0.036
 doctor+0.034
 hospital+0.032
 cold+0.032
Neg logit contributions
bek-0.097
 rect-0.095
vered-0.093
heit-0.093
 angles-0.091
Token was
Token ID373
Feature activation-1.77e+00

Pos logit contributions
bek+0.022
 rect+0.022
vered+0.021
heit+0.021
 angles+0.020
Neg logit contributions
 sick-0.019
 hospital-0.019
 doctor-0.018
 dentist-0.018
 sleep-0.018
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
 sick+0.036
 hospital+0.035
 doctor+0.034
 dentist+0.033
 sleep+0.032
Neg logit contributions
bek-0.092
 rect-0.091
vered-0.088
heit-0.087
 angles-0.085
Token chip
Token ID11594
Feature activation-6.84e+00

Pos logit contributions
 hospital+0.027
 sick+0.027
 dentist+0.026
 sleep+0.026
 doctor+0.025
Neg logit contributions
bek-0.048
 rect-0.047
heit-0.043
vered-0.042
 globe-0.041
Tokenm
Token ID76
Feature activation-4.47e+00

Pos logit contributions
 hospital+0.181
 sleep+0.180
 sick+0.180
 dentist+0.180
 doctor+0.173
Neg logit contributions
 rect-0.291
bek-0.291
vered-0.280
heit-0.265
 angles-0.260
Tokenunk
Token ID2954
Feature activation-2.82e+00

Pos logit contributions
 hospital+0.201
 dentist+0.197
 doctor+0.192
 sleep+0.192
 sick+0.185
Neg logit contributions
 rect-0.109
bek-0.106
vered-0.104
ra-0.098
ogram-0.095
Token who
Token ID508
Feature activation-7.76e-02

Pos logit contributions
 dentist+0.072
 sick+0.072
 hospital+0.071
 sleep+0.069
 doctor+0.067
Neg logit contributions
bek-0.122
 rect-0.121
vered-0.115
 angles-0.108
 perfectly-0.108
Token lived
Token ID5615
Feature activation-8.30e-02

Pos logit contributions
 sick+0.002
 hospital+0.002
 doctor+0.002
 sleep+0.002
 dentist+0.002
Neg logit contributions
bek-0.003
 rect-0.003
heit-0.003
vered-0.003
 angles-0.003
Token in
Token ID287
Feature activation+1.04e-01

Pos logit contributions
 dentist+0.004
 hospital+0.003
 doctor+0.003
 tum+0.003
 sleep+0.003
Neg logit contributions
bek-0.002
 perfectly-0.002
vered-0.002
rave-0.002
 rect-0.002
Token the
Token ID262
Feature activation-1.60e+00

Pos logit contributions
 sick+0.032
 hospital+0.030
 doctor+0.030
 dentist+0.030
 sleep+0.028
Neg logit contributions
 rect-0.060
bek-0.059
heit-0.057
vered-0.057
 angles-0.056
Token jungle
Token ID20712
Feature activation+2.21e-01

Pos logit contributions
 sick+0.041
 hospital+0.040
 doctor+0.039
 dentist+0.038
 sleep+0.038
Neg logit contributions
bek-0.076
 rect-0.075
vered-0.073
heit-0.072
 angles-0.069
Token and
Token ID290
Feature activation-7.33e-01

Pos logit contributions
bek+0.011
 rect+0.011
heit+0.011
vered+0.011
 angles+0.010
Neg logit contributions
 sick-0.005
 hospital-0.005
 doctor-0.005
 sleep-0.005
 dentist-0.004
Token see
Token ID766
Feature activation-2.35e+00

Pos logit contributions
 sick+0.013
 hospital+0.012
 doctor+0.012
 sleep+0.012
 dentist+0.011
Neg logit contributions
bek-0.042
 rect-0.041
vered-0.040
heit-0.040
 angles-0.038
Token what
Token ID644
Feature activation+9.57e-01

Pos logit contributions
 sick+0.064
 hospital+0.062
 doctor+0.062
 dentist+0.061
 sleep+0.060
Neg logit contributions
 rect-0.104
bek-0.101
heit-0.099
vered-0.099
 angles-0.098
Token each
Token ID1123
Feature activation-6.84e+00

Pos logit contributions
 rect+0.040
bek+0.040
vered+0.038
heit+0.037
 angles+0.036
Neg logit contributions
 sick-0.029
 hospital-0.028
 doctor-0.028
 dentist-0.027
 sleep-0.027
Token new
Token ID649
Feature activation+9.12e-01

Pos logit contributions
 sick+0.162
 hospital+0.154
 doctor+0.154
 dentist+0.147
 sleep+0.142
Neg logit contributions
bek-0.351
 rect-0.343
vered-0.336
heit-0.334
 angles-0.315
Token day
Token ID1110
Feature activation-2.17e-01

Pos logit contributions
bek+0.047
heit+0.045
 rect+0.044
vered+0.043
ifying+0.041
Neg logit contributions
 sick-0.022
 hospital-0.022
 doctor-0.021
 sleep-0.021
 dentist-0.020
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
 hospital+0.006
 doctor+0.005
 sick+0.005
 dentist+0.005
 veterinarian+0.005
Neg logit contributions
 rect-0.010
bek-0.009
vered-0.009
 perfectly-0.009
 angles-0.009
Token night
Token ID1755
Feature activation-9.32e-01

Pos logit contributions
 sick+0.036
 hospital+0.034
 doctor+0.034
 sleep+0.033
 dentist+0.032
Neg logit contributions
bek-0.066
 rect-0.064
vered-0.062
heit-0.062
 angles-0.058
Token brought
Token ID3181
Feature activation-7.76e-01

Pos logit contributions
 dentist+0.018
 doctor+0.018
 hospital+0.018
 ambulance+0.017
 veterinarian+0.017
Neg logit contributions
 rect-0.049
ically-0.046
 perfectly-0.044
bek-0.043
 angles-0.043
Token
Token ID220
Feature activation+1.02e+00

Pos logit contributions
 doctor+0.010
 sick+0.010
 hospital+0.010
 dentist+0.010
 sleep+0.010
Neg logit contributions
bek-0.020
 rect-0.019
vered-0.019
heit-0.019
 angles-0.018
Token
Token ID198
Feature activation+2.53e-02

Pos logit contributions
bek+0.046
 rect+0.046
vered+0.045
heit+0.044
 angles+0.044
Neg logit contributions
 sick-0.026
 hospital-0.026
 dentist-0.024
 doctor-0.024
 sleep-0.024
Token
Token ID198
Feature activation-3.81e-01

Pos logit contributions
bek+0.001
 rect+0.001
vered+0.001
heit+0.001
 angles+0.001
Neg logit contributions
 sick-0.001
 hospital-0.001
 doctor-0.001
 dentist-0.000
 sleep-0.000
TokenWith
Token ID3152
Feature activation+2.43e+00

Pos logit contributions
 sick+0.014
 hospital+0.013
 doctor+0.013
 dentist+0.013
 sleep+0.013
Neg logit contributions
bek-0.014
 rect-0.014
vered-0.013
heit-0.013
 angles-0.013
Token his
Token ID465
Feature activation-3.30e-01

Pos logit contributions
bek+0.133
 rect+0.128
vered+0.124
heit+0.123
 angles+0.120
Neg logit contributions
 sick-0.049
 hospital-0.046
 doctor-0.046
 sleep-0.044
 dentist-0.043
Token tum
Token ID11814
Feature activation-6.81e+00

Pos logit contributions
 sick+0.008
 doctor+0.008
 hospital+0.008
 sleep+0.008
 dentist+0.008
Neg logit contributions
bek-0.016
 rect-0.016
vered-0.015
heit-0.015
 angles-0.015
Tokenmy
Token ID1820
Feature activation-4.73e-01

Pos logit contributions
 sick+0.191
 doctor+0.183
 hospital+0.182
 dentist+0.179
 sleep+0.174
Neg logit contributions
bek-0.307
 rect-0.304
heit-0.291
vered-0.290
 angles-0.277
Token satisfied
Token ID11378
Feature activation+1.92e-01

Pos logit contributions
 sick+0.012
 doctor+0.011
 hospital+0.010
 sleep+0.010
 dentist+0.010
Neg logit contributions
bek-0.024
 rect-0.023
heit-0.023
 angles-0.022
vered-0.022
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
 rect+0.009
bek+0.009
vered+0.009
heit+0.009
 angles+0.009
Neg logit contributions
 sick-0.005
 hospital-0.004
 doctor-0.004
 dentist-0.004
 sleep-0.004
Token the
Token ID262
Feature activation-2.69e+00

Pos logit contributions
 sick+0.034
 doctor+0.034
 hospital+0.033
 dentist+0.031
 sleep+0.031
Neg logit contributions
bek-0.081
 rect-0.079
vered-0.079
heit-0.078
 angles-0.075
Token fox
Token ID21831
Feature activation-1.27e+00

Pos logit contributions
 sick+0.068
 hospital+0.067
 doctor+0.065
 dentist+0.064
 sleep+0.063
Neg logit contributions
bek-0.125
 rect-0.123
heit-0.119
vered-0.118
 angles-0.116
Token was
Token ID373
Feature activation-7.03e-01

Pos logit contributions
 sick+0.034
 hospital+0.034
 doctor+0.034
 dentist+0.033
 sleep+0.032
Neg logit contributions
 rect-0.035
bek-0.035
vered-0.034
heit-0.033
 angles-0.031
Token feeling
Token ID4203
Feature activation-3.10e+00

Pos logit contributions
 dentist+0.025
 doctor+0.024
 hospital+0.024
iver+0.024
 cook+0.022
Neg logit contributions
 perfectly-0.021
bek-0.020
 rect-0.019
heit-0.018
rave-0.018
Token uncomfortable
Token ID12916
Feature activation-1.31e+00

Pos logit contributions
 dentist+0.105
 doctor+0.094
 hospital+0.093
 paramedics+0.091
 veterinarian+0.091
Neg logit contributions
 perfectly-0.108
 rect-0.102
vered-0.101
bek-0.093
heit-0.093
Token because
Token ID780
Feature activation-1.47e+00

Pos logit contributions
 dentist+0.030
 doctor+0.028
 hospital+0.028
 sick+0.028
 sleep+0.026
Neg logit contributions
 rect-0.061
bek-0.060
vered-0.060
 perfectly-0.059
 angles-0.056
Token his
Token ID465
Feature activation-1.25e+00

Pos logit contributions
 sick+0.041
 hospital+0.040
 doctor+0.040
 dentist+0.039
 sleep+0.039
Neg logit contributions
bek-0.062
 rect-0.062
vered-0.059
heit-0.059
 angles-0.058
Token tum
Token ID11814
Feature activation-6.80e+00

Pos logit contributions
 dentist+0.033
aster+0.033
 hospital+0.032
ores+0.031
 doctor+0.031
Neg logit contributions
vered-0.046
 perfectly-0.045
bek-0.044
heit-0.044
 fists-0.044
Tokenmy
Token ID1820
Feature activation-2.94e-01

Pos logit contributions
 hospital+0.232
 sick+0.229
 doctor+0.225
 sleep+0.219
 dentist+0.215
Neg logit contributions
bek-0.247
 rect-0.243
 angles-0.239
vered-0.233
heit-0.225
Token hurt
Token ID5938
Feature activation-4.19e+00

Pos logit contributions
 dentist+0.008
 hospital+0.008
 doctor+0.008
 veterinarian+0.007
 paramedics+0.007
Neg logit contributions
vered-0.013
 rect-0.012
 perfectly-0.012
bek-0.012
heit-0.012
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 sick+0.069
 hospital+0.067
 dentist+0.066
 doctor+0.066
 sleep+0.061
Neg logit contributions
bek-0.227
 rect-0.226
vered-0.222
heit-0.218
 angles-0.214
Token He
Token ID679
Feature activation+2.20e-01

Pos logit contributions
 sick+0.039
 hospital+0.037
 dentist+0.035
 doctor+0.035
 itch+0.034
Neg logit contributions
 rect-0.041
bek-0.038
Wow-0.038
vered-0.037
heit-0.036
Token walked
Token ID6807
Feature activation-1.41e+00

Pos logit contributions
vered+0.007
bek+0.007
heit+0.006
 rect+0.006
 perfectly+0.006
Neg logit contributions
 dentist-0.008
 doctor-0.008
 hospital-0.008
iver-0.008
 elderly-0.008
Token too
Token ID1165
Feature activation-2.96e+00

Pos logit contributions
 sick+0.060
 hospital+0.057
 doctor+0.055
 sleep+0.052
 dentist+0.051
Neg logit contributions
bek-0.119
 rect-0.112
vered-0.108
heit-0.108
 angles-0.104
Token many
Token ID867
Feature activation+1.81e+00

Pos logit contributions
 hospital+0.100
 doctor+0.100
 dentist+0.099
 doctors+0.097
 ambulance+0.097
Neg logit contributions
bek-0.104
vered-0.099
heit-0.099
 perfectly-0.097
 rect-0.096
Token and
Token ID290
Feature activation-1.56e-02

Pos logit contributions
bek+0.091
 rect+0.088
heit+0.085
vered+0.084
 angles+0.081
Neg logit contributions
 sick-0.044
 hospital-0.042
 doctor-0.042
 dentist-0.040
 sleep-0.040
Token got
Token ID1392
Feature activation-1.98e+00

Pos logit contributions
 sick+0.000
 hospital+0.000
 doctor+0.000
 dentist+0.000
 sleep+0.000
Neg logit contributions
bek-0.001
 rect-0.001
vered-0.001
heit-0.001
 angles-0.001
Token a
Token ID257
Feature activation+1.59e+00

Pos logit contributions
 dentist+0.038
 hospital+0.037
 doctor+0.037
 sleep+0.036
iver+0.032
Neg logit contributions
 rect-0.100
bek-0.100
vered-0.099
heit-0.096
 perfectly-0.095
Token tum
Token ID11814
Feature activation-6.76e+00

Pos logit contributions
bek+0.066
 rect+0.065
heit+0.061
vered+0.061
 angles+0.059
Neg logit contributions
 sick-0.048
 hospital-0.047
 doctor-0.047
 dentist-0.047
 sleep-0.045
Tokenmy
Token ID1820
Feature activation-3.20e-02

Pos logit contributions
 sick+0.218
 hospital+0.218
 doctor+0.212
 sleep+0.207
 dentist+0.204
Neg logit contributions
bek-0.268
 rect-0.260
 angles-0.249
vered-0.249
heit-0.242
Token ac
Token ID936
Feature activation-4.55e+00

Pos logit contributions
 hospital+0.001
 doctor+0.001
 sick+0.001
 dentist+0.001
 sleep+0.001
Neg logit contributions
bek-0.002
 rect-0.002
vered-0.002
heit-0.002
 perfectly-0.002
Tokenhe
Token ID258
Feature activation+1.04e+00

Pos logit contributions
 doctors+0.128
 sick+0.128
 hospital+0.125
 sleep+0.123
 doctor+0.123
Neg logit contributions
bek-0.196
vered-0.177
 globe-0.175
 perfectly-0.173
x-0.172
Token.
Token ID13
Feature activation-3.99e-01

Pos logit contributions
bek+0.056
 rect+0.055
vered+0.054
heit+0.053
 angles+0.052
Neg logit contributions
 sick-0.019
 hospital-0.018
 doctor-0.017
 dentist-0.017
 sleep-0.016
Token
Token ID198
Feature activation-2.24e-01

Pos logit contributions
 sick+0.012
 hospital+0.011
 doctor+0.011
 dentist+0.011
 sleep+0.011
Neg logit contributions
bek-0.017
 rect-0.017
vered-0.016
heit-0.016
 angles-0.016
Tokenista
Token ID12523
Feature activation+3.81e-01

Pos logit contributions
bek+0.008
 rect+0.007
heit+0.007
vered+0.007
 angles+0.006
Neg logit contributions
 sick-0.010
 hospital-0.010
 doctor-0.010
 dentist-0.010
 sleep-0.010
Token became
Token ID2627
Feature activation-1.29e+00

Pos logit contributions
bek+0.017
 rect+0.017
vered+0.016
heit+0.016
 angles+0.016
Neg logit contributions
 sick-0.010
 hospital-0.010
 doctor-0.010
 dentist-0.010
 sleep-0.010
Token sleepy
Token ID42023
Feature activation-7.75e-01

Pos logit contributions
 sick+0.035
 hospital+0.034
 doctor+0.034
 dentist+0.033
 sleep+0.032
Neg logit contributions
bek-0.059
 rect-0.058
vered-0.056
heit-0.055
 angles-0.054
Token and
Token ID290
Feature activation-9.87e-01

Pos logit contributions
 sick+0.018
 hospital+0.018
 dentist+0.018
 doctor+0.017
 itch+0.016
Neg logit contributions
 rect-0.035
bek-0.033
vered-0.033
 angles-0.032
 perfectly-0.032
Token her
Token ID607
Feature activation-7.35e-01

Pos logit contributions
 dentist+0.029
iver+0.028
 doctor+0.028
 hospital+0.028
 sick+0.027
Neg logit contributions
bek-0.039
 rect-0.038
heit-0.037
vered-0.037
 angles-0.036
Token tum
Token ID11814
Feature activation-6.75e+00

Pos logit contributions
 sick+0.019
 hospital+0.018
 dentist+0.018
 doctor+0.017
 sleep+0.017
Neg logit contributions
bek-0.034
 rect-0.034
vered-0.032
heit-0.032
 angles-0.031
Tokenmy
Token ID1820
Feature activation-1.68e-01

Pos logit contributions
 sick+0.220
 hospital+0.220
 doctor+0.212
 sleep+0.211
 dentist+0.206
Neg logit contributions
bek-0.264
 rect-0.256
 angles-0.250
vered-0.248
heit-0.238
Token began
Token ID2540
Feature activation-1.40e+00

Pos logit contributions
 hospital+0.004
 dentist+0.004
 doctor+0.004
 sick+0.004
 veterinarian+0.004
Neg logit contributions
vered-0.007
 rect-0.007
bek-0.007
 perfectly-0.007
heit-0.006
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
 dentist+0.038
 hospital+0.037
 doctor+0.036
 sick+0.035
?"+0.033
Neg logit contributions
 rect-0.059
bek-0.058
vered-0.057
 angles-0.056
 perfectly-0.054
Token g
Token ID308
Feature activation+9.42e-01

Pos logit contributions
 dentist+0.038
 doctors+0.035
 doctor+0.035
 hospital+0.034
 paramedics+0.033
Neg logit contributions
 rect-0.059
heit-0.058
 angles-0.056
 perfectly-0.056
vered-0.056
Tokenurg
Token ID3686
Feature activation-2.96e+00

Pos logit contributions
 rect+0.031
heit+0.029
 angles+0.029
 perfectly+0.028
vered+0.028
Neg logit contributions
 dentist-0.037
 doctor-0.036
 hospital-0.035
 veterinarian-0.035
 vet-0.034
Token to
Token ID284
Feature activation-1.38e+00

Pos logit contributions
 sick+0.007
 hospital+0.007
 doctor+0.007
 dentist+0.007
 sleep+0.007
Neg logit contributions
bek-0.022
 rect-0.022
heit-0.021
vered-0.021
 angles-0.021
Token be
Token ID307
Feature activation-3.22e+00

Pos logit contributions
 sick+0.037
 dentist+0.034
 tum+0.034
 veterinarian+0.033
 doctor+0.033
Neg logit contributions
 angles-0.054
 perfectly-0.054
 rect-0.054
bek-0.054
heit-0.053
Token able
Token ID1498
Feature activation+6.85e-01

Pos logit contributions
 hospital+0.096
 doctor+0.092
 itch+0.088
 veterinarian+0.087
 dentist+0.087
Neg logit contributions
bek-0.136
 rect-0.133
 perfectly-0.131
vered-0.124
 angles-0.121
Token to
Token ID284
Feature activation-1.41e+00

Pos logit contributions
bek+0.033
 rect+0.032
vered+0.031
heit+0.031
 angles+0.030
Neg logit contributions
 sick-0.016
 hospital-0.016
 doctor-0.016
 dentist-0.016
 sleep-0.015
Token gaze
Token ID17841
Feature activation-2.04e-01

Pos logit contributions
 dentist+0.040
 sick+0.038
 doctors+0.037
 doctor+0.036
 hospital+0.035
Neg logit contributions
heit-0.055
bek-0.055
 angles-0.054
 rect-0.054
vered-0.053
Token upon
Token ID2402
Feature activation+5.02e+00

Pos logit contributions
 sick+0.005
 hospital+0.005
 doctor+0.005
 dentist+0.005
 sleep+0.005
Neg logit contributions
bek-0.009
 rect-0.009
vered-0.009
heit-0.009
 angles-0.009
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
bek+0.246
 rect+0.243
heit+0.234
vered+0.234
 angles+0.228
Neg logit contributions
 sick-0.114
 hospital-0.111
 doctor-0.109
 dentist-0.107
 sleep-0.101
Token stadium
Token ID10308
Feature activation+2.77e+00

Pos logit contributions
 sick+0.033
 doctor+0.032
 hospital+0.032
 dentist+0.031
 sleep+0.030
Neg logit contributions
bek-0.045
 rect-0.044
vered-0.043
heit-0.042
 angles-0.039
Token and
Token ID290
Feature activation+8.59e-02

Pos logit contributions
bek+0.167
 rect+0.165
heit+0.163
vered+0.161
 angles+0.155
Neg logit contributions
 sick-0.036
 doctor-0.033
 hospital-0.033
 sleep-0.031
 dentist-0.030
Token now
Token ID783
Feature activation-1.60e+00

Pos logit contributions
bek+0.004
 rect+0.004
vered+0.004
heit+0.004
 angles+0.004
Neg logit contributions
 sick-0.002
 hospital-0.002
 doctor-0.002
 dentist-0.002
 sleep-0.002
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
 sick+0.041
 doctor+0.039
 hospital+0.039
 dentist+0.039
 sleep+0.036
Neg logit contributions
bek-0.074
 rect-0.074
vered-0.071
heit-0.071
 angles-0.069
Token understood
Token ID7247
Feature activation+5.35e-01

Pos logit contributions
 sick+0.019
 hospital+0.018
 doctor+0.018
 dentist+0.018
 sleep+0.017
Neg logit contributions
bek-0.027
 rect-0.026
vered-0.025
heit-0.025
 angles-0.024
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
bek+0.027
 rect+0.026
vered+0.026
heit+0.025
 angles+0.025
Neg logit contributions
 sick-0.012
 hospital-0.012
 doctor-0.012
 dentist-0.011
 sleep-0.011
Token He
Token ID679
Feature activation+8.58e-01

Pos logit contributions
bek+0.015
 rect+0.014
heit+0.014
vered+0.014
 angles+0.013
Neg logit contributions
 doctor-0.007
 sick-0.006
 hospital-0.006
 dentist-0.006
 sleep-0.006
Token said
Token ID531
Feature activation-8.64e-01

Pos logit contributions
bek+0.039
 rect+0.038
vered+0.037
heit+0.037
 angles+0.035
Neg logit contributions
 sick-0.024
 hospital-0.023
 doctor-0.023
 dentist-0.022
 sleep-0.022
Token,
Token ID11
Feature activation+9.46e-01

Pos logit contributions
 doctor+0.017
 sick+0.017
 hospital+0.016
 dentist+0.015
 sleep+0.014
Neg logit contributions
bek-0.047
 rect-0.045
heit-0.044
 angles-0.044
vered-0.044
Token "
Token ID366
Feature activation+4.33e+00

Pos logit contributions
bek+0.047
vered+0.045
heit+0.045
 angles+0.045
 rect+0.044
Neg logit contributions
 doctor-0.024
 sick-0.023
 hospital-0.021
 dentist-0.021
 cook-0.020
TokenThat
Token ID2504
Feature activation+1.90e+00

Pos logit contributions
heit+0.185
 rect+0.181
vered+0.167
bek+0.167
 angles+0.165
Neg logit contributions
 doctor-0.133
 sick-0.129
 dentist-0.124
 bat-0.119
 doctors-0.119
Token's
Token ID338
Feature activation-1.34e+00

Pos logit contributions
 rect+0.070
bek+0.070
vered+0.066
heit+0.066
 angles+0.063
Neg logit contributions
 sick-0.066
 hospital-0.064
 doctor-0.063
 dentist-0.063
 sleep-0.063
Token okay
Token ID8788
Feature activation+9.72e-01

Pos logit contributions
 sick+0.031
 doctor+0.028
 hospital+0.028
 dentist+0.027
 sleep+0.026
Neg logit contributions
 rect-0.071
vered-0.069
bek-0.069
heit-0.066
 angles-0.066
Token.
Token ID13
Feature activation+2.82e-02

Pos logit contributions
bek+0.053
 rect+0.052
vered+0.051
heit+0.050
 angles+0.049
Neg logit contributions
 sick-0.019
 doctor-0.018
 hospital-0.018
 dentist-0.017
 sleep-0.016
Token I
Token ID314
Feature activation-3.64e-01

Pos logit contributions
bek+0.001
heit+0.001
 rect+0.001
vered+0.001
 angles+0.001
Neg logit contributions
 doctor-0.001
 sleep-0.001
?"-0.001
 hospital-0.001
 dentist-0.001
Token how
Token ID703
Feature activation+8.30e-02

Pos logit contributions
bek+0.081
 rect+0.079
vered+0.076
heit+0.076
 angles+0.073
Neg logit contributions
 sick-0.050
 hospital-0.048
 doctor-0.048
 sleep-0.047
 dentist-0.046
Token much
Token ID881
Feature activation-1.65e+00

Pos logit contributions
bek+0.003
 rect+0.003
vered+0.003
heit+0.003
 perfectly+0.003
Neg logit contributions
 hospital-0.002
 dentist-0.002
 doctor-0.002
 veterinarian-0.002
 sick-0.002
Token they
Token ID484
Feature activation-1.03e+00

Pos logit contributions
 sick+0.047
 hospital+0.046
 doctor+0.046
 dentist+0.044
 sleep+0.044
Neg logit contributions
bek-0.073
 rect-0.072
vered-0.069
heit-0.069
 angles-0.066
Token loved
Token ID6151
Feature activation+3.20e-01

Pos logit contributions
 doctor+0.039
 hospital+0.038
 dentist+0.038
 ambulance+0.036
 sick+0.036
Neg logit contributions
bek-0.034
 rect-0.034
vered-0.033
 perfectly-0.032
heit-0.031
Token spending
Token ID4581
Feature activation-1.57e+00

Pos logit contributions
bek+0.019
heit+0.019
 rect+0.018
vered+0.018
 angles+0.018
Neg logit contributions
 sick-0.005
 sleep-0.005
 doctor-0.005
 hospital-0.004
 dentist-0.004
Token time
Token ID640
Feature activation+4.21e+00

Pos logit contributions
 sick+0.038
 hospital+0.036
 doctor+0.036
 dentist+0.035
 sleep+0.034
Neg logit contributions
bek-0.077
 rect-0.076
vered-0.073
heit-0.073
 angles-0.071
Token together
Token ID1978
Feature activation+3.38e+00

Pos logit contributions
 rect+0.119
bek+0.118
vered+0.111
heit+0.107
 angles+0.103
Neg logit contributions
 hospital-0.184
 sick-0.184
 doctor-0.183
 dentist-0.181
 sleep-0.174
Token in
Token ID287
Feature activation+1.48e+00

Pos logit contributions
 rect+0.198
bek+0.197
vered+0.192
heit+0.188
 angles+0.185
Neg logit contributions
 sick-0.046
 hospital-0.044
 doctor-0.043
 dentist-0.042
 sleep-0.036
Token the
Token ID262
Feature activation-8.38e-01

Pos logit contributions
 rect+0.060
bek+0.058
vered+0.058
 angles+0.057
 perfectly+0.057
Neg logit contributions
 dentist-0.043
 doctor-0.041
 hospital-0.041
 itch-0.040
 sick-0.040
Token garden
Token ID11376
Feature activation+5.90e-01

Pos logit contributions
 sick+0.022
iver+0.021
 paramedics+0.021
 dentist+0.021
 itch+0.020
Neg logit contributions
 rect-0.031
bek-0.030
 angles-0.029
Wow-0.029
 perfectly-0.028
Token.
Token ID13
Feature activation-2.14e-01

Pos logit contributions
 rect+0.034
bek+0.034
vered+0.033
 angles+0.032
 perfectly+0.031
Neg logit contributions
 sick-0.008
 hospital-0.008
 dentist-0.008
 doctor-0.008
 itch-0.007
TokenM
Token ID44
Feature activation-4.10e-01

Pos logit contributions
 sick+0.006
 doctor+0.006
 hospital+0.006
 dentist+0.005
 sleep+0.005
Neg logit contributions
bek-0.007
 rect-0.007
vered-0.007
heit-0.007
 angles-0.006
Tokenagg
Token ID9460
Feature activation-1.62e+00

Pos logit contributions
 sick+0.014
 hospital+0.014
 doctor+0.013
 sleep+0.013
 dentist+0.013
Neg logit contributions
bek-0.016
 rect-0.016
vered-0.015
heit-0.015
 angles-0.014
Tokenie
Token ID494
Feature activation-4.22e-01

Pos logit contributions
 sick+0.043
 doctor+0.041
 hospital+0.041
 sleep+0.039
 dentist+0.039
Neg logit contributions
bek-0.076
 rect-0.074
heit-0.071
vered-0.070
 angles-0.068
Token shouted
Token ID17293
Feature activation-4.27e-01

Pos logit contributions
 sick+0.012
 hospital+0.012
 doctor+0.012
 dentist+0.012
 sleep+0.011
Neg logit contributions
bek-0.018
 rect-0.018
vered-0.017
heit-0.017
 angles-0.017
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 sick+0.010
 doctor+0.010
 hospital+0.010
 sleep+0.009
 dentist+0.009
Neg logit contributions
bek-0.022
 rect-0.021
heit-0.020
vered-0.020
 angles-0.020
Token "
Token ID366
Feature activation+4.43e+00

Pos logit contributions
bek+0.069
 rect+0.067
vered+0.066
heit+0.065
 angles+0.064
Neg logit contributions
 sick-0.043
 doctor-0.042
 hospital-0.040
 sleep-0.039
 bat-0.038
TokenBe
Token ID3856
Feature activation-1.88e+00

Pos logit contributions
 rect+0.180
heit+0.178
bek+0.176
vered+0.170
 angles+0.168
Neg logit contributions
 doctor-0.139
 dentist-0.133
 sick-0.131
 bat-0.130
 veterinarian-0.126
Token careful
Token ID8161
Feature activation+1.11e+00

Pos logit contributions
 sick+0.057
 hospital+0.055
 doctor+0.055
 dentist+0.054
 sleep+0.053
Neg logit contributions
bek-0.081
 rect-0.079
vered-0.076
heit-0.076
 angles-0.073
Token brother
Token ID3956
Feature activation+4.01e-01

Pos logit contributions
bek+0.049
 rect+0.049
vered+0.047
heit+0.047
 angles+0.046
Neg logit contributions
 sick-0.030
 hospital-0.030
 dentist-0.029
 doctor-0.028
 sleep-0.027
Token!"
Token ID2474
Feature activation-1.11e+00

Pos logit contributions
bek+0.019
 rect+0.018
vered+0.018
heit+0.018
 angles+0.017
Neg logit contributions
 sick-0.010
 hospital-0.010
 doctor-0.010
 dentist-0.010
 sleep-0.009
Token But
Token ID887
Feature activation+1.67e+00

Pos logit contributions
 doctor+0.026
 sick+0.024
 hospital+0.023
 sleep+0.023
.+0.021
Neg logit contributions
bek-0.060
heit-0.058
 rect-0.058
vered-0.056
 angles-0.055
Token mom
Token ID1995
Feature activation-1.14e+00

Pos logit contributions
bek+0.095
 rect+0.095
heit+0.091
vered+0.091
 angles+0.089
Neg logit contributions
 sick-0.033
 hospital-0.032
 doctor-0.031
 dentist-0.031
 sleep-0.029
Token and
Token ID290
Feature activation+8.93e-01

Pos logit contributions
 sick+0.029
 hospital+0.028
 doctor+0.027
 dentist+0.027
 sleep+0.026
Neg logit contributions
bek-0.054
 rect-0.053
vered-0.051
heit-0.050
 angles-0.049
Token dad
Token ID9955
Feature activation-6.29e-01

Pos logit contributions
 rect+0.039
bek+0.039
heit+0.036
 angles+0.035
vered+0.035
Neg logit contributions
 sick-0.023
iver-0.022
 hospital-0.022
 dentist-0.021
 doctor-0.021
Token and
Token ID290
Feature activation+6.21e-01

Pos logit contributions
 sick+0.010
 hospital+0.009
 doctor+0.009
 dentist+0.009
 doctors+0.008
Neg logit contributions
bek-0.035
 rect-0.035
vered-0.034
heit-0.034
 angles-0.033
Token say
Token ID910
Feature activation-1.23e+00

Pos logit contributions
bek+0.028
 rect+0.028
heit+0.026
vered+0.026
 angles+0.025
Neg logit contributions
iver-0.015
 sick-0.015
 dentist-0.014
 hospital-0.014
 doctor-0.014
Token "
Token ID366
Feature activation+4.70e+00

Pos logit contributions
 sick+0.039
 doctor+0.037
 hospital+0.037
 dentist+0.036
 sleep+0.036
Neg logit contributions
bek-0.052
 rect-0.051
vered-0.049
heit-0.049
 angles-0.047
Tokenthank
Token ID40716
Feature activation-1.41e+00

Pos logit contributions
bek+0.177
 rect+0.175
heit+0.168
vered+0.167
 angles+0.161
Neg logit contributions
 sick-0.165
 doctor-0.159
 hospital-0.159
 dentist-0.156
 sleep-0.154
Token you
Token ID345
Feature activation+2.91e-01

Pos logit contributions
 doctor+0.044
 sick+0.044
 hospital+0.042
 sleep+0.041
?"+0.041
Neg logit contributions
bek-0.059
 rect-0.057
heit-0.056
vered-0.056
 angles-0.054
Token for
Token ID329
Feature activation-2.09e+00

Pos logit contributions
bek+0.013
 rect+0.013
heit+0.012
vered+0.012
 angles+0.012
Neg logit contributions
 sick-0.008
 hospital-0.008
 doctor-0.007
 dentist-0.007
 sleep-0.007
Token taking
Token ID2263
Feature activation-1.06e-01

Pos logit contributions
 sick+0.071
 hospital+0.069
 doctor+0.069
 dentist+0.068
 sleep+0.066
Neg logit contributions
bek-0.081
 rect-0.080
heit-0.076
vered-0.076
 angles-0.073
Token us
Token ID514
Feature activation-9.18e-01

Pos logit contributions
 sick+0.003
 sleep+0.003
 hospital+0.003
 doctor+0.003
 doctors+0.003
Neg logit contributions
bek-0.005
 rect-0.005
vered-0.005
heit-0.005
 angles-0.005
Token she
Token ID673
Feature activation-3.94e-01

Pos logit contributions
 sick+0.016
 hospital+0.016
 doctor+0.015
 dentist+0.015
 sleep+0.015
Neg logit contributions
bek-0.028
 rect-0.028
vered-0.027
heit-0.027
 angles-0.026
Token went
Token ID1816
Feature activation-1.48e+00

Pos logit contributions
 sick+0.012
 hospital+0.012
 dentist+0.011
 doctor+0.011
 sleep+0.011
Neg logit contributions
bek-0.017
 rect-0.016
vered-0.015
heit-0.015
 perfectly-0.015
Token back
Token ID736
Feature activation-1.17e+00

Pos logit contributions
 sick+0.045
 dentist+0.045
 itch+0.044
iver+0.042
 hospital+0.042
Neg logit contributions
bek-0.056
vered-0.053
 rect-0.053
 perfectly-0.053
heit-0.052
Token to
Token ID284
Feature activation+1.71e-01

Pos logit contributions
 sick+0.033
 dentist+0.032
 hospital+0.031
 itch+0.031
 doctor+0.031
Neg logit contributions
bek-0.049
vered-0.048
 rect-0.046
 perfectly-0.045
heit-0.045
Token gently
Token ID15165
Feature activation+1.32e+00

Pos logit contributions
bek+0.007
 rect+0.007
vered+0.007
heit+0.007
 angles+0.007
Neg logit contributions
 sick-0.004
 dentist-0.004
 doctor-0.004
 hospital-0.004
 doctors-0.004
Token washing
Token ID20518
Feature activation+7.44e-01

Pos logit contributions
bek+0.057
 rect+0.057
vered+0.057
heit+0.054
 angles+0.053
Neg logit contributions
 sick-0.035
 dentist-0.034
 doctor-0.033
 hospital-0.033
 doctors-0.032
Token her
Token ID607
Feature activation+1.58e+00

Pos logit contributions
bek+0.042
 rect+0.042
heit+0.040
vered+0.039
 angles+0.038
Neg logit contributions
 sick-0.014
 doctor-0.014
 hospital-0.013
 sleep-0.013
 bat-0.012
Token delicate
Token ID19217
Feature activation+2.18e+00

Pos logit contributions
bek+0.062
 rect+0.061
vered+0.059
heit+0.058
 angles+0.056
Neg logit contributions
 sick-0.053
 hospital-0.051
 doctor-0.051
 dentist-0.051
 sleep-0.049
Token laundry
Token ID25724
Feature activation+1.16e+00

Pos logit contributions
bek+0.088
 rect+0.088
vered+0.086
heit+0.084
 angles+0.082
Neg logit contributions
 sick-0.066
 dentist-0.064
 hospital-0.064
 doctor-0.063
 sleep-0.060
Token.
Token ID13
Feature activation+3.05e-01

Pos logit contributions
bek+0.058
 rect+0.057
vered+0.055
heit+0.055
 angles+0.053
Neg logit contributions
 sick-0.026
 hospital-0.025
 doctor-0.025
 dentist-0.024
 sleep-0.024
Token Everyone
Token ID11075
Feature activation+1.25e+00

Pos logit contributions
bek+0.015
 rect+0.015
vered+0.015
heit+0.014
 angles+0.014
Neg logit contributions
 sick-0.007
 hospital-0.007
 doctor-0.007
 dentist-0.006
 sleep-0.006
Token'm
Token ID1101
Feature activation-2.83e-01

Pos logit contributions
 dentist+0.001
 hospital+0.001
 doctor+0.001
 sick+0.001
 doctors+0.001
Neg logit contributions
bek-0.001
vered-0.001
 rect-0.001
heit-0.001
 angles-0.001
Token sorry
Token ID7926
Feature activation-5.52e-01

Pos logit contributions
 dentist+0.008
 hospital+0.008
 doctor+0.008
 sleep+0.008
 sick+0.008
Neg logit contributions
 rect-0.011
bek-0.011
 perfectly-0.010
heit-0.010
 angles-0.010
Token,
Token ID11
Feature activation-3.10e-01

Pos logit contributions
 hospital+0.017
 dentist+0.017
 sick+0.017
iver+0.016
 rested+0.016
Neg logit contributions
 rect-0.021
 perfectly-0.020
heit-0.019
bek-0.019
 angles-0.019
Token mom
Token ID1995
Feature activation-2.30e+00

Pos logit contributions
 sick+0.007
 doctor+0.007
 hospital+0.007
 dentist+0.007
 sleep+0.007
Neg logit contributions
bek-0.015
 rect-0.015
vered-0.015
heit-0.015
 angles-0.014
Token."
Token ID526
Feature activation-8.43e-01

Pos logit contributions
 hospital+0.041
 sick+0.039
 dentist+0.038
 sleep+0.038
 doctor+0.037
Neg logit contributions
bek-0.118
 rect-0.116
 angles-0.114
vered-0.113
 perfectly-0.111
Token
Token ID198
Feature activation+3.07e-01

Pos logit contributions
 doctor+0.016
 sick+0.016
 hospital+0.015
 sleep+0.015
 dentist+0.014
Neg logit contributions
bek-0.047
 rect-0.046
heit-0.046
vered-0.045
 angles-0.044
Token
Token ID198
Feature activation-6.76e-02

Pos logit contributions
bek+0.013
heit+0.013
 rect+0.013
vered+0.013
Alex+0.012
Neg logit contributions
 sick-0.008
 hospital-0.008
 dentist-0.008
 itch-0.008
 doctors-0.008
TokenMom
Token ID29252
Feature activation-1.66e+00

Pos logit contributions
 sick+0.002
 doctor+0.002
 hospital+0.002
 dentist+0.002
 sleep+0.002
Neg logit contributions
bek-0.003
 rect-0.003
vered-0.003
heit-0.003
 angles-0.003
Token sigh
Token ID19680
Feature activation-2.59e+00

Pos logit contributions
 sick+0.062
 hospital+0.058
 sleep+0.057
 doctor+0.057
 dentist+0.056
Neg logit contributions
bek-0.058
 rect-0.056
vered-0.054
heit-0.054
 angles-0.052
Tokens
Token ID82
Feature activation+1.09e+00

Pos logit contributions
 dentist+0.119
 hospital+0.116
 doctor+0.115
 sick+0.114
 sleep+0.113
Neg logit contributions
 rect-0.068
vered-0.066
bek-0.065
heit-0.062
 angles-0.059
Token.
Token ID13
Feature activation+4.02e-01

Pos logit contributions
bek+0.055
vered+0.054
 rect+0.054
 perfectly+0.053
heit+0.053
Neg logit contributions
 dentist-0.021
 hospital-0.020
 sick-0.019
 doctor-0.019
 doctors-0.018
Token He
Token ID679
Feature activation+1.35e+00

Pos logit contributions
bek+0.005
 rect+0.005
vered+0.005
heit+0.005
 angles+0.005
Neg logit contributions
 sick-0.003
 hospital-0.002
 doctor-0.002
 dentist-0.002
 sleep-0.002
Token says
Token ID1139
Feature activation-6.88e-01

Pos logit contributions
bek+0.055
 rect+0.050
vered+0.049
heit+0.048
 angles+0.046
Neg logit contributions
 sick-0.048
 hospital-0.047
 doctor-0.046
 sleep-0.045
 dentist-0.045
Token he
Token ID339
Feature activation+5.38e-01

Pos logit contributions
 sick+0.016
 doctor+0.016
 hospital+0.016
 dentist+0.015
 sleep+0.015
Neg logit contributions
bek-0.035
 rect-0.033
heit-0.033
vered-0.033
 angles-0.032
Token is
Token ID318
Feature activation+1.09e+00

Pos logit contributions
bek+0.016
 rect+0.016
vered+0.016
heit+0.015
 angles+0.014
Neg logit contributions
 sick-0.022
 hospital-0.022
 doctor-0.022
 dentist-0.021
 sleep-0.021
Token on
Token ID319
Feature activation+1.97e+00

Pos logit contributions
 rect+0.049
bek+0.048
heit+0.046
vered+0.045
 angles+0.045
Neg logit contributions
 hospital-0.028
 sick-0.028
 doctor-0.027
 dentist-0.027
 sleep-0.026
Token the
Token ID262
Feature activation+2.25e-01

Pos logit contributions
 rect+0.093
bek+0.093
vered+0.090
heit+0.089
 angles+0.087
Neg logit contributions
 sick-0.047
 hospital-0.045
 doctor-0.045
 dentist-0.043
 sleep-0.042
Token sun
Token ID4252
Feature activation-7.03e-02

Pos logit contributions
bek+0.011
vered+0.010
 rect+0.010
heit+0.010
 angles+0.009
Neg logit contributions
 hospital-0.006
 sick-0.006
 doctor-0.006
 sleep-0.006
 dentist-0.006
Token.
Token ID13
Feature activation-3.62e-03

Pos logit contributions
 sick+0.001
 hospital+0.001
 doctor+0.001
 dentist+0.001
 sleep+0.001
Neg logit contributions
bek-0.004
 rect-0.004
heit-0.004
vered-0.004
 angles-0.004
Token
Token ID198
Feature activation+2.76e-01

Pos logit contributions
 sick+0.000
 hospital+0.000
 doctor+0.000
 dentist+0.000
 sleep+0.000
Neg logit contributions
bek-0.000
 rect-0.000
vered-0.000
heit-0.000
 angles-0.000
Token
Token ID198
Feature activation-2.61e-01

Pos logit contributions
 rect+0.010
bek+0.010
vered+0.010
heit+0.010
 angles+0.010
Neg logit contributions
 sick-0.008
 hospital-0.008
 itch-0.008
 dentist-0.008
 doctors-0.008
TokenHe
Token ID1544
Feature activation+3.74e-01

Pos logit contributions
 sick+0.008
 hospital+0.008
 doctor+0.007
 dentist+0.007
 sleep+0.007
Neg logit contributions
bek-0.011
 rect-0.011
vered-0.011
heit-0.011
 angles-0.010
Token!"
Token ID2474
Feature activation-1.08e+00

Pos logit contributions
bek+0.140
 rect+0.130
vered+0.125
heit+0.124
 angles+0.122
Neg logit contributions
 sick-0.055
 sleep-0.050
 hospital-0.050
?"-0.049
 doctor-0.049
Token Tim
Token ID5045
Feature activation+1.39e+00

Pos logit contributions
 sick+0.028
 doctor+0.028
 hospital+0.028
 dentist+0.028
 sleep+0.026
Neg logit contributions
bek-0.050
 rect-0.049
vered-0.047
heit-0.047
 angles-0.045
Tokenmy
Token ID1820
Feature activation+1.11e-01

Pos logit contributions
bek+0.062
 rect+0.061
vered+0.059
heit+0.058
 angles+0.056
Neg logit contributions
 sick-0.039
 hospital-0.038
 doctor-0.038
 dentist-0.037
 sleep-0.036
Token exclaimed
Token ID33552
Feature activation-7.20e-01

Pos logit contributions
bek+0.005
 rect+0.005
vered+0.005
heit+0.005
 angles+0.005
Neg logit contributions
 sick-0.003
 hospital-0.003
 doctor-0.003
 dentist-0.003
 sleep-0.003
Token.
Token ID13
Feature activation+5.84e-01

Pos logit contributions
 dentist+0.012
 hospital+0.011
 sick+0.010
 doctor+0.010
 itch+0.009
Neg logit contributions
 rect-0.039
bek-0.039
vered-0.039
 perfectly-0.037
 angles-0.037
Token
Token ID220
Feature activation+1.62e+00

Pos logit contributions
bek+0.021
 rect+0.021
vered+0.020
heit+0.020
 angles+0.019
Neg logit contributions
 sick-0.021
 hospital-0.021
 doctor-0.021
 dentist-0.020
 sleep-0.020
Token
Token ID198
Feature activation+1.83e-01

Pos logit contributions
 rect+0.059
vered+0.058
bek+0.058
heit+0.057
 angles+0.057
Neg logit contributions
 hospital-0.051
 dentist-0.050
 itch-0.049
 sick-0.049
 sleep-0.047
Token
Token ID198
Feature activation-3.04e-01

Pos logit contributions
bek+0.008
 rect+0.008
heit+0.007
vered+0.007
 angles+0.007
Neg logit contributions
 sick-0.005
 hospital-0.005
 itch-0.005
 dentist-0.005
 doctors-0.005
Token"
Token ID1
Feature activation+1.98e+00

Pos logit contributions
 sick+0.009
 hospital+0.009
 doctor+0.009
 dentist+0.009
 sleep+0.009
Neg logit contributions
bek-0.013
 rect-0.013
vered-0.012
heit-0.012
 angles-0.012
TokenDon
Token ID3987
Feature activation+2.41e+00

Pos logit contributions
heit+0.081
vered+0.076
bek+0.075
 angles+0.074
osaurs+0.072
Neg logit contributions
 sick-0.065
 bat-0.064
 doctor-0.062
 elderly-0.061
 hurt-0.059
Token't
Token ID470
Feature activation-1.80e+00

Pos logit contributions
bek+0.082
 rect+0.079
heit+0.078
vered+0.074
rave+0.071
Neg logit contributions
 sick-0.092
 hospital-0.089
 dentist-0.088
 sleep-0.087
 doctor-0.086
Token very
Token ID845
Feature activation-8.51e-02

Pos logit contributions
 dentist+0.088
 doctor+0.078
 hospital+0.075
 veterinarian+0.075
 ambulance+0.073
Neg logit contributions
vered-0.121
 perfectly-0.120
 rect-0.117
heit-0.112
 angles-0.112
Token sick
Token ID6639
Feature activation-2.89e+00

Pos logit contributions
 dentist+0.003
 veterinarian+0.003
 doctor+0.002
 ambulance+0.002
 doctors+0.002
Neg logit contributions
 perfectly-0.003
heit-0.003
vered-0.003
 angles-0.003
 rect-0.003
Token.
Token ID13
Feature activation-3.95e-01

Pos logit contributions
 sick+0.044
 dentist+0.042
 hospital+0.041
 doctor+0.039
 sleep+0.036
Neg logit contributions
 rect-0.159
vered-0.154
bek-0.153
 perfectly-0.150
 angles-0.149
Token
Token ID198
Feature activation-2.19e-02

Pos logit contributions
 doctor+0.009
 sick+0.009
 hospital+0.009
 dentist+0.009
 sleep+0.009
Neg logit contributions
 rect-0.020
bek-0.020
vered-0.019
heit-0.019
 angles-0.018
Token
Token ID198
Feature activation-3.91e-01

Pos logit contributions
 sick+0.001
 hospital+0.001
 doctor+0.000
 dentist+0.000
 itch+0.000
Neg logit contributions
bek-0.001
vered-0.001
heit-0.001
 rect-0.001
 angles-0.001
TokenMary
Token ID24119
Feature activation+7.34e-01

Pos logit contributions
 sick+0.016
 hospital+0.016
 doctor+0.016
 dentist+0.016
 sleep+0.015
Neg logit contributions
bek-0.012
 rect-0.012
vered-0.011
heit-0.011
 angles-0.011
Token felt
Token ID2936
Feature activation-1.95e+00

Pos logit contributions
bek+0.030
 rect+0.029
vered+0.029
heit+0.028
 angles+0.027
Neg logit contributions
 sick-0.023
 hospital-0.022
 doctor-0.022
 dentist-0.022
 sleep-0.021
Token very
Token ID845
Feature activation-1.10e-01

Pos logit contributions
 dentist+0.060
 cook+0.053
 doctor+0.053
 sleep+0.051
 elderly+0.050
Neg logit contributions
vered-0.076
 perfectly-0.071
heit-0.070
 rect-0.069
bek-0.066
Token helpless
Token ID21144
Feature activation+7.08e-01

Pos logit contributions
 dentist+0.004
 veterinarian+0.003
 doctor+0.003
 ambulance+0.003
 paramedics+0.003
Neg logit contributions
heit-0.004
vered-0.004
 perfectly-0.004
 angles-0.004
 index-0.003
Token and
Token ID290
Feature activation-8.08e-01

Pos logit contributions
 rect+0.036
bek+0.036
vered+0.035
heit+0.034
 angles+0.034
Neg logit contributions
 sick-0.014
 hospital-0.014
 doctor-0.014
 dentist-0.014
 sleep-0.013
Token realized
Token ID6939
Feature activation-3.37e-01

Pos logit contributions
 sick+0.024
 doctor+0.023
 hospital+0.023
 dentist+0.023
 sleep+0.022
Neg logit contributions
bek-0.034
 rect-0.034
vered-0.033
heit-0.033
 angles-0.031
Token as
Token ID355
Feature activation-1.38e+00

Pos logit contributions
 sick+0.033
 hospital+0.032
 doctor+0.032
 dentist+0.031
 sleep+0.030
Neg logit contributions
bek-0.052
 rect-0.052
vered-0.050
heit-0.049
 angles-0.048
Token they
Token ID484
Feature activation-6.77e-01

Pos logit contributions
 sick+0.036
 doctor+0.034
 hospital+0.032
 dentist+0.032
 sleep+0.031
Neg logit contributions
bek-0.067
 rect-0.067
heit-0.065
vered-0.064
 angles-0.062
Token walked
Token ID6807
Feature activation-1.20e+00

Pos logit contributions
 sick+0.021
 hospital+0.021
 doctor+0.021
 dentist+0.021
 sleep+0.019
Neg logit contributions
bek-0.027
vered-0.026
 rect-0.026
heit-0.025
 angles-0.024
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
 sick+0.035
 doctor+0.034
 hospital+0.034
 sleep+0.033
 dentist+0.031
Neg logit contributions
bek-0.054
 rect-0.053
heit-0.051
vered-0.051
 angles-0.048
Token That
Token ID1320
Feature activation+1.81e+00

Pos logit contributions
 doctor+0.004
 sick+0.004
 hospital+0.004
 sleep+0.004
 dentist+0.003
Neg logit contributions
bek-0.007
 rect-0.007
heit-0.007
vered-0.007
 angles-0.006
Token made
Token ID925
Feature activation-3.50e+00

Pos logit contributions
bek+0.079
 rect+0.077
heit+0.075
vered+0.074
 angles+0.071
Neg logit contributions
 sick-0.054
 hospital-0.053
 doctor-0.053
 dentist-0.051
 sleep-0.051
Token Sandy
Token ID17581
Feature activation+4.68e-01

Pos logit contributions
 sick+0.081
 doctor+0.079
 hospital+0.078
 sleep+0.075
 dentist+0.071
Neg logit contributions
bek-0.186
 rect-0.182
vered-0.173
heit-0.172
 angles-0.165
Token happy
Token ID3772
Feature activation+1.64e+00

Pos logit contributions
bek+0.022
 rect+0.022
vered+0.022
heit+0.021
 angles+0.020
Neg logit contributions
 sick-0.011
 hospital-0.011
 dentist-0.011
 doctor-0.011
 sleep-0.010
Token.
Token ID13
Feature activation-7.26e-01

Pos logit contributions
bek+0.099
 rect+0.097
heit+0.095
vered+0.094
 angles+0.091
Neg logit contributions
 sick-0.023
 doctor-0.022
 hospital-0.022
 sleep-0.020
 dentist-0.020
Token They
Token ID1119
Feature activation+3.75e-01

Pos logit contributions
 sick+0.018
 doctor+0.018
 hospital+0.018
 dentist+0.017
 sleep+0.017
Neg logit contributions
bek-0.034
 rect-0.034
vered-0.033
heit-0.033
 angles-0.031
Token paused
Token ID24487
Feature activation+1.49e+00

Pos logit contributions
bek+0.016
 rect+0.015
vered+0.015
heit+0.015
 angles+0.014
Neg logit contributions
 sick-0.011
 doctor-0.011
 hospital-0.011
 dentist-0.011
 sleep-0.010
Token stranger
Token ID16195
Feature activation-2.04e-01

Pos logit contributions
 sick+0.074
 sleep+0.071
iver+0.068
 bat+0.067
 hospital+0.067
Neg logit contributions
bek-0.113
heit-0.113
 rect-0.110
vered-0.108
 angles-0.107
Token said
Token ID531
Feature activation-1.58e+00

Pos logit contributions
 sick+0.005
 hospital+0.005
 doctor+0.005
 dentist+0.005
 sleep+0.005
Neg logit contributions
bek-0.009
 rect-0.009
vered-0.009
heit-0.009
 angles-0.009
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
 sick+0.041
 hospital+0.039
 doctor+0.038
 dentist+0.038
 sleep+0.038
Neg logit contributions
bek-0.073
 rect-0.072
vered-0.070
heit-0.069
 angles-0.066
Token "
Token ID366
Feature activation+3.89e+00

Pos logit contributions
bek+0.009
 rect+0.009
vered+0.009
heit+0.008
 angles+0.008
Neg logit contributions
 sick-0.007
 doctor-0.007
 hospital-0.006
 dentist-0.006
 sleep-0.006
TokenIt
Token ID1026
Feature activation-3.95e-01

Pos logit contributions
bek+0.141
 rect+0.135
vered+0.131
heit+0.129
 angles+0.123
Neg logit contributions
 sick-0.143
 hospital-0.141
 doctor-0.139
 sleep-0.138
 dentist-0.137
Token's
Token ID338
Feature activation-1.87e-01

Pos logit contributions
 sick+0.012
 hospital+0.012
 doctor+0.012
 sleep+0.011
 dentist+0.011
Neg logit contributions
bek-0.017
 rect-0.017
vered-0.016
heit-0.016
 angles-0.016
Token okay
Token ID8788
Feature activation+1.43e+00

Pos logit contributions
 doctor+0.006
 hospital+0.006
 dentist+0.006
 sick+0.005
 cook+0.005
Neg logit contributions
bek-0.007
heit-0.007
 perfectly-0.007
 rect-0.007
vered-0.007
Token,
Token ID11
Feature activation-8.29e-01

Pos logit contributions
bek+0.066
 rect+0.063
vered+0.062
heit+0.062
 perfectly+0.060
Neg logit contributions
 sick-0.035
 hospital-0.035
 dentist-0.034
 doctor-0.033
 sleep-0.032
Token I
Token ID314
Feature activation-5.35e-01

Pos logit contributions
 sick+0.024
 doctor+0.023
 hospital+0.023
 dentist+0.023
 sleep+0.022
Neg logit contributions
bek-0.037
 rect-0.036
vered-0.036
heit-0.035
 angles-0.033
Token just
Token ID655
Feature activation+4.51e-01

Pos logit contributions
 dentist+0.022
 doctors+0.021
 hospital+0.021
 elderly+0.021
 sick+0.021
Neg logit contributions
bek-0.015
vered-0.015
heit-0.014
 perfectly-0.013
 angles-0.013
Token want
Token ID765
Feature activation-3.01e-01

Pos logit contributions
bek+0.022
 rect+0.021
vered+0.021
heit+0.021
 angles+0.020
Neg logit contributions
 dentist-0.010
 sick-0.010
 doctor-0.010
 hospital-0.010
 doctors-0.009
Token it
Token ID340
Feature activation-2.49e+00

Pos logit contributions
 sick+0.079
 hospital+0.077
 doctor+0.076
 dentist+0.076
 sleep+0.074
Neg logit contributions
 rect-0.120
bek-0.119
vered-0.114
heit-0.113
 angles-0.110
Token was
Token ID373
Feature activation+1.18e+00

Pos logit contributions
 sick+0.089
 doctor+0.088
 hospital+0.088
 dentist+0.088
 sleep+0.085
Neg logit contributions
 rect-0.088
bek-0.087
vered-0.084
heit-0.083
 angles-0.080
Token wet
Token ID9583
Feature activation-1.83e+00

Pos logit contributions
 rect+0.053
bek+0.053
vered+0.051
heit+0.051
 angles+0.050
Neg logit contributions
 hospital-0.030
 sick-0.030
 doctor-0.030
 dentist-0.030
 sleep-0.028
Token.
Token ID13
Feature activation+3.84e-01

Pos logit contributions
 sick+0.041
 doctor+0.040
 hospital+0.040
 dentist+0.040
 sleep+0.037
Neg logit contributions
 rect-0.090
bek-0.088
vered-0.086
heit-0.084
 angles-0.083
Token She
Token ID1375
Feature activation+7.15e-01

Pos logit contributions
bek+0.018
 rect+0.018
vered+0.017
heit+0.017
 angles+0.016
Neg logit contributions
 doctor-0.010
 sick-0.010
 hospital-0.010
 dentist-0.009
 sleep-0.009
Token said
Token ID531
Feature activation-5.58e-01

Pos logit contributions
bek+0.030
 rect+0.030
vered+0.029
heit+0.029
 angles+0.027
Neg logit contributions
 sick-0.021
 hospital-0.020
 doctor-0.020
 dentist-0.020
 sleep-0.020
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
 sick+0.014
 doctor+0.013
 hospital+0.013
 dentist+0.013
 sleep+0.013
Neg logit contributions
bek-0.027
 rect-0.027
vered-0.026
heit-0.026
 angles-0.025
Token "
Token ID366
Feature activation+4.48e+00

Pos logit contributions
bek+0.056
vered+0.054
heit+0.054
 rect+0.054
 angles+0.052
Neg logit contributions
 doctor-0.032
 sick-0.032
 hospital-0.030
 dentist-0.029
 sleep-0.029
TokenIt
Token ID1026
Feature activation+2.95e-02

Pos logit contributions
 rect+0.188
heit+0.187
bek+0.181
vered+0.177
 angles+0.174
Neg logit contributions
 sick-0.142
 doctor-0.141
 dentist-0.134
 hospital-0.133
 bat-0.131
Token looks
Token ID3073
Feature activation-1.16e+00

Pos logit contributions
bek+0.001
 rect+0.001
heit+0.001
vered+0.001
 angles+0.001
Neg logit contributions
 sick-0.001
 hospital-0.001
 doctor-0.001
 hurt-0.001
 sleep-0.001
Token like
Token ID588
Feature activation+1.02e+00

Pos logit contributions
 sick+0.029
 hospital+0.028
 doctor+0.028
 dentist+0.027
 sleep+0.027
Neg logit contributions
bek-0.055
 rect-0.054
vered-0.053
heit-0.052
 angles-0.051
Token everywhere
Token ID8347
Feature activation+4.78e+00

Pos logit contributions
 sick+0.038
 hospital+0.035
 sleep+0.034
 doctor+0.033
 dentist+0.031
Neg logit contributions
bek-0.066
 rect-0.063
vered-0.061
heit-0.060
 angles-0.058
Token but
Token ID475
Feature activation+2.64e+00

Pos logit contributions
bek+0.247
 rect+0.239
heit+0.236
vered+0.235
 angles+0.224
Neg logit contributions
 sick-0.109
 sleep-0.101
 doctor-0.100
 hospital-0.099
 dentist-0.092
Token he
Token ID339
Feature activation-3.86e-01

Pos logit contributions
bek+0.112
 rect+0.108
heit+0.105
vered+0.101
 angles+0.099
Neg logit contributions
 sick-0.087
 hospital-0.083
 doctor-0.082
 sleep-0.081
 bat-0.076
Token couldn
Token ID3521
Feature activation+2.26e+00

Pos logit contributions
 sick+0.015
 hospital+0.014
 doctor+0.014
 dentist+0.014
 sleep+0.014
Neg logit contributions
bek-0.013
 rect-0.013
vered-0.013
heit-0.012
 angles-0.012
Token't
Token ID470
Feature activation-8.01e-01

Pos logit contributions
 rect+0.052
vered+0.050
 perfectly+0.049
heit+0.045
bek+0.045
Neg logit contributions
 sick-0.101
 dentist-0.101
 doctors-0.099
 hospital-0.097
 thirsty-0.096
Token find
Token ID1064
Feature activation-1.88e+00

Pos logit contributions
 paramedics+0.026
 dentist+0.024
 paramed+0.024
 elderly+0.024
 doctors+0.024
Neg logit contributions
 perfectly-0.030
vered-0.029
 rect-0.028
 angles-0.027
ra-0.026
Token it
Token ID340
Feature activation-1.83e+00

Pos logit contributions
 sick+0.043
 doctor+0.042
 hospital+0.040
 sleep+0.040
.+0.039
Neg logit contributions
bek-0.108
 rect-0.097
heit-0.096
rave-0.091
vered-0.090
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
 sick+0.026
 hospital+0.025
 doctor+0.024
 dentist+0.024
 sleep+0.022
Neg logit contributions
bek-0.105
 rect-0.104
vered-0.102
heit-0.101
 angles-0.099
Token He
Token ID679
Feature activation+9.50e-01

Pos logit contributions
bek+0.011
heit+0.011
 rect+0.011
vered+0.011
 angles+0.010
Neg logit contributions
 doctor-0.005
 hospital-0.005
 sick-0.005
 sleep-0.005
 dentist-0.005
Token was
Token ID373
Feature activation+6.80e-01

Pos logit contributions
 rect+0.043
bek+0.043
heit+0.040
vered+0.039
 angles+0.039
Neg logit contributions
 sick-0.028
 hospital-0.027
 doctor-0.026
 sleep-0.026
 dentist-0.025
Token so
Token ID523
Feature activation-1.48e-01

Pos logit contributions
 rect+0.024
vered+0.024
bek+0.024
 perfectly+0.024
 angles+0.023
Neg logit contributions
 dentist-0.022
 doctor-0.022
 hospital-0.022
 sick-0.021
 sleep-0.021
Token friends
Token ID2460
Feature activation-3.18e-02

Pos logit contributions
 rect+0.091
vered+0.087
bek+0.085
 angles+0.081
 perfectly+0.080
Neg logit contributions
 dentist-0.062
 sick-0.060
 hospital-0.060
 doctor-0.058
 itch-0.054
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
 sick+0.001
 hospital+0.001
 doctor+0.001
 dentist+0.001
 sleep+0.001
Neg logit contributions
bek-0.002
 rect-0.002
vered-0.002
heit-0.002
 angles-0.002
Token And
Token ID843
Feature activation+2.98e+00

Pos logit contributions
 hospital+0.002
 sick+0.002
 doctor+0.002
 sleep+0.002
 dentist+0.002
Neg logit contributions
bek-0.006
heit-0.005
 rect-0.005
vered-0.005
 angles-0.005
Token they
Token ID484
Feature activation-6.80e-01

Pos logit contributions
heit+0.173
vered+0.164
bek+0.163
 rect+0.162
 angles+0.158
Neg logit contributions
 sleep-0.061
 sick-0.060
 doctor-0.057
 hospital-0.055
 left-0.051
Token promised
Token ID8072
Feature activation+2.36e-01

Pos logit contributions
 sick+0.022
 dentist+0.022
 doctor+0.022
 hospital+0.022
 doctors+0.021
Neg logit contributions
bek-0.027
 rect-0.026
vered-0.025
heit-0.024
 perfectly-0.023
Token to
Token ID284
Feature activation-1.18e+00

Pos logit contributions
bek+0.011
 rect+0.011
vered+0.011
heit+0.010
 angles+0.010
Neg logit contributions
 sick-0.006
 hospital-0.006
 dentist-0.006
 doctor-0.005
 sleep-0.005
Token play
Token ID711
Feature activation-1.71e+00

Pos logit contributions
 sick+0.031
 dentist+0.030
 hospital+0.030
 doctor+0.029
 doctors+0.027
Neg logit contributions
bek-0.053
 rect-0.052
heit-0.049
vered-0.049
 angles-0.047
Token together
Token ID1978
Feature activation+2.92e+00

Pos logit contributions
 sick+0.073
 dentist+0.072
 hospital+0.070
 doctor+0.070
 itch+0.067
Neg logit contributions
bek-0.050
 rect-0.049
vered-0.048
heit-0.044
 perfectly-0.043
Token again
Token ID757
Feature activation-2.91e+00

Pos logit contributions
 rect+0.137
bek+0.137
vered+0.132
heit+0.128
 angles+0.126
Neg logit contributions
 sick-0.074
 hospital-0.072
 doctor-0.070
 dentist-0.070
 sleep-0.066
Token soon
Token ID2582
Feature activation-7.56e-01

Pos logit contributions
 sick+0.065
 hospital+0.063
 doctor+0.062
 dentist+0.062
 sleep+0.057
Neg logit contributions
bek-0.146
 rect-0.143
vered-0.139
heit-0.136
 angles-0.132
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
 dentist+0.013
 hospital+0.013
 doctor+0.012
 sick+0.012
 doctors+0.010
Neg logit contributions
 rect-0.041
vered-0.040
bek-0.040
 perfectly-0.039
heit-0.038
Token mom
Token ID1995
Feature activation-1.70e+00

Pos logit contributions
bek+0.024
 rect+0.024
vered+0.023
heit+0.023
 angles+0.022
Neg logit contributions
 sick-0.008
 hospital-0.007
 doctor-0.007
 dentist-0.007
 sleep-0.007
Token,
Token ID11
Feature activation+6.56e-01

Pos logit contributions
 sick+0.044
 doctor+0.044
 hospital+0.042
 dentist+0.041
 sleep+0.040
Neg logit contributions
bek-0.082
 rect-0.080
heit-0.078
vered-0.078
 angles-0.072
Token "
Token ID366
Feature activation+4.24e+00

Pos logit contributions
bek+0.030
 rect+0.030
heit+0.029
vered+0.029
 angles+0.028
Neg logit contributions
 doctor-0.018
 sick-0.018
 hospital-0.017
 dentist-0.016
 bat-0.016
TokenWhat
Token ID2061
Feature activation+1.48e+00

Pos logit contributions
 rect+0.164
heit+0.164
vered+0.163
bek+0.156
 angles+0.155
Neg logit contributions
 dentist-0.137
 doctor-0.137
 cook-0.134
 sick-0.131
 bat-0.130
Token is
Token ID318
Feature activation-3.22e+00

Pos logit contributions
bek+0.063
 rect+0.061
heit+0.059
vered+0.059
 angles+0.057
Neg logit contributions
 sick-0.047
 doctor-0.045
 hospital-0.045
 dentist-0.044
 sleep-0.043
Token it
Token ID340
Feature activation-4.61e+00

Pos logit contributions
 sick+0.091
 hospital+0.082
 doctor+0.082
 sleep+0.079
?"+0.078
Neg logit contributions
bek-0.156
vered-0.148
 rect-0.147
heit-0.141
 angles-0.141
Token?"
Token ID1701
Feature activation-2.49e+00

Pos logit contributions
 hospital+0.088
 dentist+0.081
 doctor+0.078
 sick+0.075
 sleep+0.075
Neg logit contributions
 rect-0.231
heit-0.228
bek-0.224
 perfectly-0.219
rave-0.212
Token She
Token ID1375
Feature activation+8.72e-01

Pos logit contributions
 doctor+0.071
 sick+0.070
 hospital+0.068
 dentist+0.065
 sleep+0.065
Neg logit contributions
bek-0.114
 rect-0.112
heit-0.110
vered-0.109
 angles-0.105
Token said
Token ID531
Feature activation-1.12e+00

Pos logit contributions
bek+0.041
 rect+0.040
vered+0.039
heit+0.038
 angles+0.037
Neg logit contributions
 sick-0.023
 hospital-0.022
 doctor-0.022
 dentist-0.021
 sleep-0.021
Token,
Token ID11
Feature activation+9.75e-01

Pos logit contributions
 doctor+0.019
 sick+0.018
 hospital+0.016
.+0.015
?"+0.015
Neg logit contributions
bek-0.066
heit-0.063
 rect-0.062
 angles-0.061
vered-0.061
Token "
Token ID366
Feature activation+4.72e+00

Pos logit contributions
bek+0.045
 rect+0.043
vered+0.043
heit+0.042
 angles+0.041
Neg logit contributions
 sick-0.027
 doctor-0.027
 hospital-0.026
 dentist-0.025
 sleep-0.025
Token care
Token ID1337
Feature activation-1.22e+00

Pos logit contributions
 dentist+0.005
 hospital+0.005
 doctor+0.005
 sick+0.005
 tum+0.005
Neg logit contributions
 rect-0.007
bek-0.006
 angles-0.006
vered-0.006
heit-0.006
Token about
Token ID546
Feature activation+8.55e-01

Pos logit contributions
 sick+0.017
 doctor+0.016
 hospital+0.016
?"+0.013
 sleep+0.013
Neg logit contributions
bek-0.074
 rect-0.073
heit-0.072
vered-0.070
 angles-0.069
Token their
Token ID511
Feature activation+1.30e-01

Pos logit contributions
bek+0.041
 rect+0.040
vered+0.038
heit+0.038
 angles+0.036
Neg logit contributions
 sick-0.023
 hospital-0.023
 doctor-0.022
 dentist-0.021
 sleep-0.021
Token aunt
Token ID25949
Feature activation+5.81e-01

Pos logit contributions
bek+0.007
 rect+0.006
heit+0.006
vered+0.006
ogram+0.006
Neg logit contributions
 sick-0.003
 hospital-0.003
 doctor-0.003
 sleep-0.003
 dentist-0.003
Token or
Token ID393
Feature activation+9.80e-01

Pos logit contributions
bek+0.029
 rect+0.028
heit+0.027
vered+0.027
 angles+0.026
Neg logit contributions
 sick-0.014
 hospital-0.013
 doctor-0.013
 dentist-0.013
 sleep-0.013
Token each
Token ID1123
Feature activation-4.06e+00

Pos logit contributions
bek+0.049
 rect+0.048
vered+0.047
heit+0.047
 angles+0.045
Neg logit contributions
 sick-0.024
 hospital-0.024
 doctor-0.024
 dentist-0.023
 sleep-0.022
Token other
Token ID584
Feature activation+1.18e+00

Pos logit contributions
 sick+0.088
 doctor+0.084
 hospital+0.084
 dentist+0.077
 sleep+0.075
Neg logit contributions
bek-0.216
 rect-0.213
heit-0.209
vered-0.205
 angles-0.197
Token.
Token ID13
Feature activation-1.88e-01

Pos logit contributions
bek+0.072
 rect+0.070
heit+0.069
vered+0.068
 angles+0.066
Neg logit contributions
 sick-0.016
 hospital-0.015
 doctor-0.014
 sleep-0.013
 dentist-0.013
Token
Token ID198
Feature activation-6.07e-02

Pos logit contributions
 sick+0.004
 hospital+0.004
 doctor+0.004
 dentist+0.004
 sleep+0.004
Neg logit contributions
bek-0.010
 rect-0.010
vered-0.009
heit-0.009
 angles-0.009
Token
Token ID198
Feature activation-5.31e-01

Pos logit contributions
 itch+0.002
 sick+0.002
 dentist+0.002
 hospital+0.002
 doctors+0.002
Neg logit contributions
bek-0.002
heit-0.002
 rect-0.002
vered-0.002
 angles-0.002
TokenSoon
Token ID28093
Feature activation+4.58e-01

Pos logit contributions
 dentist+0.021
 sleep+0.021
 hospital+0.021
 sick+0.021
 doctor+0.021
Neg logit contributions
bek-0.017
 rect-0.016
heit-0.016
vered-0.016
Wow-0.014
Token what
Token ID644
Feature activation+1.51e+00

Pos logit contributions
 sick+0.048
 doctor+0.047
 hospital+0.046
?"+0.045
 dentist+0.045
Neg logit contributions
bek-0.073
 rect-0.068
vered-0.066
heit-0.065
ically-0.062
Token was
Token ID373
Feature activation-2.13e+00

Pos logit contributions
bek+0.064
 rect+0.061
heit+0.060
vered+0.059
 angles+0.056
Neg logit contributions
 sick-0.048
 doctor-0.047
 hospital-0.047
 dentist-0.045
 sleep-0.044
Token on
Token ID319
Feature activation-1.40e-01

Pos logit contributions
 sick+0.055
 hospital+0.053
 doctor+0.052
 dentist+0.051
 sleep+0.050
Neg logit contributions
bek-0.102
 rect-0.100
vered-0.097
heit-0.096
 angles-0.092
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
 sick+0.003
 hospital+0.003
 doctor+0.003
 dentist+0.003
 sleep+0.003
Neg logit contributions
bek-0.007
 rect-0.007
vered-0.006
heit-0.006
 angles-0.006
Token TV
Token ID3195
Feature activation+2.49e+00

Pos logit contributions
 sick+0.038
 hospital+0.037
 doctor+0.037
 dentist+0.035
 sleep+0.035
Neg logit contributions
bek-0.060
 rect-0.058
vered-0.057
heit-0.056
 angles-0.053
Token too
Token ID1165
Feature activation-4.16e+00

Pos logit contributions
bek+0.152
 rect+0.150
vered+0.146
heit+0.146
 angles+0.142
Neg logit contributions
 sick-0.029
 hospital-0.027
 doctor-0.027
 dentist-0.025
 sleep-0.024
Token.
Token ID13
Feature activation+5.55e-02

Pos logit contributions
 sick+0.051
 hospital+0.050
 doctor+0.049
 dentist+0.047
 sleep+0.043
Neg logit contributions
bek-0.249
 rect-0.247
vered-0.241
heit-0.241
 angles-0.235
Token Tim
Token ID5045
Feature activation+6.49e-01

Pos logit contributions
bek+0.003
 rect+0.003
heit+0.003
vered+0.003
 angles+0.003
Neg logit contributions
 doctor-0.001
 hospital-0.001
 sick-0.001
 dentist-0.001
 sleep-0.001
Token laughed
Token ID13818
Feature activation+7.45e-01

Pos logit contributions
bek+0.030
 rect+0.029
vered+0.028
heit+0.028
 angles+0.027
Neg logit contributions
 sick-0.018
 hospital-0.017
 doctor-0.017
 dentist-0.017
 sleep-0.016
Token and
Token ID290
Feature activation+5.79e-03

Pos logit contributions
 rect+0.035
bek+0.035
vered+0.034
heit+0.033
 angles+0.032
Neg logit contributions
 sick-0.019
 hospital-0.018
 dentist-0.018
 doctor-0.018
 sleep-0.017
Token said
Token ID531
Feature activation-1.34e+00

Pos logit contributions
bek+0.000
 rect+0.000
vered+0.000
heit+0.000
 angles+0.000
Neg logit contributions
 sick-0.000
 hospital-0.000
 doctor-0.000
 sleep-0.000
 dentist-0.000
Token,
Token ID11
Feature activation-5.75e-01

Pos logit contributions
bek+0.111
 rect+0.108
vered+0.104
heit+0.103
 angles+0.100
Neg logit contributions
 sick-0.069
 doctor-0.067
 hospital-0.066
 dentist-0.064
 sleep-0.063
Token you
Token ID345
Feature activation-4.96e-01

Pos logit contributions
 doctor+0.016
 sick+0.016
 hospital+0.016
 dentist+0.015
 sleep+0.015
Neg logit contributions
bek-0.027
 rect-0.026
vered-0.026
heit-0.025
 angles-0.024
Token can
Token ID460
Feature activation+1.13e+00

Pos logit contributions
 sick+0.020
 hospital+0.020
 doctor+0.019
 dentist+0.019
 sleep+0.019
Neg logit contributions
bek-0.016
 rect-0.016
vered-0.015
heit-0.015
 angles-0.014
Token have
Token ID423
Feature activation-1.02e+00

Pos logit contributions
 rect+0.050
vered+0.048
bek+0.047
 perfectly+0.047
heit+0.047
Neg logit contributions
 sick-0.030
 hospital-0.029
 dentist-0.029
 doctors-0.029
 doctor-0.027
Token some
Token ID617
Feature activation-2.21e+00

Pos logit contributions
 sick+0.028
 hospital+0.027
 doctor+0.027
 dentist+0.026
 sleep+0.026
Neg logit contributions
 rect-0.046
bek-0.046
heit-0.044
vered-0.044
 angles-0.042
Token too
Token ID1165
Feature activation-4.20e+00

Pos logit contributions
 sick+0.053
 hospital+0.053
 doctor+0.050
 dentist+0.050
 sleep+0.049
Neg logit contributions
 rect-0.105
bek-0.102
vered-0.100
heit-0.099
 angles-0.097
Token."
Token ID526
Feature activation-1.48e+00

Pos logit contributions
 hospital+0.119
 dentist+0.111
 doctor+0.110
 doctors+0.108
 sick+0.103
Neg logit contributions
 rect-0.181
bek-0.179
heit-0.176
vered-0.175
 perfectly-0.173
Token
Token ID198
Feature activation-3.02e-01

Pos logit contributions
 sick+0.035
 hospital+0.033
 doctor+0.032
 dentist+0.032
 sleep+0.031
Neg logit contributions
bek-0.072
 rect-0.072
vered-0.069
heit-0.069
 angles-0.067
Token
Token ID198
Feature activation-5.98e-01

Pos logit contributions
 sick+0.009
 hospital+0.009
 itch+0.009
 doctors+0.008
 dentist+0.008
Neg logit contributions
bek-0.012
 rect-0.012
heit-0.011
 angles-0.011
Alex-0.011
TokenThey
Token ID2990
Feature activation+3.64e-01

Pos logit contributions
 sick+0.016
 doctor+0.016
 hospital+0.015
 dentist+0.015
 sleep+0.015
Neg logit contributions
bek-0.028
 rect-0.028
vered-0.027
heit-0.027
 angles-0.026
Token sit
Token ID1650
Feature activation-3.13e+00

Pos logit contributions
 rect+0.017
bek+0.017
heit+0.017
vered+0.016
 angles+0.016
Neg logit contributions
 sick-0.009
 dentist-0.008
 doctor-0.008
 hospital-0.008
 doctors-0.008
Token it
Token ID340
Feature activation-2.62e+00

Pos logit contributions
 sick+0.008
 doctor+0.007
 hospital+0.007
 sleep+0.007
.+0.007
Neg logit contributions
bek-0.019
 rect-0.017
heit-0.017
ogram-0.016
 angles-0.016
Token was
Token ID373
Feature activation+2.27e-01

Pos logit contributions
 sick+0.071
 doctor+0.066
 hospital+0.066
 sleep+0.064
 hurt+0.063
Neg logit contributions
bek-0.128
 rect-0.119
heit-0.117
vered-0.114
 angles-0.112
Token and
Token ID290
Feature activation+2.10e-01

Pos logit contributions
bek+0.013
 rect+0.013
vered+0.012
heit+0.012
 angles+0.012
Neg logit contributions
 sick-0.004
 hospital-0.003
 doctor-0.003
 dentist-0.003
 sleep-0.003
Token how
Token ID703
Feature activation+1.50e-02

Pos logit contributions
bek+0.010
 rect+0.009
heit+0.009
vered+0.009
anish+0.009
Neg logit contributions
 sick-0.006
 hospital-0.005
 doctor-0.005
 sleep-0.005
 hurt-0.005
Token great
Token ID1049
Feature activation+1.69e+00

Pos logit contributions
bek+0.001
 rect+0.001
vered+0.001
heit+0.001
 angles+0.001
Neg logit contributions
 sick-0.000
 hospital-0.000
 doctor-0.000
 dentist-0.000
 sleep-0.000
Token it
Token ID340
Feature activation-4.31e+00

Pos logit contributions
bek+0.089
 rect+0.082
vered+0.080
heit+0.080
ogram+0.078
Neg logit contributions
 sick-0.040
 doctor-0.040
 hospital-0.039
 sleep-0.038
 doctors-0.036
Token looked
Token ID3114
Feature activation-2.57e+00

Pos logit contributions
 sick+0.131
 hospital+0.127
 doctor+0.126
 dentist+0.122
 sleep+0.121
Neg logit contributions
bek-0.185
 rect-0.180
heit-0.174
vered-0.173
 angles-0.167
Token on
Token ID319
Feature activation+1.35e+00

Pos logit contributions
 sick+0.032
 hospital+0.027
 doctor+0.027
 sleep+0.025
 dentist+0.024
Neg logit contributions
bek-0.159
 rect-0.155
heit-0.152
vered-0.151
 angles-0.146
Token her
Token ID607
Feature activation+7.93e-01

Pos logit contributions
bek+0.092
heit+0.086
vered+0.083
 rect+0.083
osaurs+0.082
Neg logit contributions
.-0.017
 sick-0.013
 sleep-0.011
 doctor-0.010
 hot-0.010
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
bek+0.038
 rect+0.035
vered+0.035
heit+0.035
Wow+0.032
Neg logit contributions
 sick-0.022
 sleep-0.022
 doctor-0.021
 hospital-0.021
 bat-0.020
Token
Token ID220
Feature activation+8.36e-01

Pos logit contributions
 doctor+0.005
 hospital+0.005
 sleep+0.005
 sick+0.005
 veterinarian+0.005
Neg logit contributions
bek-0.014
 rect-0.014
heit-0.014
vered-0.013
 angles-0.013