TOP ACTIVATIONS
MAX = 0.158

had told her to call 911 if there was an emergency
" Hello , 911 , what is your emergency
to pick up their suit cases but they couldn 't find
. Thank you for calling 911 ." L ily
, they checked their suit cases in and walked to their
the dad carried the suit cases again . They had a
off and walked around . Other people stayed on the train
and all the big suit cases . But
, Tony grabbed the suit cases and carried them out of
their families . They call 911 . They hope it is
running around with their suit cases looked like tiny ants .
tall towers and winding stair cases but most of all she
big belt . The suit cases went into the plane .
rest in the spot . Other times it was a squirrel
hurt badly . She called 911 and waited for the ambulance
He put all the suit cases in the car , and
down on the path . Other animals saw the fox and
a lot of food . Other birds saw the fat bird
making a big fuss . Other customers in the store started
and not miserable anymore . Other kids heard Tom whistle and

MIN ACTIVATIONS
MIN = -0.213

like lettuce , tomatoes and cuc umbers . He smiled as
John chose some carrots and cuc umbers to take home .
. You found a nice cuc umber . It will be
has lettuce and tomatoes and cuc umbers . And it has
garden . They cut the cuc umber and put it in
sip and nib ble and smile . They feel happy and
proud of them . They smile and say thank you .
girl ." With a curious smile , the girl opened the
They were filled with fresh cuc umber and cheese . It
saw carrots , potatoes , cuc umbers and tomatoes . He
girl and swing with a smile on his face . <|endoftext|>
wise man with a kind smile . The wise man told
thank you . We love cuc umbers . And we love
finished , Mrs . P ige on said Jack had done
. They hold hands and smile . " Are
He also has a big smile on his face .
was happy to put a smile on her friend â €
says . They smile at each other . They
smiled a big , joyful smile . Soon they
cuc umber . Mama puts cuc umbers in our salad .

INTERVAL 0.065 - 0.158
CONTAINS 1.076%

Sam said , pet ting Rex . " But we have
want to play with Sue anymore . He took the dish
look up . It was raining hard , and the wind
away . We can see other fish at the aquarium ,
They promise to see each other again . They go home

INTERVAL -0.027 - 0.065
CONTAINS 84.521%

room . Now he could see it all the time and
, what is this ?" Lily asked . "
play outside on the ground . One day , Robert was
to follow her . The nurse followed the
'm always here to help ". <|endoftext|> Once there were two

INTERVAL -0.120 - -0.027
CONTAINS 14.355%

hearing her prayers and returned home with a smile on her
was brave and strong . <|endoftext|> Once upon a time ,
come and help them . They hugged each other and waited
blanket . Her mother had been showing her how to do
just curious and excited . He likes to zoom around on

INTERVAL -0.213 - -0.120
CONTAINS 0.048%

old turtle said with a smile . Harry thanked
." The balloon seems to smile back at her . Anna
and she smiled a big smile . When they
beautiful hat and a bright smile . The
her mom with a big smile and said , " Mom
Token had
Token ID550
Feature activation-3.39e-02

Pos logit contributions
 stations+0.049
adders+0.049
 basement+0.047
 themselves+0.046
chairs+0.045
Neg logit contributions
ory-0.019
 ate-0.017
abby-0.016
 che-0.016
hire-0.015
Token told
Token ID1297
Feature activation+4.78e-02

Pos logit contributions
 che+0.031
 taste+0.031
ory+0.030
 cuc+0.029
 ate+0.029
Neg logit contributions
chairs-0.069
 stations-0.066
 Michael-0.063
 basement-0.062
adders-0.061
Token her
Token ID607
Feature activation-3.07e-03

Pos logit contributions
chairs+0.097
 stations+0.093
adders+0.091
 basement+0.089
 cinema+0.087
Neg logit contributions
 taste-0.046
ory-0.043
hire-0.040
 smile-0.038
 smell-0.038
Token to
Token ID284
Feature activation-3.10e-03

Pos logit contributions
 taste+0.004
va+0.004
hire+0.004
ory+0.004
 smile+0.004
Neg logit contributions
 stations-0.005
chairs-0.005
 ambul-0.005
 cinema-0.005
 basement-0.004
Token call
Token ID869
Feature activation+4.75e-02

Pos logit contributions
 taste+0.003
 smile+0.002
 lick+0.002
ory+0.002
hire+0.002
Neg logit contributions
chairs-0.007
adders-0.007
 stations-0.007
 ambul-0.006
 Michael-0.006
Token 911
Token ID16679
Feature activation+1.58e-01

Pos logit contributions
chairs+0.086
 stations+0.082
adders+0.079
 basement+0.076
 fault+0.075
Neg logit contributions
 taste-0.046
hire-0.045
 smile-0.044
ory-0.043
 smell-0.043
Token if
Token ID611
Feature activation+3.70e-02

Pos logit contributions
chairs+0.305
adders+0.305
 emergencies+0.303
Hours+0.300
 stations+0.299
Neg logit contributions
y-0.118
ho-0.101
ory-0.100
ny-0.087
r-0.085
Token there
Token ID612
Feature activation+1.49e-02

Pos logit contributions
 stations+0.067
chairs+0.064
adders+0.062
 fault+0.060
 rails+0.058
Neg logit contributions
va-0.037
 taste-0.037
ory-0.035
 smell-0.035
 che-0.034
Token was
Token ID373
Feature activation+3.90e-02

Pos logit contributions
 stations+0.032
adders+0.031
 cinema+0.030
 emergencies+0.029
 basement+0.029
Neg logit contributions
 taste-0.012
 smell-0.010
 ate-0.009
ho-0.009
 smile-0.009
Token an
Token ID281
Feature activation+3.24e-05

Pos logit contributions
chairs+0.070
 stations+0.061
adders+0.061
path+0.058
cases+0.056
Neg logit contributions
 taste-0.047
 che-0.043
 smell-0.043
 mor-0.041
 l-0.041
Token emergency
Token ID6334
Feature activation+9.21e-02

Pos logit contributions
chairs+0.000
 themselves+0.000
 Rex+0.000
 stations+0.000
 Ali+0.000
Neg logit contributions
hire-0.000
ory-0.000
lder-0.000
abby-0.000
ka-0.000
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
 stations+0.019
 ambul+0.017
adders+0.017
chairs+0.016
 radios+0.016
Neg logit contributions
ory-0.007
va-0.007
aire-0.006
 cuc-0.005
 pineapple-0.005
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
 stations+0.024
chairs+0.023
 ambul+0.023
adders+0.022
 emergencies+0.021
Neg logit contributions
abby-0.016
ory-0.015
va-0.015
hire-0.013
lder-0.013
Token"
Token ID1
Feature activation-3.72e-02

Pos logit contributions
chairs+0.026
 stations+0.024
 ambul+0.024
adders+0.024
 tracks+0.023
Neg logit contributions
ory-0.015
va-0.015
hire-0.015
abby-0.015
 cuc-0.013
TokenHello
Token ID15496
Feature activation+2.87e-03

Pos logit contributions
ory+0.035
va+0.035
hire+0.035
abby+0.033
 cuc+0.032
Neg logit contributions
chairs-0.076
adders-0.074
 tracks-0.072
 themselves-0.070
 stations-0.069
Token,
Token ID11
Feature activation+9.80e-03

Pos logit contributions
 stations+0.005
chairs+0.005
 themselves+0.005
adders+0.004
 tracks+0.004
Neg logit contributions
ory-0.004
va-0.004
lder-0.004
 cuc-0.004
ka-0.004
Token 911
Token ID16679
Feature activation+1.57e-01

Pos logit contributions
chairs+0.018
 themselves+0.017
 stations+0.016
adders+0.016
 fault+0.015
Neg logit contributions
va-0.011
 taste-0.011
hire-0.011
ory-0.010
 smell-0.010
Token,
Token ID11
Feature activation+8.38e-03

Pos logit contributions
chairs+0.260
 stations+0.257
 themselves+0.254
adders+0.251
 tracks+0.248
Neg logit contributions
ka-0.192
ory-0.190
lder-0.177
va-0.173
hire-0.169
Token what
Token ID644
Feature activation+2.73e-03

Pos logit contributions
chairs+0.016
 stations+0.014
 themselves+0.013
 tracks+0.013
 pads+0.013
Neg logit contributions
 taste-0.009
 smell-0.009
hire-0.008
 grant-0.008
va-0.008
Token is
Token ID318
Feature activation+4.03e-02

Pos logit contributions
chairs+0.005
 stations+0.005
 themselves+0.005
 tracks+0.004
 ambul+0.004
Neg logit contributions
ory-0.003
va-0.003
 cuc-0.003
hire-0.003
 taste-0.003
Token your
Token ID534
Feature activation+1.74e-02

Pos logit contributions
chairs+0.060
 stations+0.058
 themselves+0.054
 beds+0.054
adders+0.054
Neg logit contributions
 cuc-0.057
hire-0.054
lder-0.054
ory-0.053
 mineral-0.053
Token emergency
Token ID6334
Feature activation+8.88e-02

Pos logit contributions
chairs+0.034
adders+0.033
 stations+0.032
 themselves+0.032
ions+0.031
Neg logit contributions
 name-0.019
 taste-0.017
 smell-0.017
lder-0.016
 ring-0.015
Token to
Token ID284
Feature activation-3.09e-03

Pos logit contributions
chairs+0.010
adders+0.010
 stations+0.010
 emergencies+0.009
 ambul+0.009
Neg logit contributions
ory-0.008
abby-0.007
 smile-0.007
ka-0.007
 token-0.006
Token pick
Token ID2298
Feature activation-6.54e-02

Pos logit contributions
 taste+0.003
 smile+0.003
 smell+0.003
 receive+0.003
 pick+0.003
Neg logit contributions
chairs-0.007
adders-0.006
lights-0.006
oses-0.006
 stations-0.006
Token up
Token ID510
Feature activation+3.81e-02

Pos logit contributions
ka+0.089
ory+0.086
 taste+0.079
by+0.078
 peanut+0.077
Neg logit contributions
 emergencies-0.105
adders-0.099
 stations-0.097
 ambul-0.097
 basement-0.097
Token their
Token ID511
Feature activation+3.99e-02

Pos logit contributions
chairs+0.060
adders+0.058
 stations+0.058
 emergencies+0.057
 basement+0.056
Neg logit contributions
ory-0.052
 che-0.047
 taste-0.047
 l-0.047
 cuc-0.046
Token suit
Token ID6050
Feature activation+1.46e-04

Pos logit contributions
chairs+0.064
adders+0.062
 stations+0.059
 Michael+0.059
 themselves+0.059
Neg logit contributions
ory-0.056
ka-0.055
 cuc-0.053
 taste-0.052
 mineral-0.051
Tokencases
Token ID33964
Feature activation+1.57e-01

Pos logit contributions
 emergencies+0.000
 stations+0.000
 ambul+0.000
 basement+0.000
adders+0.000
Neg logit contributions
ory-0.000
app-0.000
abby-0.000
 smile-0.000
hire-0.000
Token but
Token ID475
Feature activation+4.55e-02

Pos logit contributions
 emergencies+0.283
chairs+0.277
adders+0.276
 ambul+0.275
 stations+0.270
Neg logit contributions
ory-0.193
 smile-0.177
abby-0.175
 taste-0.170
 cuc-0.167
Token they
Token ID484
Feature activation+5.74e-02

Pos logit contributions
chairs+0.072
adders+0.071
 stations+0.070
 basement+0.069
 emergencies+0.068
Neg logit contributions
ory-0.053
 picked-0.052
hire-0.050
 took-0.049
 wore-0.049
Token couldn
Token ID3521
Feature activation+3.29e-02

Pos logit contributions
 stations+0.122
chairs+0.118
 emergencies+0.116
 ambul+0.111
adders+0.111
Neg logit contributions
ory-0.061
abby-0.052
hire-0.050
 ate-0.049
va-0.048
Token't
Token ID470
Feature activation+2.65e-02

Pos logit contributions
 stations+0.074
chairs+0.066
adders+0.066
 ambul+0.066
 emergencies+0.066
Neg logit contributions
va-0.023
app-0.023
 taste-0.022
ory-0.022
lder-0.022
Token find
Token ID1064
Feature activation-4.66e-03

Pos logit contributions
chairs+0.055
 stations+0.052
 ambul+0.051
adders+0.050
 Michael+0.049
Neg logit contributions
ory-0.029
hire-0.026
abby-0.024
 mineral-0.023
 cuc-0.023
Token.
Token ID13
Feature activation+4.66e-03

Pos logit contributions
ory+0.006
ka+0.006
hire+0.005
va+0.005
 cuc+0.005
Neg logit contributions
chairs-0.010
adders-0.010
 basement-0.009
 ambul-0.009
 rails-0.009
Token Thank
Token ID6952
Feature activation-5.18e-03

Pos logit contributions
chairs+0.007
 stations+0.007
 themselves+0.007
adders+0.007
 ambul+0.007
Neg logit contributions
ory-0.007
 cuc-0.006
va-0.006
hire-0.006
ka-0.006
Token you
Token ID345
Feature activation-3.86e-03

Pos logit contributions
ory+0.011
abby+0.011
hire+0.011
 cuc+0.011
 mineral+0.010
Neg logit contributions
 themselves-0.005
chairs-0.004
 Rex-0.004
 stations-0.004
 ambul-0.004
Token for
Token ID329
Feature activation+3.85e-02

Pos logit contributions
ory+0.005
va+0.004
 cuc+0.004
abby+0.004
hire+0.004
Neg logit contributions
chairs-0.008
 stations-0.007
adders-0.007
 basement-0.007
 emergencies-0.007
Token calling
Token ID4585
Feature activation+1.93e-02

Pos logit contributions
chairs+0.054
 stations+0.049
adders+0.049
 ambul+0.046
 Michael+0.046
Neg logit contributions
ory-0.064
 cuc-0.063
hire-0.061
va-0.061
abby-0.060
Token 911
Token ID16679
Feature activation+1.56e-01

Pos logit contributions
chairs+0.037
adders+0.035
 stations+0.035
 basement+0.033
 ambul+0.033
Neg logit contributions
ory-0.022
hire-0.021
ka-0.020
lder-0.020
abby-0.019
Token."
Token ID526
Feature activation-3.49e-04

Pos logit contributions
chairs+0.277
 basement+0.269
 emergencies+0.269
adders+0.265
 elbows+0.264
Neg logit contributions
ory-0.148
ka-0.140
lder-0.133
hire-0.127
 Crown-0.127
Token
Token ID198
Feature activation+4.33e-03

Pos logit contributions
ory+0.000
va+0.000
 cuc+0.000
abby+0.000
 pineapple+0.000
Neg logit contributions
 stations-0.001
chairs-0.001
 ambul-0.001
adders-0.001
 emergencies-0.001
Token
Token ID198
Feature activation+5.07e-03

Pos logit contributions
chairs+0.006
 stations+0.005
 ambul+0.005
adders+0.005
 themselves+0.005
Neg logit contributions
ory-0.008
abby-0.007
va-0.007
hire-0.007
 cuc-0.007
TokenL
Token ID43
Feature activation+4.19e-03

Pos logit contributions
chairs+0.008
 stations+0.008
 themselves+0.008
 ambul+0.008
adders+0.008
Neg logit contributions
ory-0.007
 cuc-0.007
hire-0.007
abby-0.007
va-0.006
Tokenily
Token ID813
Feature activation-2.48e-03

Pos logit contributions
chairs+0.006
 stations+0.006
 themselves+0.005
 ambul+0.005
adders+0.005
Neg logit contributions
ory-0.007
hire-0.007
abby-0.007
 cuc-0.007
va-0.007
Token,
Token ID11
Feature activation+2.58e-03

Pos logit contributions
chairs+0.035
 stations+0.033
 cords+0.033
adders+0.032
 emergencies+0.032
Neg logit contributions
ory-0.012
 smile-0.009
by-0.009
va-0.008
abby-0.008
Token they
Token ID484
Feature activation+5.17e-02

Pos logit contributions
chairs+0.005
adders+0.005
 stations+0.004
 rails+0.004
 fault+0.004
Neg logit contributions
ory-0.003
va-0.003
 cuc-0.003
 taste-0.003
 smell-0.003
Token checked
Token ID10667
Feature activation+1.91e-02

Pos logit contributions
chairs+0.112
 stations+0.108
 emergencies+0.107
 ambul+0.106
 themselves+0.106
Neg logit contributions
ory-0.051
abby-0.044
hire-0.042
 cuc-0.041
va-0.040
Token their
Token ID511
Feature activation+3.89e-02

Pos logit contributions
chairs+0.039
path+0.033
adders+0.033
 ambul+0.033
opl+0.032
Neg logit contributions
ory-0.021
ka-0.018
by-0.017
 smile-0.017
 mineral-0.016
Token suit
Token ID6050
Feature activation-1.77e-03

Pos logit contributions
adders+0.055
chairs+0.053
 themselves+0.047
 Ali+0.045
ions+0.045
Neg logit contributions
ory-0.066
by-0.063
ka-0.059
 mineral-0.056
 smile-0.056
Tokencases
Token ID33964
Feature activation+1.55e-01

Pos logit contributions
ouse+0.002
ory+0.002
abby+0.002
hire+0.002
ka+0.002
Neg logit contributions
 emergencies-0.003
 ambul-0.002
 stations-0.002
adders-0.002
 earthquakes-0.002
Token in
Token ID287
Feature activation-2.38e-02

Pos logit contributions
chairs+0.279
 emergencies+0.270
 ambul+0.268
adders+0.268
 Ali+0.263
Neg logit contributions
ory-0.186
 smile-0.186
abby-0.175
 taste-0.173
hire-0.167
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
 mineral+0.026
 smile+0.025
ory+0.025
 ruby+0.025
ka+0.024
Neg logit contributions
adders-0.046
chairs-0.045
oses-0.039
path-0.036
 ambul-0.036
Token walked
Token ID6807
Feature activation+6.35e-03

Pos logit contributions
adders+0.039
chairs+0.038
 fault+0.037
ion+0.036
 ambul+0.035
Neg logit contributions
ory-0.015
 smile-0.015
 ate-0.014
 che-0.014
 picked-0.014
Token to
Token ID284
Feature activation-1.31e-03

Pos logit contributions
chairs+0.012
 ambul+0.011
adders+0.011
 emergencies+0.011
 cords+0.010
Neg logit contributions
ory-0.009
ka-0.008
abby-0.008
hire-0.008
va-0.007
Token their
Token ID511
Feature activation+4.05e-02

Pos logit contributions
ory+0.002
 taste+0.001
 smile+0.001
 receive+0.001
 smell+0.001
Neg logit contributions
chairs-0.003
adders-0.002
 cords-0.002
 pads-0.002
oses-0.002
Token the
Token ID262
Feature activation-5.08e-04

Pos logit contributions
chairs+0.001
adders+0.001
 stations+0.001
 rails+0.001
 supervision+0.001
Neg logit contributions
 mineral-0.001
ory-0.001
 cuc-0.001
 crab-0.001
va-0.001
Token dad
Token ID9955
Feature activation-3.29e-02

Pos logit contributions
ory+0.000
 mineral+0.000
ka+0.000
hire+0.000
 cuc+0.000
Neg logit contributions
chairs-0.001
 themselves-0.001
adders-0.001
 ambul-0.001
 stations-0.001
Token carried
Token ID5281
Feature activation-1.81e-02

Pos logit contributions
ory+0.036
va+0.031
 cuc+0.031
abby+0.030
hire+0.030
Neg logit contributions
 themselves-0.065
chairs-0.065
adders-0.064
 stations-0.064
 emergencies-0.063
Token the
Token ID262
Feature activation+2.81e-04

Pos logit contributions
ka+0.024
ory+0.024
 mor+0.022
by+0.022
 che+0.021
Neg logit contributions
chairs-0.032
adders-0.030
 basement-0.030
 emergencies-0.029
 ambul-0.029
Token suit
Token ID6050
Feature activation-6.70e-03

Pos logit contributions
 themselves+0.001
adders+0.001
chairs+0.000
oses+0.000
 Ali+0.000
Neg logit contributions
ory-0.000
 mineral-0.000
ka-0.000
 jewel-0.000
 ruby-0.000
Tokencases
Token ID33964
Feature activation+1.54e-01

Pos logit contributions
ory+0.008
by+0.008
hire+0.008
abby+0.007
 smile+0.007
Neg logit contributions
 emergencies-0.013
 ambul-0.012
 stations-0.012
 cables-0.012
 earthquakes-0.012
Token again
Token ID757
Feature activation+6.59e-02

Pos logit contributions
 emergencies+0.266
chairs+0.261
 ambul+0.260
 stations+0.251
 Michael+0.243
Neg logit contributions
ory-0.211
abby-0.200
 smile-0.193
hire-0.185
 cuc-0.183
Token.
Token ID13
Feature activation+9.88e-03

Pos logit contributions
chairs+0.146
adders+0.139
 stations+0.136
 ambul+0.133
 cables+0.130
Neg logit contributions
ory-0.076
 che-0.066
by-0.061
 smile-0.059
abby-0.059
Token They
Token ID1119
Feature activation+1.67e-02

Pos logit contributions
 stations+0.017
 ambul+0.016
chairs+0.016
adders+0.016
 emergencies+0.015
Neg logit contributions
ory-0.014
va-0.013
 cuc-0.013
 pineapple-0.012
abby-0.012
Token had
Token ID550
Feature activation-3.32e-02

Pos logit contributions
chairs+0.030
 stations+0.029
 ambul+0.028
 emergencies+0.028
adders+0.028
Neg logit contributions
ory-0.022
hire-0.019
abby-0.019
 cuc-0.019
va-0.018
Token a
Token ID257
Feature activation-4.12e-03

Pos logit contributions
 che+0.047
ory+0.044
 ate+0.044
 cuc+0.044
 pineapple+0.042
Neg logit contributions
chairs-0.055
 911-0.050
adders-0.049
 basement-0.048
 cords-0.047
Token off
Token ID572
Feature activation+3.46e-02

Pos logit contributions
ory+0.014
ka+0.013
hire+0.012
 cuc+0.012
by+0.011
Neg logit contributions
chairs-0.021
 stations-0.020
 Michael-0.020
adders-0.020
 basement-0.020
Token and
Token ID290
Feature activation+1.95e-02

Pos logit contributions
chairs+0.076
adders+0.075
 supervision+0.073
 fault+0.073
 stations+0.071
Neg logit contributions
 smile-0.019
 mineral-0.014
 cre-0.012
 che-0.012
 taste-0.011
Token walked
Token ID6807
Feature activation+6.15e-03

Pos logit contributions
chairs+0.034
 fault+0.033
adders+0.033
oses+0.030
 stations+0.030
Neg logit contributions
 ate-0.021
 che-0.021
 taste-0.021
 swoop-0.020
 smell-0.020
Token around
Token ID1088
Feature activation+3.54e-02

Pos logit contributions
chairs+0.012
 emergencies+0.012
adders+0.011
 ambul+0.011
 supervision+0.011
Neg logit contributions
ory-0.007
ka-0.007
by-0.006
abby-0.006
hire-0.006
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
chairs+0.076
adders+0.068
cases+0.065
ion+0.060
ups+0.059
Neg logit contributions
 che-0.033
 smile-0.033
 cuc-0.030
 mineral-0.030
 smell-0.028
Token Other
Token ID3819
Feature activation+1.54e-01

Pos logit contributions
 stations+0.028
chairs+0.028
 supervision+0.027
 ambul+0.027
 emergencies+0.027
Neg logit contributions
ory-0.011
 cuc-0.011
va-0.010
 pineapple-0.010
aire-0.009
Token people
Token ID661
Feature activation+5.14e-02

Pos logit contributions
 Michael+0.227
 fault+0.226
 basement+0.223
 Ali+0.222
adders+0.220
Neg logit contributions
ory-0.214
ka-0.203
hire-0.175
va-0.172
 che-0.172
Token stayed
Token ID9658
Feature activation+2.03e-02

Pos logit contributions
chairs+0.104
 emergencies+0.101
 stations+0.100
 themselves+0.098
adders+0.098
Neg logit contributions
ory-0.049
 ate-0.043
va-0.043
hire-0.041
abby-0.041
Token on
Token ID319
Feature activation+4.70e-02

Pos logit contributions
chairs+0.037
adders+0.034
 stations+0.032
 Michael+0.032
 ambul+0.031
Neg logit contributions
ory-0.028
ka-0.024
hire-0.024
abby-0.024
 che-0.023
Token the
Token ID262
Feature activation+6.43e-03

Pos logit contributions
adders+0.097
chairs+0.096
ird+0.090
path+0.089
chair+0.082
Neg logit contributions
 che-0.037
 mineral-0.033
 mor-0.031
 caul-0.030
 l-0.030
Token train
Token ID4512
Feature activation+1.02e-01

Pos logit contributions
chairs+0.006
adders+0.006
 themselves+0.005
 Michael+0.005
 Ali+0.005
Neg logit contributions
ory-0.012
hire-0.012
ka-0.012
 cuc-0.012
lder-0.012
Token and
Token ID290
Feature activation+1.49e-02

Pos logit contributions
 emergencies+0.032
 ambul+0.032
 stations+0.032
chairs+0.031
 Rex+0.031
Neg logit contributions
 smile-0.022
hire-0.022
 mineral-0.021
ory-0.021
 cuc-0.021
Token all
Token ID477
Feature activation-1.64e-02

Pos logit contributions
adders+0.030
chairs+0.028
 basement+0.028
 911+0.027
 fault+0.026
Neg logit contributions
 cuc-0.015
 che-0.015
 mineral-0.015
 swoop-0.014
 pineapple-0.014
Token the
Token ID262
Feature activation+6.48e-03

Pos logit contributions
hire+0.019
ka+0.019
ory+0.019
va+0.019
 mineral+0.019
Neg logit contributions
adders-0.028
 emergencies-0.027
 themselves-0.026
 stations-0.026
 basement-0.025
Token big
Token ID1263
Feature activation-3.84e-03

Pos logit contributions
 themselves+0.010
adders+0.010
chairs+0.009
 Michael+0.009
 Rex+0.009
Neg logit contributions
ka-0.009
ory-0.009
 che-0.009
 cuc-0.009
hire-0.008
Token suit
Token ID6050
Feature activation-2.85e-03

Pos logit contributions
 cuc+0.005
ory+0.005
ka+0.005
 smile+0.005
 che+0.005
Neg logit contributions
adders-0.006
 themselves-0.006
chairs-0.006
 Rex-0.005
 Michael-0.005
Tokencases
Token ID33964
Feature activation+1.54e-01

Pos logit contributions
ory+0.003
hire+0.003
ka+0.003
 che+0.003
 ate+0.003
Neg logit contributions
 emergencies-0.005
 basement-0.005
 stations-0.005
 911-0.005
 ambul-0.005
Token.
Token ID13
Feature activation+7.43e-03

Pos logit contributions
chairs+0.251
 Ali+0.243
adders+0.243
 stations+0.240
 Rex+0.240
Neg logit contributions
ory-0.196
hire-0.190
 smile-0.190
 taste-0.188
 cuc-0.183
Token
Token ID220
Feature activation-3.70e-02

Pos logit contributions
 stations+0.016
 ambul+0.015
chairs+0.014
 rails+0.014
adders+0.014
Neg logit contributions
ory-0.009
va-0.008
 cuc-0.008
 mineral-0.007
abby-0.007
Token
Token ID198
Feature activation+1.85e-02

Pos logit contributions
abby+0.038
ory+0.037
va+0.032
emo+0.031
hire+0.030
Neg logit contributions
 stations-0.075
 ambul-0.073
chairs-0.069
 rails-0.068
 umb-0.067
Token
Token ID198
Feature activation+1.84e-02

Pos logit contributions
 stations+0.029
chairs+0.029
 ambul+0.028
adders+0.027
 rails+0.026
Neg logit contributions
ory-0.028
abby-0.028
va-0.027
 cuc-0.026
hire-0.025
TokenBut
Token ID1537
Feature activation+5.11e-02

Pos logit contributions
chairs+0.031
 stations+0.030
 ambul+0.030
adders+0.029
 themselves+0.029
Neg logit contributions
ory-0.025
abby-0.024
va-0.024
hire-0.023
 cuc-0.023
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
chairs+0.152
 stations+0.148
 ambul+0.143
adders+0.141
 emergencies+0.140
Neg logit contributions
ory-0.130
va-0.109
 cuc-0.107
by-0.103
 lobster-0.102
Token Tony
Token ID8832
Feature activation+1.47e-02

Pos logit contributions
chairs+0.016
 stations+0.015
 ambul+0.015
adders+0.015
 emergencies+0.014
Neg logit contributions
ory-0.019
 cuc-0.017
va-0.017
 lobster-0.017
 mineral-0.016
Token grabbed
Token ID13176
Feature activation-1.12e-02

Pos logit contributions
chairs+0.032
 emergencies+0.032
 stations+0.032
 themselves+0.031
adders+0.031
Neg logit contributions
ory-0.015
 cuc-0.012
va-0.012
 che-0.011
lder-0.011
Token the
Token ID262
Feature activation+7.17e-03

Pos logit contributions
 cuc+0.016
ory+0.016
 mineral+0.016
ka+0.016
 pineapple+0.015
Neg logit contributions
 stations-0.018
chairs-0.018
 ambul-0.017
adders-0.017
 emergencies-0.016
Token suit
Token ID6050
Feature activation-2.61e-03

Pos logit contributions
chairs+0.012
 themselves+0.012
adders+0.011
 stations+0.011
 Michael+0.011
Neg logit contributions
 mineral-0.010
ory-0.010
 cuc-0.009
 jewel-0.009
ka-0.009
Tokencases
Token ID33964
Feature activation+1.53e-01

Pos logit contributions
ory+0.003
hire+0.003
abby+0.003
 mineral+0.003
ouse+0.003
Neg logit contributions
 emergencies-0.005
 stations-0.005
 ambul-0.004
 basement-0.004
 earthquakes-0.004
Token and
Token ID290
Feature activation+1.87e-02

Pos logit contributions
chairs+0.261
 emergencies+0.260
adders+0.252
 ambul+0.251
 stations+0.251
Neg logit contributions
ory-0.205
abby-0.185
hire-0.184
 mineral-0.183
 cuc-0.183
Token carried
Token ID5281
Feature activation-1.36e-02

Pos logit contributions
 fault+0.040
chairs+0.038
adders+0.038
 stations+0.038
 emergencies+0.038
Neg logit contributions
 cuc-0.018
ory-0.017
 mineral-0.016
 che-0.016
hire-0.015
Token them
Token ID606
Feature activation+2.09e-02

Pos logit contributions
ory+0.020
ka+0.019
hire+0.019
 mineral+0.018
 cuc+0.018
Neg logit contributions
 ambul-0.022
chairs-0.022
 basement-0.021
 fault-0.021
 emergencies-0.020
Token out
Token ID503
Feature activation+2.80e-02

Pos logit contributions
chairs+0.038
 ambul+0.038
 emergencies+0.038
 stations+0.036
 basement+0.035
Neg logit contributions
ory-0.025
by-0.023
 mineral-0.022
ka-0.021
 cuc-0.019
Token of
Token ID286
Feature activation+5.45e-02

Pos logit contributions
chairs+0.052
 basement+0.051
 stations+0.049
 ambul+0.049
 emergencies+0.047
Neg logit contributions
ory-0.027
 cuc-0.027
ka-0.026
 smile-0.026
 mineral-0.025
Token their
Token ID511
Feature activation+3.03e-02

Pos logit contributions
 mineral+0.013
ory+0.013
 taste+0.013
 peanut+0.013
 che+0.013
Neg logit contributions
chairs-0.029
 outage-0.027
cases-0.026
 cinema-0.026
 stations-0.026
Token families
Token ID4172
Feature activation+3.09e-02

Pos logit contributions
chairs+0.055
 Michael+0.047
 Chloe+0.045
 stations+0.045
adders+0.044
Neg logit contributions
ory-0.042
 mineral-0.042
 cuc-0.039
 smile-0.037
 pineapple-0.036
Token.
Token ID13
Feature activation-5.88e-04

Pos logit contributions
chairs+0.071
adders+0.066
 Ali+0.064
 stations+0.064
 Michael+0.064
Neg logit contributions
ory-0.029
 smile-0.025
 mineral-0.023
 cuc-0.022
 trade-0.020
Token They
Token ID1119
Feature activation-1.96e-03

Pos logit contributions
ory+0.001
va+0.000
Thank+0.000
lder+0.000
aire+0.000
Neg logit contributions
chairs-0.001
 stations-0.001
adders-0.001
 beds-0.001
 ambul-0.001
Token call
Token ID869
Feature activation+4.30e-02

Pos logit contributions
ory+0.003
abby+0.003
va+0.003
hire+0.003
 cuc+0.003
Neg logit contributions
 stations-0.003
chairs-0.003
 themselves-0.003
adders-0.003
 Rex-0.003
Token 911
Token ID16679
Feature activation+1.53e-01

Pos logit contributions
chairs+0.078
path+0.063
 fault+0.062
ird+0.062
liv+0.062
Neg logit contributions
 smile-0.051
 taste-0.048
ory-0.047
 hug-0.046
 share-0.044
Token.
Token ID13
Feature activation-1.05e-03

Pos logit contributions
chairs+0.302
liv+0.291
 emergencies+0.277
 basement+0.274
adders+0.272
Neg logit contributions
ho-0.123
y-0.116
 ring-0.112
ory-0.108
 smile-0.106
Token They
Token ID1119
Feature activation-2.69e-03

Pos logit contributions
ory+0.001
Thank+0.001
va+0.001
 pineapple+0.001
�+0.000
Neg logit contributions
 stations-0.002
chairs-0.002
adders-0.002
 beds-0.002
 basement-0.002
Token hope
Token ID2911
Feature activation+2.25e-02

Pos logit contributions
ory+0.003
 cuc+0.003
hire+0.003
abby+0.003
 mineral+0.003
Neg logit contributions
chairs-0.005
 stations-0.005
 themselves-0.005
adders-0.005
 ambul-0.005
Token it
Token ID340
Feature activation+4.99e-03

Pos logit contributions
chairs+0.045
 cinema+0.040
hole+0.040
ays+0.039
 basement+0.039
Neg logit contributions
 taste-0.021
 smell-0.018
 take-0.017
 took-0.016
ory-0.016
Token is
Token ID318
Feature activation+3.53e-02

Pos logit contributions
chairs+0.010
 Michael+0.009
adders+0.009
 themselves+0.009
 emergencies+0.009
Neg logit contributions
 taste-0.004
ory-0.004
 che-0.004
hire-0.004
 lick-0.004
Token running
Token ID2491
Feature activation+4.45e-02

Pos logit contributions
 stations+0.097
chairs+0.095
 emergencies+0.095
 Rex+0.094
 themselves+0.094
Neg logit contributions
ory-0.051
 smile-0.046
 ate-0.044
va-0.044
abby-0.042
Token around
Token ID1088
Feature activation+3.29e-02

Pos logit contributions
 emergencies+0.089
 ambul+0.080
adders+0.079
chairs+0.078
 OK+0.077
Neg logit contributions
ka-0.045
ory-0.043
by-0.043
 smile-0.040
 swoop-0.035
Token with
Token ID351
Feature activation-1.30e-02

Pos logit contributions
chairs+0.063
adders+0.062
 fault+0.056
 Otherwise+0.054
cases+0.054
Neg logit contributions
 smile-0.036
 che-0.031
 smell-0.030
 taste-0.030
 ve-0.029
Token their
Token ID511
Feature activation+3.61e-02

Pos logit contributions
 smile+0.017
 taste+0.014
 smell+0.014
 juicy+0.013
 l+0.012
Neg logit contributions
chairs-0.024
adders-0.022
oses-0.022
 basement-0.021
 ambul-0.021
Token suit
Token ID6050
Feature activation-3.90e-03

Pos logit contributions
chairs+0.057
adders+0.054
 themselves+0.052
 Michael+0.052
 stations+0.051
Neg logit contributions
ory-0.050
 smile-0.049
by-0.049
ka-0.044
 taste-0.043
Tokencases
Token ID33964
Feature activation+1.51e-01

Pos logit contributions
 smile+0.004
ory+0.004
ouse+0.004
 complexion+0.004
 taste+0.004
Neg logit contributions
 emergencies-0.007
 stations-0.007
 basement-0.007
adders-0.007
 911-0.007
Token looked
Token ID3114
Feature activation-4.73e-02

Pos logit contributions
adders+0.277
chairs+0.274
 emergencies+0.273
 stations+0.268
 ambul+0.267
Neg logit contributions
 smile-0.194
ory-0.185
 taste-0.178
 mineral-0.163
 token-0.162
Token like
Token ID588
Feature activation+6.35e-03

Pos logit contributions
ory+0.064
ka+0.054
 smile+0.054
 cuc+0.054
 mineral+0.053
Neg logit contributions
chairs-0.087
adders-0.084
 Ali-0.081
 stations-0.081
 ambul-0.081
Token tiny
Token ID7009
Feature activation+6.49e-04

Pos logit contributions
chairs+0.011
adders+0.010
 911+0.009
oses+0.009
 basement+0.009
Neg logit contributions
 cuc-0.008
 jewel-0.008
 prince-0.007
 mineral-0.007
 ruby-0.007
Token ants
Token ID27842
Feature activation+4.62e-02

Pos logit contributions
adders+0.001
 themselves+0.001
chairs+0.001
 Ali+0.001
 basement+0.001
Neg logit contributions
ory-0.001
ka-0.001
 cuc-0.001
 mineral-0.001
hire-0.001
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
adders+0.082
 stations+0.080
 911+0.080
 basement+0.080
 emergencies+0.078
Neg logit contributions
ory-0.055
 mineral-0.055
 che-0.054
 cuc-0.052
 smile-0.051
Token tall
Token ID7331
Feature activation-6.08e-03

Pos logit contributions
 cuc+0.019
 jewel+0.017
 che+0.017
 ruby+0.016
 mineral+0.016
Neg logit contributions
chairs-0.030
 Michael-0.028
adders-0.028
 stations-0.027
oses-0.027
Token towers
Token ID18028
Feature activation+6.09e-02

Pos logit contributions
ory+0.011
 cuc+0.011
hire+0.011
ka+0.011
va+0.010
Neg logit contributions
chairs-0.007
 themselves-0.007
 stations-0.007
 Michael-0.007
adders-0.006
Token and
Token ID290
Feature activation+1.67e-02

Pos logit contributions
 emergencies+0.116
 911+0.114
 ambul+0.113
 Michael+0.112
 stations+0.111
Neg logit contributions
ory-0.067
hire-0.063
 cuc-0.061
ka-0.060
abby-0.059
Token winding
Token ID28967
Feature activation+6.19e-02

Pos logit contributions
chairs+0.021
 stations+0.020
adders+0.019
 themselves+0.019
 Michael+0.019
Neg logit contributions
ory-0.029
 cuc-0.029
hire-0.028
va-0.028
abby-0.027
Token stair
Token ID18761
Feature activation+7.53e-02

Pos logit contributions
chairs+0.066
 Michael+0.063
 ambul+0.063
adders+0.061
 emergencies+0.061
Neg logit contributions
ory-0.115
 cuc-0.115
ka-0.113
hire-0.113
 mineral-0.108
Tokencases
Token ID33964
Feature activation+1.51e-01

Pos logit contributions
 emergencies+0.125
 ambul+0.124
 Michael+0.123
 themselves+0.118
 911+0.117
Neg logit contributions
ka-0.094
ory-0.091
 cuc-0.090
 swoop-0.087
 che-0.084
Token but
Token ID475
Feature activation+4.78e-02

Pos logit contributions
chairs+0.229
 ambul+0.225
 Michael+0.220
 emergencies+0.218
 stations+0.215
Neg logit contributions
ory-0.216
 cuc-0.210
hire-0.210
 mineral-0.199
abby-0.197
Token most
Token ID749
Feature activation+7.31e-02

Pos logit contributions
 stations+0.080
adders+0.080
chairs+0.076
oses+0.075
 emergencies+0.073
Neg logit contributions
 jewel-0.054
 taste-0.053
 ruby-0.053
 mineral-0.051
 che-0.051
Token of
Token ID286
Feature activation+5.17e-02

Pos logit contributions
 themselves+0.135
 stations+0.126
 Michael+0.125
 911+0.122
 Rex+0.121
Neg logit contributions
 mineral-0.090
va-0.089
 cuc-0.089
ory-0.088
hire-0.085
Token all
Token ID477
Feature activation-1.89e-02

Pos logit contributions
chairs+0.076
 stations+0.069
 ambul+0.068
adders+0.067
 Michael+0.067
Neg logit contributions
 cuc-0.082
ory-0.080
 mineral-0.078
hire-0.078
va-0.077
Token she
Token ID673
Feature activation+2.80e-02

Pos logit contributions
 taste+0.022
hire+0.020
 cuc+0.020
 smell+0.019
 mineral+0.019
Neg logit contributions
 stations-0.031
 ambul-0.031
 Rex-0.031
adders-0.030
 themselves-0.030
Token big
Token ID1263
Feature activation-2.02e-02

Pos logit contributions
 cuc+0.025
 mineral+0.024
ory+0.024
 pineapple+0.022
va+0.022
Neg logit contributions
 themselves-0.040
adders-0.040
chairs-0.039
 stations-0.036
 ambul-0.035
Token belt
Token ID10999
Feature activation+1.96e-02

Pos logit contributions
ory+0.028
 cuc+0.027
 mineral+0.026
 pineapple+0.025
 ruby+0.024
Neg logit contributions
 themselves-0.035
adders-0.035
chairs-0.035
 stations-0.032
 ambul-0.031
Token.
Token ID13
Feature activation-5.47e-03

Pos logit contributions
 emergencies+0.036
 ambul+0.035
 themselves+0.034
 stations+0.034
 earthquakes+0.034
Neg logit contributions
hire-0.020
ory-0.019
abby-0.019
 token-0.018
ka-0.018
Token The
Token ID383
Feature activation+1.07e-02

Pos logit contributions
ory+0.005
va+0.005
 cuc+0.005
 pineapple+0.005
 mineral+0.005
Neg logit contributions
 stations-0.011
 ambul-0.011
 emergencies-0.010
chairs-0.010
adders-0.010
Token suit
Token ID6050
Feature activation-1.80e-02

Pos logit contributions
 themselves+0.018
adders+0.017
chairs+0.017
 Michael+0.016
 together+0.015
Neg logit contributions
ory-0.014
hire-0.014
 cuc-0.014
 mineral-0.014
 che-0.013
Tokencases
Token ID33964
Feature activation+1.51e-01

Pos logit contributions
ka+0.018
ory+0.017
hire+0.017
va+0.017
abby+0.016
Neg logit contributions
 emergencies-0.035
 stations-0.035
adders-0.035
 themselves-0.034
 911-0.034
Token went
Token ID1816
Feature activation-4.91e-03

Pos logit contributions
chairs+0.268
adders+0.263
 stations+0.262
 emergencies+0.262
 ambul+0.258
Neg logit contributions
ory-0.171
hire-0.168
 smile-0.164
abby-0.160
 taste-0.159
Token into
Token ID656
Feature activation-1.43e-03

Pos logit contributions
ory+0.007
ka+0.007
abby+0.007
hire+0.007
 cuc+0.007
Neg logit contributions
adders-0.007
chairs-0.007
 themselves-0.007
 stations-0.007
 ambul-0.007
Token the
Token ID262
Feature activation-4.17e-03

Pos logit contributions
 cuc+0.002
 mineral+0.002
 che+0.002
 pineapple+0.002
 ruby+0.002
Neg logit contributions
chairs-0.003
adders-0.002
 fault-0.002
cases-0.002
ained-0.002
Token plane
Token ID6614
Feature activation+7.93e-02

Pos logit contributions
ory+0.006
 mineral+0.006
 che+0.006
 cuc+0.006
ka+0.006
Neg logit contributions
adders-0.007
 themselves-0.006
chairs-0.006
oses-0.006
 ambul-0.005
Token.
Token ID13
Feature activation+4.62e-03

Pos logit contributions
 emergencies+0.166
adders+0.160
chairs+0.150
 ambul+0.150
 911+0.147
Neg logit contributions
 smile-0.085
 swoop-0.078
by-0.076
ory-0.076
app-0.072
Token rest
Token ID1334
Feature activation+4.31e-02

Pos logit contributions
chairs+0.038
adders+0.038
 stations+0.036
 emergencies+0.035
 ambul+0.035
Neg logit contributions
 cuc-0.015
 mineral-0.015
ory-0.014
 lick-0.014
 che-0.014
Token in
Token ID287
Feature activation-2.64e-02

Pos logit contributions
chairs+0.092
adders+0.078
 stations+0.077
 ambul+0.076
 rails+0.076
Neg logit contributions
ory-0.046
 cuc-0.046
 mineral-0.044
abby-0.041
va-0.040
Token the
Token ID262
Feature activation+4.91e-03

Pos logit contributions
 mineral+0.027
 ruby+0.025
 che+0.024
 smile+0.024
 cuc+0.023
Neg logit contributions
chairs-0.056
cases-0.055
adders-0.055
oses-0.050
 paramedics-0.049
Token spot
Token ID4136
Feature activation+1.01e-02

Pos logit contributions
chairs+0.008
adders+0.007
 ambul+0.007
 themselves+0.007
 stations+0.007
Neg logit contributions
ory-0.007
 mineral-0.007
 cuc-0.007
va-0.007
hire-0.007
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
chairs+0.020
 ambul+0.019
 emergencies+0.018
 Michael+0.018
 Kenny+0.018
Neg logit contributions
ory-0.010
 cuc-0.010
 smile-0.010
abby-0.010
va-0.009
Token Other
Token ID3819
Feature activation+1.50e-01

Pos logit contributions
chairs+0.022
 stations+0.022
 emergencies+0.021
adders+0.021
 ambul+0.021
Neg logit contributions
 cuc-0.013
 pineapple-0.012
ory-0.012
 mineral-0.011
 cabbage-0.011
Token times
Token ID1661
Feature activation+1.77e-02

Pos logit contributions
chairs+0.213
adders+0.203
 Michael+0.203
 stations+0.199
 basement+0.197
Neg logit contributions
ory-0.260
 cuc-0.229
va-0.224
ka-0.223
 mineral-0.223
Token it
Token ID340
Feature activation+1.11e-02

Pos logit contributions
chairs+0.030
 stations+0.029
 emergencies+0.028
 ambul+0.027
 rails+0.026
Neg logit contributions
ory-0.022
 cuc-0.022
 mineral-0.021
va-0.021
hire-0.020
Token was
Token ID373
Feature activation+4.13e-02

Pos logit contributions
chairs+0.022
 stations+0.022
 themselves+0.022
 emergencies+0.022
adders+0.021
Neg logit contributions
ory-0.012
 cuc-0.011
 che-0.011
va-0.010
hire-0.010
Token a
Token ID257
Feature activation-2.72e-03

Pos logit contributions
chairs+0.077
 stations+0.076
adders+0.072
 ambul+0.067
 basement+0.066
Neg logit contributions
 che-0.050
ory-0.049
 cuc-0.049
 sly-0.049
abby-0.048
Token squirrel
Token ID33039
Feature activation-3.49e-02

Pos logit contributions
ory+0.004
 cuc+0.003
va+0.003
hire+0.003
abby+0.003
Neg logit contributions
chairs-0.005
 stations-0.005
 themselves-0.005
adders-0.005
 ambul-0.004
Token hurt
Token ID5938
Feature activation+1.63e-02

Pos logit contributions
chairs+0.052
 stations+0.052
 ambul+0.047
 rails+0.046
 tracks+0.046
Neg logit contributions
ory-0.046
hire-0.046
abby-0.045
 cuc-0.044
 che-0.044
Token badly
Token ID11234
Feature activation-5.04e-03

Pos logit contributions
chairs+0.036
 stations+0.035
 rails+0.034
 cinema+0.034
adders+0.033
Neg logit contributions
ory-0.014
 che-0.012
hire-0.011
ka-0.011
abby-0.011
Token.
Token ID13
Feature activation+1.08e-03

Pos logit contributions
ory+0.007
hire+0.007
 che+0.007
 cuc+0.006
 mineral+0.006
Neg logit contributions
chairs-0.008
 stations-0.008
adders-0.008
 basement-0.007
 Michael-0.007
Token She
Token ID1375
Feature activation-3.22e-02

Pos logit contributions
 stations+0.002
chairs+0.002
 ambul+0.002
adders+0.002
 rails+0.002
Neg logit contributions
ory-0.001
va-0.001
 cuc-0.001
hire-0.001
 pineapple-0.001
Token called
Token ID1444
Feature activation+3.29e-02

Pos logit contributions
ory+0.048
 cuc+0.044
abby+0.044
hire+0.044
va+0.044
Neg logit contributions
chairs-0.051
 stations-0.049
 themselves-0.048
 ambul-0.048
adders-0.047
Token 911
Token ID16679
Feature activation+1.50e-01

Pos logit contributions
chairs+0.059
 stations+0.054
 ambul+0.051
adders+0.048
 rails+0.047
Neg logit contributions
ory-0.041
 mineral-0.040
 cuc-0.039
 taste-0.037
 lobster-0.037
Token and
Token ID290
Feature activation+9.25e-03

Pos logit contributions
chairs+0.277
adders+0.262
 emergencies+0.259
 stations+0.254
 basement+0.254
Neg logit contributions
ho-0.144
y-0.132
 smile-0.130
ory-0.128
ka-0.125
Token waited
Token ID13488
Feature activation+1.51e-02

Pos logit contributions
chairs+0.019
 stations+0.019
adders+0.018
 themselves+0.018
 rails+0.017
Neg logit contributions
 che-0.009
 cuc-0.009
 ate-0.008
ory-0.008
 pineapple-0.008
Token for
Token ID329
Feature activation+3.69e-02

Pos logit contributions
chairs+0.035
 stations+0.032
 ambul+0.030
 Michael+0.030
 basement+0.029
Neg logit contributions
ory-0.012
 smile-0.012
 cuc-0.012
 slur-0.010
 che-0.009
Token the
Token ID262
Feature activation-4.53e-04

Pos logit contributions
chairs+0.074
 stations+0.066
adders+0.065
 ambul+0.061
 basement+0.060
Neg logit contributions
ory-0.046
 cuc-0.040
 pineapple-0.040
 taste-0.039
 che-0.038
Token ambulance
Token ID22536
Feature activation+8.69e-02

Pos logit contributions
 taste+0.001
 cuc+0.001
 mineral+0.001
 lick+0.001
ory+0.001
Neg logit contributions
chairs-0.001
 themselves-0.001
 stations-0.001
 Michael-0.001
adders-0.001
Token He
Token ID679
Feature activation-2.39e-02

Pos logit contributions
adders+0.028
 stations+0.027
chairs+0.025
 themselves+0.024
 basement+0.024
Neg logit contributions
ory-0.021
 mineral-0.018
 cuc-0.017
va-0.017
 taste-0.016
Token put
Token ID1234
Feature activation-2.33e-02

Pos logit contributions
ory+0.025
va+0.024
abby+0.022
 cuc+0.022
 che+0.021
Neg logit contributions
chairs-0.048
 stations-0.048
 themselves-0.047
adders-0.047
 ambul-0.046
Token all
Token ID477
Feature activation-1.86e-02

Pos logit contributions
ka+0.035
 mineral+0.033
ory+0.033
 cuc+0.032
hire+0.031
Neg logit contributions
chairs-0.036
 stations-0.035
 ambul-0.035
adders-0.034
 emergencies-0.033
Token the
Token ID262
Feature activation+7.62e-03

Pos logit contributions
ka+0.025
 mineral+0.023
ory+0.023
abby+0.022
va+0.021
Neg logit contributions
 stations-0.030
adders-0.030
chairs-0.029
 ambul-0.029
 emergencies-0.029
Token suit
Token ID6050
Feature activation-3.89e-03

Pos logit contributions
chairs+0.011
 Rex+0.011
 themselves+0.011
 Ali+0.011
adders+0.011
Neg logit contributions
ory-0.012
 mineral-0.012
 cuc-0.011
ka-0.011
hire-0.011
Tokencases
Token ID33964
Feature activation+1.49e-01

Pos logit contributions
ka+0.005
hire+0.005
ory+0.005
ouse+0.005
abby+0.005
Neg logit contributions
 emergencies-0.006
 stations-0.006
 ambul-0.006
 Rex-0.006
 Ali-0.006
Token in
Token ID287
Feature activation-2.46e-02

Pos logit contributions
chairs+0.258
 Ali+0.256
 Rex+0.251
 ambul+0.251
 emergencies+0.250
Neg logit contributions
ory-0.181
 mineral-0.174
 smile-0.173
hire-0.167
 taste-0.164
Token the
Token ID262
Feature activation+8.60e-03

Pos logit contributions
 mineral+0.030
 che+0.026
 cuc+0.025
ory+0.025
 ruby+0.025
Neg logit contributions
chairs-0.048
adders-0.048
oses-0.040
chair-0.038
 hurricanes-0.038
Token car
Token ID1097
Feature activation+8.91e-02

Pos logit contributions
chairs+0.013
adders+0.012
 themselves+0.012
 Mark+0.012
 Ali+0.012
Neg logit contributions
ory-0.013
 cuc-0.013
 mineral-0.013
ka-0.012
hire-0.012
Token,
Token ID11
Feature activation+8.90e-03

Pos logit contributions
 emergencies+0.177
 hurricanes+0.166
 earthquakes+0.166
 ambul+0.163
adders+0.163
Neg logit contributions
ory-0.074
ouse-0.070
 smile-0.068
 mineral-0.068
 trade-0.066
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
chairs+0.016
adders+0.016
 fault+0.014
 themselves+0.014
 stations+0.014
Neg logit contributions
 cuc-0.011
 che-0.011
 mineral-0.010
 smile-0.010
 lad-0.010
Token down
Token ID866
Feature activation+1.10e-02

Pos logit contributions
ory+0.023
 mineral+0.018
ka+0.018
abby+0.017
by+0.016
Neg logit contributions
 stations-0.035
adders-0.035
chairs-0.035
 emergencies-0.035
 ambul-0.034
Token on
Token ID319
Feature activation+3.86e-02

Pos logit contributions
 stations+0.020
chairs+0.020
 Michael+0.019
 ambul+0.019
adders+0.019
Neg logit contributions
ory-0.013
 cuc-0.012
 mineral-0.012
ka-0.011
abby-0.011
Token the
Token ID262
Feature activation+2.40e-04

Pos logit contributions
adders+0.081
chairs+0.080
 stations+0.075
cases+0.074
oses+0.070
Neg logit contributions
 mineral-0.039
 cuc-0.039
 che-0.038
 caul-0.036
 ruby-0.036
Token path
Token ID3108
Feature activation+2.12e-02

Pos logit contributions
chairs+0.000
adders+0.000
 themselves+0.000
 ambul+0.000
 stations+0.000
Neg logit contributions
ory-0.000
 mineral-0.000
 cuc-0.000
hire-0.000
 ruby-0.000
Token.
Token ID13
Feature activation+2.34e-03

Pos logit contributions
 emergencies+0.044
 ambul+0.042
 stations+0.041
chairs+0.040
 Michael+0.039
Neg logit contributions
ory-0.019
abby-0.017
ho-0.017
 smile-0.017
 cuc-0.017
Token Other
Token ID3819
Feature activation+1.49e-01

Pos logit contributions
 stations+0.004
 ambul+0.004
adders+0.004
chairs+0.004
 emergencies+0.004
Neg logit contributions
ory-0.003
va-0.003
 cuc-0.003
hire-0.003
 lobster-0.003
Token animals
Token ID4695
Feature activation+5.19e-02

Pos logit contributions
chairs+0.211
adders+0.207
 themselves+0.205
 basement+0.205
 stations+0.204
Neg logit contributions
ory-0.252
 cuc-0.226
va-0.223
hire-0.220
ka-0.219
Token saw
Token ID2497
Feature activation+4.10e-02

Pos logit contributions
chairs+0.102
 stations+0.100
 emergencies+0.099
 ambul+0.095
 basement+0.094
Neg logit contributions
ory-0.062
 cuc-0.053
 swoop-0.051
 mineral-0.050
hire-0.050
Token the
Token ID262
Feature activation+1.96e-03

Pos logit contributions
chairs+0.072
 basement+0.068
adders+0.065
 stations+0.065
 emergencies+0.063
Neg logit contributions
 cuc-0.058
ory-0.053
 lobster-0.050
 pineapple-0.049
 beetle-0.048
Token fox
Token ID21831
Feature activation+7.63e-03

Pos logit contributions
chairs+0.004
adders+0.004
 themselves+0.004
 Michael+0.004
 stations+0.004
Neg logit contributions
ory-0.002
 cuc-0.002
hire-0.002
 mineral-0.002
va-0.002
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
 stations+0.014
adders+0.014
 basement+0.013
 emergencies+0.013
chairs+0.013
Neg logit contributions
ory-0.010
 che-0.010
 smile-0.009
 ate-0.009
 mineral-0.009
Token a
Token ID257
Feature activation-2.43e-03

Pos logit contributions
 cuc+0.006
ory+0.005
hire+0.005
 pineapple+0.005
 mineral+0.005
Neg logit contributions
chairs-0.007
 stations-0.007
 Michael-0.006
 ambul-0.006
adders-0.006
Token lot
Token ID1256
Feature activation-3.55e-03

Pos logit contributions
ory+0.004
abby+0.004
hire+0.004
ka+0.004
va+0.003
Neg logit contributions
 tracks-0.003
chairs-0.003
 shelter-0.003
 beds-0.003
 basement-0.003
Token of
Token ID286
Feature activation+5.18e-02

Pos logit contributions
ory+0.005
 cuc+0.005
abby+0.005
va+0.005
hire+0.005
Neg logit contributions
 Michael-0.006
 ambul-0.006
 stations-0.005
chairs-0.005
 emergencies-0.005
Token food
Token ID2057
Feature activation+1.62e-02

Pos logit contributions
chairs+0.073
 stations+0.070
 themselves+0.069
 ambul+0.068
adders+0.068
Neg logit contributions
ory-0.087
abby-0.081
hire-0.080
va-0.080
 cuc-0.079
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
chairs+0.031
 ambul+0.030
 stations+0.030
 Michael+0.030
 emergencies+0.028
Neg logit contributions
 cuc-0.021
ory-0.020
 mineral-0.018
 pineapple-0.018
 jewel-0.018
Token Other
Token ID3819
Feature activation+1.48e-01

Pos logit contributions
 stations+0.029
chairs+0.028
adders+0.028
 ambul+0.028
 emergencies+0.027
Neg logit contributions
 cuc-0.018
 pineapple-0.017
 mineral-0.017
ory-0.017
va-0.016
Token birds
Token ID10087
Feature activation+2.95e-02

Pos logit contributions
chairs+0.198
 Michael+0.187
 stations+0.186
 themselves+0.183
adders+0.180
Neg logit contributions
ory-0.257
 cuc-0.241
hire-0.236
va-0.234
ka-0.230
Token saw
Token ID2497
Feature activation+3.84e-02

Pos logit contributions
chairs+0.050
 stations+0.048
 themselves+0.046
adders+0.046
 Michael+0.046
Neg logit contributions
ory-0.041
 cuc-0.038
hire-0.038
abby-0.037
va-0.036
Token the
Token ID262
Feature activation+2.80e-03

Pos logit contributions
chairs+0.066
 stations+0.062
adders+0.061
 basement+0.060
 Michael+0.060
Neg logit contributions
 cuc-0.057
lder-0.050
ory-0.049
 pineapple-0.049
hire-0.048
Token fat
Token ID3735
Feature activation-6.16e-04

Pos logit contributions
chairs+0.005
 Michael+0.004
adders+0.004
 stations+0.004
 themselves+0.004
Neg logit contributions
ory-0.004
 cuc-0.004
hire-0.004
va-0.004
abby-0.004
Token bird
Token ID6512
Feature activation+9.62e-03

Pos logit contributions
ory+0.001
 cuc+0.001
hire+0.001
va+0.001
abby+0.001
Neg logit contributions
chairs-0.001
 stations-0.001
 themselves-0.001
adders-0.001
 Michael-0.001
Token making
Token ID1642
Feature activation+2.10e-02

Pos logit contributions
chairs+0.023
adders+0.022
 stations+0.021
 ambul+0.020
 basement+0.019
Neg logit contributions
 smile-0.019
ory-0.018
 che-0.018
 cuc-0.017
 mineral-0.017
Token a
Token ID257
Feature activation-7.23e-03

Pos logit contributions
chairs+0.038
adders+0.037
path+0.033
 ambul+0.033
chair+0.032
Neg logit contributions
 taste-0.026
by-0.023
 gu-0.023
 smell-0.023
 che-0.023
Token big
Token ID1263
Feature activation-7.48e-03

Pos logit contributions
ory+0.010
 cuc+0.009
va+0.009
hire+0.009
abby+0.009
Neg logit contributions
chairs-0.013
adders-0.012
 stations-0.012
 themselves-0.012
 ambul-0.012
Token fuss
Token ID34297
Feature activation+8.14e-03

Pos logit contributions
ory+0.010
 cuc+0.009
va+0.009
hire+0.009
abby+0.009
Neg logit contributions
chairs-0.013
adders-0.013
 stations-0.013
 themselves-0.013
 ambul-0.012
Token.
Token ID13
Feature activation+5.97e-03

Pos logit contributions
 emergencies+0.015
chairs+0.014
 basement+0.014
 ambul+0.014
 stations+0.014
Neg logit contributions
hire-0.008
ory-0.008
va-0.007
ka-0.007
 taste-0.007
Token Other
Token ID3819
Feature activation+1.47e-01

Pos logit contributions
 stations+0.012
chairs+0.011
 ambul+0.011
 emergencies+0.011
adders+0.011
Neg logit contributions
va-0.007
ory-0.007
 cuc-0.006
 pineapple-0.006
aire-0.006
Token customers
Token ID4297
Feature activation+2.49e-03

Pos logit contributions
 basement+0.245
adders+0.232
chairs+0.231
 Michael+0.226
 themselves+0.221
Neg logit contributions
ory-0.216
va-0.188
ka-0.184
 cuc-0.167
 pineapple-0.167
Token in
Token ID287
Feature activation-2.79e-02

Pos logit contributions
chairs+0.005
 stations+0.005
 tracks+0.005
 emergencies+0.005
 rails+0.005
Neg logit contributions
ory-0.003
va-0.003
abby-0.003
hire-0.002
 smile-0.002
Token the
Token ID262
Feature activation+1.01e-03

Pos logit contributions
ory+0.034
 cuc+0.033
 pineapple+0.033
 smile+0.032
 che+0.032
Neg logit contributions
chairs-0.048
adders-0.047
 stations-0.043
cases-0.041
 ambul-0.040
Token store
Token ID3650
Feature activation-1.01e-02

Pos logit contributions
adders+0.002
 themselves+0.002
chairs+0.002
 stations+0.001
 Michael+0.001
Neg logit contributions
ory-0.002
 cuc-0.001
va-0.001
hire-0.001
abby-0.001
Token started
Token ID2067
Feature activation+2.33e-02

Pos logit contributions
ory+0.013
va+0.011
 smile+0.010
abby+0.010
lder+0.010
Neg logit contributions
chairs-0.019
adders-0.019
 stations-0.018
 emergencies-0.018
 cords-0.017
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
ory+0.005
 mineral+0.005
 cuc+0.005
 che+0.005
hire+0.004
Neg logit contributions
chairs-0.005
adders-0.005
 stations-0.005
 ambul-0.005
 Michael-0.005
Token not
Token ID407
Feature activation-7.21e-04

Pos logit contributions
chairs+0.021
 themselves+0.021
 stations+0.020
 Rex+0.020
 tracks+0.020
Neg logit contributions
ory-0.021
abby-0.020
 cuc-0.020
hire-0.019
va-0.019
Token miserable
Token ID22444
Feature activation+4.02e-02

Pos logit contributions
ory+0.001
abby+0.001
hire+0.001
 cuc+0.001
va+0.001
Neg logit contributions
chairs-0.001
 stations-0.001
 ambul-0.001
 themselves-0.001
 Michael-0.001
Token anymore
Token ID7471
Feature activation+8.99e-02

Pos logit contributions
chairs+0.043
 stations+0.043
adders+0.043
 rails+0.042
 ambul+0.040
Neg logit contributions
ory-0.078
 mineral-0.072
 cuc-0.072
hire-0.072
 che-0.071
Token.
Token ID13
Feature activation+7.89e-03

Pos logit contributions
 stations+0.156
chairs+0.153
adders+0.151
 ambul+0.146
 basement+0.143
Neg logit contributions
ory-0.121
 mineral-0.114
 cuc-0.110
hire-0.110
 che-0.109
Token Other
Token ID3819
Feature activation+1.46e-01

Pos logit contributions
 stations+0.010
chairs+0.010
 ambul+0.010
 themselves+0.010
adders+0.009
Neg logit contributions
ory-0.014
va-0.013
abby-0.013
 cuc-0.013
hire-0.013
Token kids
Token ID3988
Feature activation+4.06e-02

Pos logit contributions
chairs+0.211
 Michael+0.204
 stations+0.199
 themselves+0.194
adders+0.192
Neg logit contributions
ory-0.236
ka-0.215
 cuc-0.215
hire-0.215
va-0.214
Token heard
Token ID2982
Feature activation+4.45e-02

Pos logit contributions
chairs+0.079
 stations+0.076
adders+0.074
 emergencies+0.074
 ambul+0.074
Neg logit contributions
ory-0.049
 cuc-0.041
va-0.041
hire-0.041
abby-0.040
Token Tom
Token ID4186
Feature activation+5.46e-03

Pos logit contributions
adders+0.088
 basement+0.087
 loft+0.082
chairs+0.081
 fault+0.080
Neg logit contributions
 pineapple-0.041
 taste-0.038
 smile-0.037
 che-0.037
ory-0.036
Token whistle
Token ID16121
Feature activation+3.29e-03

Pos logit contributions
chairs+0.008
adders+0.008
 stations+0.008
 basement+0.008
 ambul+0.007
Neg logit contributions
ory-0.009
hire-0.008
 cuc-0.008
 che-0.008
va-0.008
Token and
Token ID290
Feature activation+9.11e-03

Pos logit contributions
adders+0.006
chairs+0.006
 emergencies+0.006
 ambul+0.005
 stations+0.005
Neg logit contributions
 smile-0.004
ory-0.004
hire-0.004
 che-0.004
 taste-0.004
Token like
Token ID588
Feature activation-3.68e-02

Pos logit contributions
 mineral+0.032
ory+0.032
 taste+0.031
 cuc+0.031
 ruby+0.028
Neg logit contributions
 stations-0.040
chairs-0.040
 Michael-0.039
 ambul-0.039
 Rex-0.038
Token lettuce
Token ID39406
Feature activation-1.12e-01

Pos logit contributions
 cuc+0.055
 mineral+0.052
 pineapple+0.050
 ruby+0.048
ory+0.047
Neg logit contributions
chairs-0.059
adders-0.057
 Michael-0.053
 basement-0.053
 stations-0.052
Token,
Token ID11
Feature activation-3.29e-02

Pos logit contributions
ory+0.184
 cuc+0.183
 mineral+0.165
hire+0.162
 pineapple+0.161
Neg logit contributions
chairs-0.175
 stations-0.174
 Michael-0.173
 ambul-0.165
adders-0.161
Token tomatoes
Token ID23972
Feature activation-7.92e-02

Pos logit contributions
 cuc+0.040
ory+0.036
 pineapple+0.034
 mineral+0.034
hire+0.032
Neg logit contributions
chairs-0.068
 stations-0.066
 Michael-0.065
adders-0.065
 basement-0.064
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
 cuc+0.126
ory+0.124
 pineapple+0.110
 mineral+0.110
hire+0.110
Neg logit contributions
chairs-0.127
 stations-0.126
 Michael-0.124
 ambul-0.120
 emergencies-0.116
Token cuc
Token ID38421
Feature activation-2.13e-01

Pos logit contributions
ory+0.025
 cuc+0.024
hire+0.022
va+0.021
abby+0.021
Neg logit contributions
chairs-0.052
 stations-0.051
adders-0.050
 Michael-0.049
 ambul-0.049
Tokenumbers
Token ID17024
Feature activation-2.54e-02

Pos logit contributions
va+0.318
ory+0.311
 mineral+0.307
lder+0.300
abby+0.299
Neg logit contributions
adders-0.286
 themselves-0.285
chairs-0.278
 stations-0.267
 together-0.258
Token.
Token ID13
Feature activation-1.79e-02

Pos logit contributions
ory+0.041
 cuc+0.039
hire+0.037
abby+0.036
 pineapple+0.035
Neg logit contributions
chairs-0.042
 stations-0.040
 Michael-0.040
 ambul-0.039
 basement-0.039
Token He
Token ID679
Feature activation-5.54e-02

Pos logit contributions
 cuc+0.022
ory+0.022
va+0.020
 pineapple+0.020
 cabbage+0.019
Neg logit contributions
 stations-0.034
chairs-0.033
 ambul-0.032
 emergencies-0.031
 basement-0.031
Token smiled
Token ID13541
Feature activation-6.04e-02

Pos logit contributions
ory+0.073
 cuc+0.068
hire+0.066
abby+0.066
va+0.066
Neg logit contributions
chairs-0.097
 stations-0.094
 themselves-0.092
 ambul-0.091
adders-0.091
Token as
Token ID355
Feature activation-9.15e-02

Pos logit contributions
ory+0.115
 cuc+0.108
hire+0.107
abby+0.103
va+0.101
Neg logit contributions
chairs-0.081
 stations-0.076
adders-0.074
 ambul-0.073
 emergencies-0.071
Token John
Token ID1757
Feature activation+2.48e-02

Pos logit contributions
ory+0.006
 cuc+0.005
hire+0.005
va+0.005
abby+0.005
Neg logit contributions
chairs-0.008
 stations-0.008
 emergencies-0.008
adders-0.008
 ambul-0.008
Token chose
Token ID7690
Feature activation-4.42e-02

Pos logit contributions
adders+0.047
 stations+0.046
chairs+0.046
 emergencies+0.046
 basement+0.045
Neg logit contributions
ory-0.033
abby-0.028
ka-0.028
by-0.028
va-0.027
Token some
Token ID617
Feature activation+8.85e-03

Pos logit contributions
 mineral+0.060
 cuc+0.060
ory+0.059
 pineapple+0.058
 ruby+0.053
Neg logit contributions
adders-0.076
 stations-0.075
chairs-0.075
 basement-0.074
 emergencies-0.072
Token carrots
Token ID34397
Feature activation-4.16e-02

Pos logit contributions
 Michael+0.015
 stations+0.015
chairs+0.015
 basement+0.015
adders+0.015
Neg logit contributions
 cuc-0.012
 mineral-0.011
ory-0.011
 pineapple-0.010
 taste-0.010
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
ory+0.063
 cuc+0.061
 mineral+0.054
abby+0.052
 cabbage+0.051
Neg logit contributions
 stations-0.074
 emergencies-0.074
chairs-0.073
 ambul-0.071
 basement-0.070
Token cuc
Token ID38421
Feature activation-2.07e-01

Pos logit contributions
 cuc+0.025
ory+0.023
 pineapple+0.022
 che+0.021
 cabbage+0.021
Neg logit contributions
 stations-0.031
adders-0.031
chairs-0.030
 emergencies-0.030
 basement-0.030
Tokenumbers
Token ID17024
Feature activation-2.01e-02

Pos logit contributions
ory+0.304
va+0.299
hire+0.288
 mineral+0.287
abby+0.286
Neg logit contributions
chairs-0.292
 themselves-0.289
adders-0.287
 stations-0.275
 ambul-0.258
Token to
Token ID284
Feature activation-3.35e-02

Pos logit contributions
ory+0.033
 cuc+0.032
hire+0.030
abby+0.030
 mineral+0.028
Neg logit contributions
chairs-0.032
 basement-0.030
 stations-0.030
 emergencies-0.030
 ambul-0.030
Token take
Token ID1011
Feature activation-5.46e-02

Pos logit contributions
ory+0.036
 cuc+0.035
hire+0.033
abby+0.031
va+0.031
Neg logit contributions
chairs-0.068
 stations-0.066
adders-0.064
 ambul-0.064
 Michael-0.063
Token home
Token ID1363
Feature activation-2.41e-02

Pos logit contributions
ory+0.096
 cuc+0.095
ka+0.089
hire+0.088
 pineapple+0.088
Neg logit contributions
 stations-0.078
 basement-0.078
chairs-0.077
 ambul-0.074
adders-0.072
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
ory+0.039
 cuc+0.037
hire+0.034
abby+0.033
 pineapple+0.032
Neg logit contributions
 stations-0.040
chairs-0.040
 basement-0.040
 Michael-0.037
 emergencies-0.037
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
ory+0.045
 cuc+0.040
hire+0.038
va+0.037
abby+0.036
Neg logit contributions
chairs-0.065
 stations-0.061
 ambul-0.061
 emergencies-0.060
adders-0.060
Token You
Token ID921
Feature activation-2.37e-02

Pos logit contributions
ory+0.019
va+0.018
 cuc+0.018
 pineapple+0.018
 mineral+0.017
Neg logit contributions
chairs-0.020
 stations-0.020
adders-0.019
 ambul-0.018
 rails-0.018
Token found
Token ID1043
Feature activation-1.99e-02

Pos logit contributions
ory+0.041
abby+0.038
 cuc+0.038
va+0.038
hire+0.037
Neg logit contributions
chairs-0.032
 stations-0.031
 themselves-0.031
adders-0.030
 ambul-0.030
Token a
Token ID257
Feature activation-2.72e-02

Pos logit contributions
 cuc+0.027
 pineapple+0.027
ory+0.026
hire+0.026
 peanut+0.026
Neg logit contributions
chairs-0.035
 stations-0.032
 ambul-0.031
adders-0.030
path-0.030
Token nice
Token ID3621
Feature activation-3.22e-02

Pos logit contributions
ory+0.037
 cuc+0.035
va+0.034
 mineral+0.034
abby+0.033
Neg logit contributions
chairs-0.048
 themselves-0.046
 stations-0.045
adders-0.045
 ambul-0.044
Token cuc
Token ID38421
Feature activation-1.95e-01

Pos logit contributions
ory+0.046
hire+0.043
abby+0.043
 cuc+0.042
va+0.042
Neg logit contributions
chairs-0.051
 stations-0.050
 themselves-0.049
 ambul-0.048
adders-0.048
Tokenumber
Token ID4494
Feature activation-3.70e-02

Pos logit contributions
va+0.305
 mineral+0.294
lder+0.293
ory+0.291
abby+0.289
Neg logit contributions
adders-0.246
 together-0.218
chairs-0.216
 themselves-0.211
 until-0.198
Token.
Token ID13
Feature activation-7.82e-03

Pos logit contributions
ory+0.054
 cuc+0.051
hire+0.049
abby+0.048
 pineapple+0.046
Neg logit contributions
chairs-0.060
 stations-0.059
 Michael-0.057
 ambul-0.057
 emergencies-0.057
Token It
Token ID632
Feature activation-2.79e-02

Pos logit contributions
ory+0.013
 cuc+0.012
hire+0.012
va+0.012
abby+0.012
Neg logit contributions
chairs-0.011
 stations-0.011
 ambul-0.011
 themselves-0.011
adders-0.011
Token will
Token ID481
Feature activation+2.25e-02

Pos logit contributions
ory+0.043
 cuc+0.041
hire+0.040
abby+0.040
va+0.040
Neg logit contributions
chairs-0.042
 stations-0.041
 themselves-0.040
adders-0.040
 ambul-0.039
Token be
Token ID307
Feature activation-6.15e-02

Pos logit contributions
chairs+0.036
 themselves+0.036
 stations+0.036
 ambul+0.035
adders+0.034
Neg logit contributions
ory-0.032
abby-0.029
va-0.029
hire-0.029
 cuc-0.029
Token has
Token ID468
Feature activation-3.91e-02

Pos logit contributions
ory+0.063
abby+0.060
hire+0.059
va+0.059
ka+0.057
Neg logit contributions
chairs-0.029
 themselves-0.029
 tracks-0.028
 Rex-0.027
 stations-0.027
Token lettuce
Token ID39406
Feature activation-9.20e-02

Pos logit contributions
ory+0.078
abby+0.073
hire+0.072
va+0.072
 cuc+0.072
Neg logit contributions
chairs-0.041
 stations-0.040
 themselves-0.039
adders-0.038
 ambul-0.038
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
ory+0.185
 cuc+0.174
hire+0.174
abby+0.173
va+0.172
Neg logit contributions
chairs-0.097
 stations-0.092
 themselves-0.089
 ambul-0.088
adders-0.087
Token tomatoes
Token ID23972
Feature activation-5.53e-02

Pos logit contributions
ory+0.018
abby+0.018
va+0.017
hire+0.016
ka+0.016
Neg logit contributions
 themselves-0.021
 stations-0.019
 tracks-0.019
 Rex-0.019
chairs-0.019
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
ory+0.096
 cuc+0.094
hire+0.090
abby+0.087
 pineapple+0.086
Neg logit contributions
chairs-0.078
 Michael-0.074
 ambul-0.074
 stations-0.073
 emergencies-0.069
Token cuc
Token ID38421
Feature activation-1.90e-01

Pos logit contributions
ory+0.014
abby+0.013
va+0.012
hire+0.012
ka+0.012
Neg logit contributions
 themselves-0.019
 stations-0.019
chairs-0.019
 ambul-0.018
 tracks-0.018
Tokenumbers
Token ID17024
Feature activation-5.26e-03

Pos logit contributions
va+0.365
 mineral+0.344
lder+0.339
mber+0.328
 complexion+0.325
Neg logit contributions
adders-0.210
 themselves-0.206
 together-0.186
 until-0.162
 stations-0.160
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
ory+0.009
 cuc+0.009
hire+0.009
abby+0.008
 pineapple+0.008
Neg logit contributions
chairs-0.008
 ambul-0.007
 stations-0.007
 Michael-0.007
 basement-0.007
Token And
Token ID843
Feature activation+5.12e-02

Pos logit contributions
 pineapple+0.021
 cuc+0.021
ory+0.020
va+0.017
 mineral+0.017
Neg logit contributions
chairs-0.037
 ambul-0.036
 stations-0.035
adders-0.032
 emergencies-0.032
Token it
Token ID340
Feature activation-9.86e-03

Pos logit contributions
chairs+0.086
 ambul+0.073
 stations+0.069
 basement+0.066
 emergencies+0.064
Neg logit contributions
 cuc-0.092
ory-0.089
 pineapple-0.088
 ruby-0.084
hire-0.083
Token has
Token ID468
Feature activation-3.28e-02

Pos logit contributions
ory+0.021
abby+0.021
va+0.020
 complexion+0.020
hire+0.019
Neg logit contributions
 themselves-0.008
chairs-0.008
 stations-0.008
 tracks-0.008
 together-0.008
Token garden
Token ID11376
Feature activation-4.74e-02

Pos logit contributions
ory+0.035
abby+0.033
hire+0.033
va+0.033
ka+0.033
Neg logit contributions
chairs-0.016
 stations-0.016
 themselves-0.015
 ambul-0.015
adders-0.015
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
 cuc+0.068
ory+0.063
 pineapple+0.057
 che+0.056
hire+0.055
Neg logit contributions
chairs-0.084
 Michael-0.079
 ambul-0.078
 emergencies-0.077
 Kenny-0.075
Token They
Token ID1119
Feature activation-2.10e-02

Pos logit contributions
ory+0.025
 cuc+0.023
va+0.023
 pineapple+0.021
 taste+0.021
Neg logit contributions
 stations-0.038
chairs-0.038
 ambul-0.036
adders-0.035
 cords-0.032
Token cut
Token ID2005
Feature activation-2.25e-02

Pos logit contributions
va+0.032
ory+0.031
abby+0.031
ka+0.030
hire+0.030
Neg logit contributions
 themselves-0.031
adders-0.031
 stations-0.031
chairs-0.030
 Rex-0.030
Token the
Token ID262
Feature activation-2.00e-02

Pos logit contributions
ory+0.037
hire+0.036
 cuc+0.035
ka+0.035
 pineapple+0.033
Neg logit contributions
chairs-0.034
 ambul-0.031
 stations-0.031
 emergencies-0.031
 basement-0.030
Token cuc
Token ID38421
Feature activation-1.71e-01

Pos logit contributions
ka+0.030
va+0.029
ory+0.029
abby+0.029
ited+0.027
Neg logit contributions
 until-0.029
 themselves-0.028
 cables-0.027
 basement-0.026
 tracks-0.026
Tokenumber
Token ID4494
Feature activation-3.26e-02

Pos logit contributions
va+0.339
 complexion+0.320
lder+0.319
ka+0.318
 smile+0.312
Neg logit contributions
adders-0.177
 themselves-0.156
 together-0.138
oses-0.131
 until-0.126
Token and
Token ID290
Feature activation-7.78e-03

Pos logit contributions
ory+0.053
hire+0.052
 cuc+0.052
abby+0.049
 pineapple+0.048
Neg logit contributions
chairs-0.047
 stations-0.044
 ambul-0.044
 emergencies-0.043
 Michael-0.043
Token put
Token ID1234
Feature activation-4.38e-02

Pos logit contributions
ory+0.010
 cuc+0.009
hire+0.009
abby+0.008
va+0.008
Neg logit contributions
chairs-0.014
 stations-0.014
adders-0.013
 ambul-0.013
 themselves-0.013
Token it
Token ID340
Feature activation-9.46e-03

Pos logit contributions
ory+0.083
 cuc+0.081
hire+0.079
abby+0.076
 pineapple+0.076
Neg logit contributions
chairs-0.054
 stations-0.050
 ambul-0.049
adders-0.047
 Michael-0.046
Token in
Token ID287
Feature activation-4.31e-02

Pos logit contributions
ory+0.016
 cuc+0.015
hire+0.015
abby+0.015
va+0.014
Neg logit contributions
chairs-0.013
 stations-0.013
 ambul-0.012
 themselves-0.012
 Michael-0.012
Token sip
Token ID31145
Feature activation-4.90e-02

Pos logit contributions
ory+0.098
va+0.091
abby+0.087
hire+0.087
ka+0.086
Neg logit contributions
chairs-0.140
 stations-0.139
 themselves-0.135
adders-0.134
 ambul-0.133
Token and
Token ID290
Feature activation-4.04e-02

Pos logit contributions
ory+0.081
 cuc+0.077
va+0.074
 taste+0.073
 pineapple+0.068
Neg logit contributions
 stations-0.072
adders-0.072
chairs-0.071
 ambul-0.070
 emergencies-0.070
Token nib
Token ID33272
Feature activation-1.21e-01

Pos logit contributions
 cuc+0.047
 che+0.045
ory+0.044
 taste+0.043
 ate+0.041
Neg logit contributions
chairs-0.077
 stations-0.077
 ambul-0.076
adders-0.075
 emergencies-0.073
Tokenble
Token ID903
Feature activation-6.44e-02

Pos logit contributions
ory+0.169
abby+0.154
hire+0.153
 cuc+0.153
va+0.150
Neg logit contributions
chairs-0.211
 stations-0.206
 ambul-0.203
 tracks-0.195
 Michael-0.194
Token and
Token ID290
Feature activation-2.66e-02

Pos logit contributions
 cuc+0.094
ory+0.092
hire+0.085
 che+0.083
 pineapple+0.081
Neg logit contributions
 stations-0.110
 ambul-0.108
 basement-0.107
 emergencies-0.106
chairs-0.105
Token smile
Token ID8212
Feature activation-1.69e-01

Pos logit contributions
ory+0.029
hire+0.026
abby+0.026
 cuc+0.026
va+0.025
Neg logit contributions
chairs-0.052
 stations-0.051
 themselves-0.050
 ambul-0.050
adders-0.050
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
ory+0.310
hire+0.292
 cuc+0.290
abby+0.284
va+0.283
Neg logit contributions
chairs-0.217
 stations-0.211
adders-0.206
 ambul-0.204
 emergencies-0.194
Token They
Token ID1119
Feature activation-2.94e-02

Pos logit contributions
ory+0.031
 cuc+0.031
va+0.031
 pineapple+0.029
 cabbage+0.026
Neg logit contributions
 stations-0.054
 ambul-0.049
chairs-0.048
 emergencies-0.047
adders-0.047
Token feel
Token ID1254
Feature activation-7.94e-02

Pos logit contributions
ory+0.037
abby+0.034
hire+0.034
 cuc+0.034
va+0.033
Neg logit contributions
chairs-0.052
 stations-0.051
 themselves-0.050
adders-0.050
 ambul-0.050
Token happy
Token ID3772
Feature activation-3.12e-02

Pos logit contributions
ory+0.120
 cuc+0.116
hire+0.112
va+0.109
abby+0.106
Neg logit contributions
chairs-0.129
 stations-0.123
 ambul-0.119
adders-0.118
 Michael-0.116
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
ory+0.050
 cuc+0.048
va+0.045
hire+0.044
 mineral+0.043
Neg logit contributions
chairs-0.051
adders-0.048
 stations-0.047
 ambul-0.047
 Michael-0.046
Token proud
Token ID6613
Feature activation-8.18e-02

Pos logit contributions
adders+0.021
 ambul+0.021
chairs+0.021
 fault+0.021
 emergencies+0.021
Neg logit contributions
 smile-0.008
 taste-0.007
 rip-0.007
 ring-0.007
 smell-0.007
Token of
Token ID286
Feature activation-1.60e-02

Pos logit contributions
ory+0.137
va+0.130
 cuc+0.123
hire+0.117
abby+0.116
Neg logit contributions
chairs-0.128
adders-0.125
 stations-0.122
 ambul-0.119
 basement-0.117
Token them
Token ID606
Feature activation-3.16e-02

Pos logit contributions
va+0.019
ory+0.018
 cuc+0.018
 pineapple+0.017
 ruby+0.017
Neg logit contributions
chairs-0.029
 stations-0.028
cases-0.027
adders-0.027
 basement-0.026
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
ory+0.038
 smile+0.030
 token+0.030
hire+0.029
va+0.029
Neg logit contributions
chairs-0.060
adders-0.059
 stations-0.058
 basement-0.058
 emergencies-0.058
Token They
Token ID1119
Feature activation-4.71e-02

Pos logit contributions
ory+0.050
va+0.048
 cuc+0.044
abby+0.043
 pineapple+0.042
Neg logit contributions
 stations-0.074
chairs-0.070
 ambul-0.069
adders-0.068
 basement-0.067
Token smile
Token ID8212
Feature activation-1.69e-01

Pos logit contributions
ory+0.059
 cuc+0.053
hire+0.053
abby+0.051
va+0.050
Neg logit contributions
chairs-0.088
 stations-0.084
 ambul-0.082
adders-0.081
 themselves-0.081
Token and
Token ID290
Feature activation-3.04e-02

Pos logit contributions
ory+0.271
hire+0.244
ka+0.233
lder+0.229
 smile+0.229
Neg logit contributions
chairs-0.279
adders-0.274
 stations-0.256
 emergencies-0.251
 ambul-0.247
Token say
Token ID910
Feature activation-3.35e-02

Pos logit contributions
ory+0.025
 cuc+0.023
 smile+0.023
 che+0.022
 lick+0.021
Neg logit contributions
chairs-0.071
adders-0.068
 stations-0.067
 ambul-0.066
 emergencies-0.065
Token thank
Token ID5875
Feature activation-8.33e-02

Pos logit contributions
va+0.047
ory+0.046
 cuc+0.043
lder+0.042
 smile+0.041
Neg logit contributions
chairs-0.061
 stations-0.058
 basement-0.055
 cords-0.053
 tracks-0.052
Token you
Token ID345
Feature activation-3.30e-02

Pos logit contributions
ory+0.127
va+0.112
abby+0.104
 cuc+0.104
ka+0.103
Neg logit contributions
chairs-0.148
 stations-0.138
adders-0.134
 basement-0.134
 tracks-0.129
Token.
Token ID13
Feature activation-3.22e-02

Pos logit contributions
ory+0.048
va+0.039
by+0.038
abby+0.036
hire+0.036
Neg logit contributions
chairs-0.067
 basement-0.060
 emergencies-0.060
 stations-0.059
 cords-0.059
Token girl
Token ID2576
Feature activation-4.46e-02

Pos logit contributions
ory+0.056
 cuc+0.051
va+0.047
ka+0.045
hire+0.045
Neg logit contributions
chairs-0.114
adders-0.111
 stations-0.111
 themselves-0.106
 ambul-0.105
Token."
Token ID526
Feature activation-1.65e-02

Pos logit contributions
ory+0.055
 cuc+0.051
va+0.050
 che+0.048
ka+0.047
Neg logit contributions
 stations-0.085
chairs-0.083
adders-0.081
 ambul-0.079
 themselves-0.078
Token With
Token ID2080
Feature activation-4.78e-02

Pos logit contributions
ory+0.022
va+0.020
 cuc+0.018
aire+0.017
abby+0.017
Neg logit contributions
 stations-0.033
adders-0.032
chairs-0.030
 ambul-0.030
 emergencies-0.029
Token a
Token ID257
Feature activation-4.05e-02

Pos logit contributions
 cuc+0.064
ory+0.060
 che+0.060
 pineapple+0.060
 ruby+0.057
Neg logit contributions
 stations-0.082
adders-0.081
chairs-0.080
oses-0.075
 basement-0.074
Token curious
Token ID11040
Feature activation-5.37e-02

Pos logit contributions
ory+0.042
 cuc+0.039
va+0.039
hire+0.037
abby+0.037
Neg logit contributions
chairs-0.082
 stations-0.081
 themselves-0.080
adders-0.079
 ambul-0.078
Token smile
Token ID8212
Feature activation-1.67e-01

Pos logit contributions
ory+0.048
 cuc+0.042
hire+0.041
va+0.041
abby+0.040
Neg logit contributions
chairs-0.118
 stations-0.115
 themselves-0.114
adders-0.112
 ambul-0.112
Token,
Token ID11
Feature activation-3.53e-02

Pos logit contributions
ory+0.236
hire+0.230
 cuc+0.229
va+0.218
ka+0.216
Neg logit contributions
chairs-0.284
 stations-0.278
adders-0.269
 emergencies-0.262
 basement-0.260
Token the
Token ID262
Feature activation-3.24e-02

Pos logit contributions
va+0.049
 cuc+0.049
ory+0.048
 che+0.046
 smile+0.044
Neg logit contributions
 stations-0.066
adders-0.062
chairs-0.059
 basement-0.058
 emergencies-0.054
Token girl
Token ID2576
Feature activation-4.29e-02

Pos logit contributions
ory+0.036
 cuc+0.034
hire+0.033
va+0.033
 mineral+0.032
Neg logit contributions
chairs-0.065
 stations-0.063
 themselves-0.062
adders-0.062
 ambul-0.061
Token opened
Token ID4721
Feature activation-5.91e-02

Pos logit contributions
ory+0.043
 cuc+0.042
 swoop+0.037
 che+0.037
va+0.036
Neg logit contributions
 emergencies-0.090
 stations-0.090
chairs-0.089
 basement-0.088
 ambul-0.087
Token the
Token ID262
Feature activation-2.73e-02

Pos logit contributions
 cuc+0.078
ory+0.075
ka+0.074
hire+0.072
 ruby+0.069
Neg logit contributions
chairs-0.109
adders-0.105
 stations-0.104
 emergencies-0.103
 ambul-0.103
Token They
Token ID1119
Feature activation-5.15e-04

Pos logit contributions
ory+0.005
va+0.005
 cuc+0.005
 pineapple+0.005
abby+0.005
Neg logit contributions
 stations-0.005
chairs-0.005
 ambul-0.005
 emergencies-0.005
adders-0.005
Token were
Token ID547
Feature activation+2.65e-02

Pos logit contributions
ory+0.001
va+0.001
abby+0.001
ka+0.001
hire+0.001
Neg logit contributions
 themselves-0.001
 stations-0.001
chairs-0.001
 ambul-0.001
adders-0.001
Token filled
Token ID5901
Feature activation-3.23e-02

Pos logit contributions
chairs+0.033
 stations+0.032
adders+0.030
 ambul+0.030
 themselves+0.029
Neg logit contributions
ory-0.049
 cuc-0.047
hire-0.046
abby-0.046
va-0.045
Token with
Token ID351
Feature activation-3.16e-02

Pos logit contributions
ory+0.061
hire+0.058
 cuc+0.057
abby+0.057
va+0.057
Neg logit contributions
 themselves-0.037
chairs-0.036
 stations-0.035
adders-0.033
 ambul-0.033
Token fresh
Token ID4713
Feature activation-1.97e-02

Pos logit contributions
ory+0.041
 cuc+0.039
hire+0.037
abby+0.036
va+0.035
Neg logit contributions
chairs-0.058
 stations-0.055
adders-0.054
 ambul-0.054
 Michael-0.053
Token cuc
Token ID38421
Feature activation-1.67e-01

Pos logit contributions
abby+0.038
va+0.037
hire+0.036
ory+0.036
lder+0.034
Neg logit contributions
 themselves-0.021
 until-0.020
 beds-0.020
chairs-0.020
 stations-0.019
Tokenumber
Token ID4494
Feature activation-2.95e-02

Pos logit contributions
 mineral+0.294
va+0.286
 ruby+0.278
ory+0.277
ka+0.272
Neg logit contributions
adders-0.200
 themselves-0.181
 together-0.166
chairs-0.163
oses-0.154
Token and
Token ID290
Feature activation-2.37e-03

Pos logit contributions
ory+0.048
hire+0.043
 cuc+0.042
abby+0.042
by+0.041
Neg logit contributions
chairs-0.046
 stations-0.045
 basement-0.045
 emergencies-0.044
adders-0.044
Token cheese
Token ID9891
Feature activation-4.00e-02

Pos logit contributions
ory+0.003
 cuc+0.003
hire+0.003
abby+0.003
va+0.003
Neg logit contributions
chairs-0.004
 stations-0.004
 themselves-0.004
 ambul-0.004
adders-0.004
Token.
Token ID13
Feature activation-7.92e-03

Pos logit contributions
 cuc+0.065
ory+0.065
hire+0.057
 pineapple+0.057
 mineral+0.056
Neg logit contributions
chairs-0.066
 stations-0.063
 ambul-0.063
 Michael-0.060
 emergencies-0.060
Token It
Token ID632
Feature activation-2.47e-02

Pos logit contributions
ory+0.012
va+0.011
 cuc+0.011
 pineapple+0.011
lder+0.011
Neg logit contributions
 stations-0.013
chairs-0.012
 emergencies-0.012
adders-0.012
 ambul-0.012
Token saw
Token ID2497
Feature activation+2.22e-02

Pos logit contributions
ory+0.051
hire+0.048
abby+0.047
va+0.045
 cuc+0.045
Neg logit contributions
chairs-0.075
 themselves-0.073
 stations-0.073
 ambul-0.072
 Michael-0.072
Token carrots
Token ID34397
Feature activation-2.58e-02

Pos logit contributions
chairs+0.036
 stations+0.034
adders+0.034
 emergencies+0.033
 Michael+0.032
Neg logit contributions
 cuc-0.036
 pineapple-0.032
 mineral-0.032
 peanut-0.031
 lobster-0.030
Token,
Token ID11
Feature activation-8.13e-03

Pos logit contributions
ory+0.042
 cuc+0.041
 mineral+0.039
hire+0.038
abby+0.037
Neg logit contributions
chairs-0.040
 stations-0.039
 ambul-0.037
 Michael-0.036
adders-0.036
Token potatoes
Token ID18821
Feature activation-1.60e-02

Pos logit contributions
ory+0.011
 cuc+0.010
hire+0.010
abby+0.010
va+0.010
Neg logit contributions
chairs-0.014
 stations-0.014
 themselves-0.013
adders-0.013
 ambul-0.013
Token,
Token ID11
Feature activation-9.05e-03

Pos logit contributions
ory+0.025
 cuc+0.025
 mineral+0.022
 pineapple+0.022
hire+0.022
Neg logit contributions
chairs-0.026
 stations-0.025
 Michael-0.024
 ambul-0.024
adders-0.023
Token cuc
Token ID38421
Feature activation-1.66e-01

Pos logit contributions
ory+0.014
 cuc+0.013
hire+0.013
abby+0.012
va+0.012
Neg logit contributions
chairs-0.014
 stations-0.014
adders-0.013
 themselves-0.013
 ambul-0.013
Tokenumbers
Token ID17024
Feature activation-3.13e-03

Pos logit contributions
va+0.318
 mineral+0.313
 taste+0.305
 ruby+0.302
 smile+0.301
Neg logit contributions
adders-0.163
 themselves-0.157
 together-0.139
 stations-0.134
chairs-0.126
Token and
Token ID290
Feature activation-3.17e-03

Pos logit contributions
ory+0.005
 cuc+0.004
hire+0.004
 mineral+0.004
abby+0.004
Neg logit contributions
chairs-0.005
 stations-0.005
 basement-0.005
 ambul-0.005
 emergencies-0.005
Token tomatoes
Token ID23972
Feature activation-4.46e-02

Pos logit contributions
 cuc+0.004
ory+0.003
 pineapple+0.003
 mineral+0.003
hire+0.003
Neg logit contributions
chairs-0.006
adders-0.006
 Michael-0.006
 stations-0.006
 basement-0.006
Token.
Token ID13
Feature activation-7.62e-03

Pos logit contributions
 cuc+0.070
ory+0.069
hire+0.062
 mineral+0.061
 pineapple+0.061
Neg logit contributions
chairs-0.072
 stations-0.070
 ambul-0.070
 Michael-0.069
 emergencies-0.067
Token He
Token ID679
Feature activation-4.87e-02

Pos logit contributions
ory+0.013
 cuc+0.012
va+0.012
abby+0.012
hire+0.012
Neg logit contributions
chairs-0.011
 stations-0.010
 ambul-0.010
adders-0.010
 themselves-0.010
Token girl
Token ID2576
Feature activation-4.81e-02

Pos logit contributions
 stations+0.028
chairs+0.028
 themselves+0.026
 ambul+0.025
adders+0.025
Neg logit contributions
ory-0.028
 cuc-0.027
abby-0.026
ka-0.026
va-0.026
Token and
Token ID290
Feature activation-3.14e-02

Pos logit contributions
ory+0.059
va+0.057
 cuc+0.052
 smile+0.052
lder+0.051
Neg logit contributions
 stations-0.093
chairs-0.092
 emergencies-0.088
 ambul-0.087
adders-0.086
Token swing
Token ID9628
Feature activation-2.15e-02

Pos logit contributions
ory+0.041
 cuc+0.039
 smile+0.039
va+0.037
 ate+0.036
Neg logit contributions
 stations-0.054
chairs-0.054
 ambul-0.051
adders-0.051
 emergencies-0.051
Token with
Token ID351
Feature activation-5.59e-02

Pos logit contributions
ka+0.025
thank+0.023
ory+0.022
 smile+0.022
 swoop+0.021
Neg logit contributions
 emergencies-0.042
 ambul-0.040
chairs-0.036
 basement-0.036
 hurricanes-0.036
Token a
Token ID257
Feature activation-4.43e-02

Pos logit contributions
 smile+0.064
 mineral+0.057
 pineapple+0.057
 cuc+0.057
ka+0.057
Neg logit contributions
chairs-0.099
 ambul-0.099
 stations-0.097
 basement-0.097
adders-0.095
Token smile
Token ID8212
Feature activation-1.65e-01

Pos logit contributions
ory+0.048
 cuc+0.047
va+0.047
 mineral+0.046
 smile+0.045
Neg logit contributions
chairs-0.087
 themselves-0.086
adders-0.084
 stations-0.084
 ambul-0.082
Token on
Token ID319
Feature activation+3.96e-03

Pos logit contributions
ory+0.227
hire+0.222
ka+0.215
 cuc+0.202
va+0.200
Neg logit contributions
chairs-0.277
adders-0.276
 ambul-0.273
 emergencies-0.273
 stations-0.273
Token his
Token ID465
Feature activation-2.14e-02

Pos logit contributions
chairs+0.006
 stations+0.006
adders+0.005
 ambul+0.005
 basement+0.005
Neg logit contributions
ory-0.006
hire-0.006
 cuc-0.006
va-0.006
abby-0.006
Token face
Token ID1986
Feature activation-9.09e-02

Pos logit contributions
ory+0.019
hire+0.016
 cuc+0.016
va+0.015
 ruby+0.015
Neg logit contributions
chairs-0.049
 stations-0.047
 ambul-0.046
adders-0.046
 Michael-0.045
Token.
Token ID13
Feature activation-1.86e-02

Pos logit contributions
ory+0.088
 smile+0.078
hire+0.074
 cuc+0.073
 ruby+0.071
Neg logit contributions
 stations-0.195
 basement-0.193
 emergencies-0.187
 ambul-0.187
adders-0.187
Token<|endoftext|>
Token ID50256
Feature activation-4.13e-02

Pos logit contributions
ory+0.022
va+0.021
 cuc+0.020
 pineapple+0.019
abby+0.018
Neg logit contributions
 stations-0.037
adders-0.035
 ambul-0.034
 emergencies-0.034
chairs-0.034
Token wise
Token ID10787
Feature activation-9.06e-02

Pos logit contributions
ory+0.016
 cuc+0.014
hire+0.014
abby+0.014
va+0.014
Neg logit contributions
chairs-0.017
 stations-0.017
 themselves-0.016
adders-0.016
 ambul-0.016
Token man
Token ID582
Feature activation-7.75e-02

Pos logit contributions
ory+0.135
hire+0.124
 cuc+0.123
va+0.122
abby+0.118
Neg logit contributions
chairs-0.151
adders-0.149
 stations-0.145
 Michael-0.143
 ambul-0.140
Token with
Token ID351
Feature activation-6.12e-02

Pos logit contributions
ory+0.108
 cuc+0.101
va+0.095
 lobster+0.094
 Crown+0.093
Neg logit contributions
 stations-0.135
chairs-0.132
 basement-0.129
adders-0.128
 emergencies-0.124
Token a
Token ID257
Feature activation-4.28e-02

Pos logit contributions
 cuc+0.085
 mineral+0.083
 ruby+0.081
hire+0.080
 pineapple+0.079
Neg logit contributions
chairs-0.100
 stations-0.095
adders-0.091
 ambul-0.090
 basement-0.088
Token kind
Token ID1611
Feature activation-8.41e-02

Pos logit contributions
ory+0.050
 cuc+0.049
 mineral+0.047
va+0.047
hire+0.045
Neg logit contributions
chairs-0.082
 themselves-0.081
 stations-0.081
adders-0.079
 Michael-0.076
Token smile
Token ID8212
Feature activation-1.65e-01

Pos logit contributions
ory+0.113
abby+0.103
 cuc+0.103
hire+0.103
va+0.101
Neg logit contributions
chairs-0.142
 stations-0.139
 themselves-0.136
 ambul-0.136
adders-0.135
Token.
Token ID13
Feature activation-2.92e-02

Pos logit contributions
ory+0.257
hire+0.253
 cuc+0.241
ka+0.238
abby+0.232
Neg logit contributions
chairs-0.253
 stations-0.248
adders-0.246
 ambul-0.241
 emergencies-0.235
Token The
Token ID383
Feature activation-6.94e-03

Pos logit contributions
ory+0.037
va+0.035
 cuc+0.033
abby+0.032
 pineapple+0.032
Neg logit contributions
 stations-0.057
adders-0.054
chairs-0.053
 ambul-0.053
 rails-0.051
Token wise
Token ID10787
Feature activation-7.95e-02

Pos logit contributions
ory+0.008
 cuc+0.008
hire+0.008
va+0.007
abby+0.007
Neg logit contributions
chairs-0.013
 stations-0.013
 themselves-0.013
adders-0.012
 ambul-0.012
Token man
Token ID582
Feature activation-4.70e-02

Pos logit contributions
ory+0.102
abby+0.093
 cuc+0.093
hire+0.092
va+0.092
Neg logit contributions
chairs-0.139
 stations-0.135
 themselves-0.132
 ambul-0.132
adders-0.131
Token told
Token ID1297
Feature activation+3.02e-02

Pos logit contributions
ory+0.061
 cuc+0.056
va+0.055
hire+0.054
abby+0.053
Neg logit contributions
 stations-0.084
chairs-0.083
adders-0.081
 themselves-0.080
 emergencies-0.079
Token thank
Token ID5875
Feature activation-5.40e-02

Pos logit contributions
chairs+0.003
 stations+0.003
adders+0.003
 ambul+0.003
 themselves+0.003
Neg logit contributions
ory-0.003
 cuc-0.003
va-0.003
hire-0.003
 pineapple-0.003
Token you
Token ID345
Feature activation-6.03e-03

Pos logit contributions
hire+0.118
abby+0.117
ory+0.117
 cuc+0.115
 mineral+0.114
Neg logit contributions
 themselves-0.046
 Rex-0.042
 Michael-0.039
chairs-0.038
 ambul-0.037
Token.
Token ID13
Feature activation-9.93e-04

Pos logit contributions
ory+0.008
 cuc+0.007
va+0.007
abby+0.006
hire+0.006
Neg logit contributions
chairs-0.012
 stations-0.012
 basement-0.011
adders-0.011
 ambul-0.011
Token We
Token ID775
Feature activation+3.51e-02

Pos logit contributions
ory+0.002
 cuc+0.001
hire+0.001
va+0.001
abby+0.001
Neg logit contributions
chairs-0.001
 stations-0.001
 themselves-0.001
 ambul-0.001
adders-0.001
Token love
Token ID1842
Feature activation+1.27e-02

Pos logit contributions
 themselves+0.038
adders+0.037
 shelter+0.035
 until+0.035
 stations+0.035
Neg logit contributions
va-0.064
ory-0.062
 cuc-0.061
abby-0.060
ka-0.060
Token cuc
Token ID38421
Feature activation-1.65e-01

Pos logit contributions
chairs+0.015
 stations+0.014
 themselves+0.014
 ambul+0.014
adders+0.014
Neg logit contributions
ory-0.024
abby-0.023
hire-0.023
 cuc-0.022
va-0.022
Tokenumbers
Token ID17024
Feature activation+1.18e-02

Pos logit contributions
va+0.295
 mineral+0.286
ory+0.280
 ruby+0.276
abby+0.272
Neg logit contributions
adders-0.194
 themselves-0.169
 together-0.161
chairs-0.161
oses-0.148
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
chairs+0.022
 stations+0.021
 basement+0.021
 Michael+0.021
 emergencies+0.020
Neg logit contributions
ory-0.016
 cuc-0.014
hire-0.013
 mineral-0.013
 pineapple-0.013
Token And
Token ID843
Feature activation+6.03e-02

Pos logit contributions
ory+0.003
 cuc+0.003
hire+0.002
 pineapple+0.002
va+0.002
Neg logit contributions
chairs-0.003
 stations-0.002
 ambul-0.002
adders-0.002
 themselves-0.002
Token we
Token ID356
Feature activation+3.81e-02

Pos logit contributions
chairs+0.078
 stations+0.071
 ambul+0.069
adders+0.068
 Michael+0.066
Neg logit contributions
ory-0.112
 cuc-0.108
hire-0.104
va-0.104
abby-0.102
Token love
Token ID1842
Feature activation+1.22e-02

Pos logit contributions
 themselves+0.043
adders+0.042
 stations+0.041
 until+0.040
 basement+0.038
Neg logit contributions
va-0.069
ory-0.068
abby-0.067
ka-0.065
 cuc-0.064
Token finished
Token ID5201
Feature activation-2.13e-02

Pos logit contributions
 themselves+0.054
adders+0.051
 stations+0.051
chairs+0.051
 Michael+0.049
Neg logit contributions
ory-0.054
va-0.052
hire-0.051
abby-0.051
ka-0.051
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
ory+0.039
hire+0.036
 cuc+0.036
abby+0.036
va+0.036
Neg logit contributions
chairs-0.026
 stations-0.026
 themselves-0.025
 ambul-0.025
adders-0.024
Token Mrs
Token ID9074
Feature activation-3.65e-02

Pos logit contributions
ory+0.020
 cuc+0.018
va+0.018
 pineapple+0.017
hire+0.016
Neg logit contributions
chairs-0.017
adders-0.017
 stations-0.016
 ambul-0.015
 rails-0.014
Token.
Token ID13
Feature activation-5.17e-03

Pos logit contributions
ory+0.071
hire+0.066
 cuc+0.066
abby+0.066
va+0.065
Neg logit contributions
chairs-0.042
 stations-0.040
adders-0.038
 ambul-0.038
 themselves-0.038
Token P
Token ID350
Feature activation-4.89e-02

Pos logit contributions
ory+0.008
hire+0.008
abby+0.008
va+0.007
 cuc+0.007
Neg logit contributions
chairs-0.007
 themselves-0.007
 Rex-0.007
 Michael-0.007
 ambul-0.007
Tokenige
Token ID10045
Feature activation-1.65e-01

Pos logit contributions
 mineral+0.054
 cuc+0.053
 swoop+0.052
 ruby+0.051
 lick+0.051
Neg logit contributions
chairs-0.083
 themselves-0.082
 fault-0.081
adders-0.080
oses-0.076
Tokenon
Token ID261
Feature activation-3.49e-02

Pos logit contributions
ory+0.253
 cuc+0.229
abby+0.226
hire+0.219
va+0.216
Neg logit contributions
chairs-0.241
 themselves-0.237
adders-0.230
 stations-0.227
 tracks-0.225
Token said
Token ID531
Feature activation-8.61e-03

Pos logit contributions
ory+0.050
 cuc+0.047
hire+0.047
va+0.047
abby+0.046
Neg logit contributions
chairs-0.059
 stations-0.057
 ambul-0.056
 emergencies-0.055
adders-0.053
Token Jack
Token ID3619
Feature activation-3.08e-02

Pos logit contributions
ory+0.015
va+0.015
 cuc+0.015
hire+0.014
abby+0.013
Neg logit contributions
chairs-0.013
 stations-0.012
 ambul-0.011
 basement-0.011
adders-0.011
Token had
Token ID550
Feature activation-4.97e-02

Pos logit contributions
ory+0.041
va+0.038
hire+0.038
abby+0.037
 cuc+0.036
Neg logit contributions
 stations-0.055
chairs-0.053
adders-0.052
 ambul-0.052
 emergencies-0.052
Token done
Token ID1760
Feature activation-5.34e-02

Pos logit contributions
ory+0.072
 cuc+0.066
hire+0.066
abby+0.066
va+0.065
Neg logit contributions
chairs-0.081
 stations-0.078
 themselves-0.077
 ambul-0.076
adders-0.076
Token.
Token ID13
Feature activation-2.28e-02

Pos logit contributions
 mineral+0.013
ory+0.012
 cuc+0.012
 taste+0.012
 jewel+0.010
Neg logit contributions
chairs-0.044
 stations-0.044
 Michael-0.042
adders-0.041
 ambul-0.040
Token They
Token ID1119
Feature activation-4.94e-02

Pos logit contributions
va+0.025
ory+0.025
 cuc+0.023
 pineapple+0.021
 mineral+0.021
Neg logit contributions
 stations-0.047
adders-0.043
chairs-0.042
 emergencies-0.042
 ambul-0.042
Token hold
Token ID1745
Feature activation-7.60e-02

Pos logit contributions
ory+0.056
 cuc+0.050
hire+0.049
abby+0.045
va+0.045
Neg logit contributions
chairs-0.097
 stations-0.092
 emergencies-0.092
 ambul-0.091
adders-0.090
Token hands
Token ID2832
Feature activation-6.54e-02

Pos logit contributions
ka+0.099
ory+0.098
by+0.092
 lick+0.079
 mor+0.073
Neg logit contributions
 basement-0.148
 stations-0.134
chairs-0.130
adders-0.130
 emergencies-0.127
Token and
Token ID290
Feature activation-2.85e-02

Pos logit contributions
ory+0.072
 smile+0.056
ka+0.055
abby+0.053
by+0.053
Neg logit contributions
 emergencies-0.149
 basement-0.145
adders-0.140
 stations-0.138
chairs-0.136
Token smile
Token ID8212
Feature activation-1.64e-01

Pos logit contributions
 lick+0.026
 che+0.025
 smile+0.024
ory+0.024
 taste+0.022
Neg logit contributions
chairs-0.062
adders-0.061
 stations-0.061
 emergencies-0.060
 basement-0.059
Token.
Token ID13
Feature activation-3.25e-02

Pos logit contributions
ory+0.245
hire+0.215
by+0.204
ka+0.203
 cuc+0.198
Neg logit contributions
chairs-0.302
adders-0.291
 basement-0.279
 stations-0.279
 emergencies-0.275
Token
Token ID198
Feature activation-2.64e-02

Pos logit contributions
ory+0.031
va+0.028
 pineapple+0.026
 cuc+0.025
aire+0.024
Neg logit contributions
chairs-0.068
 stations-0.068
adders-0.065
 basement-0.063
 ambul-0.063
Token
Token ID198
Feature activation-2.16e-02

Pos logit contributions
abby+0.033
ory+0.033
va+0.032
hire+0.028
 cuc+0.028
Neg logit contributions
chairs-0.047
 stations-0.045
 ambul-0.044
adders-0.043
 basement-0.041
Token"
Token ID1
Feature activation-6.63e-02

Pos logit contributions
ory+0.028
abby+0.027
va+0.027
hire+0.026
 cuc+0.025
Neg logit contributions
chairs-0.039
 stations-0.036
 ambul-0.036
adders-0.036
 tracks-0.034
TokenAre
Token ID8491
Feature activation+2.17e-03

Pos logit contributions
ory+0.077
 cuc+0.070
hire+0.070
va+0.070
abby+0.069
Neg logit contributions
chairs-0.128
 stations-0.123
adders-0.122
 ambul-0.120
 themselves-0.120
Token He
Token ID679
Feature activation-5.45e-02

Pos logit contributions
chairs+0.007
 Michael+0.007
adders+0.006
 Ali+0.006
 ambul+0.006
Neg logit contributions
 mineral-0.004
 ruby-0.003
 cuc-0.003
 nut-0.003
 che-0.003
Token also
Token ID635
Feature activation-5.66e-02

Pos logit contributions
ory+0.055
 cuc+0.053
 che+0.050
 mineral+0.048
hire+0.048
Neg logit contributions
chairs-0.110
 stations-0.108
 emergencies-0.107
 ambul-0.106
adders-0.105
Token has
Token ID468
Feature activation-6.08e-02

Pos logit contributions
ory+0.055
 cuc+0.052
va+0.048
 che+0.048
hire+0.046
Neg logit contributions
chairs-0.121
 stations-0.115
 ambul-0.114
adders-0.113
 emergencies-0.113
Token a
Token ID257
Feature activation-4.90e-02

Pos logit contributions
 pineapple+0.088
 cuc+0.086
 che+0.086
ory+0.079
va+0.079
Neg logit contributions
chairs-0.102
 stations-0.091
 basement-0.088
adders-0.087
 Michael-0.086
Token big
Token ID1263
Feature activation-4.89e-02

Pos logit contributions
ory+0.062
 cuc+0.060
 mineral+0.057
va+0.057
hire+0.055
Neg logit contributions
chairs-0.090
 themselves-0.088
 stations-0.087
adders-0.086
 ambul-0.083
Token smile
Token ID8212
Feature activation-1.64e-01

Pos logit contributions
ory+0.063
 cuc+0.058
hire+0.057
abby+0.056
va+0.056
Neg logit contributions
chairs-0.087
 stations-0.085
 themselves-0.084
adders-0.082
 ambul-0.082
Token on
Token ID319
Feature activation+4.24e-03

Pos logit contributions
ory+0.260
hire+0.250
 cuc+0.245
ka+0.237
va+0.234
Neg logit contributions
chairs-0.248
 stations-0.248
adders-0.243
 themselves-0.239
 ambul-0.233
Token his
Token ID465
Feature activation-2.54e-02

Pos logit contributions
chairs+0.006
 stations+0.006
 themselves+0.005
 ambul+0.005
adders+0.005
Neg logit contributions
ory-0.007
 cuc-0.007
hire-0.007
abby-0.007
va-0.007
Token face
Token ID1986
Feature activation-9.41e-02

Pos logit contributions
ory+0.025
hire+0.022
 cuc+0.022
abby+0.021
va+0.021
Neg logit contributions
chairs-0.055
 stations-0.053
 ambul-0.052
adders-0.051
 Michael-0.051
Token.
Token ID13
Feature activation-2.58e-02

Pos logit contributions
ory+0.107
 cuc+0.096
hire+0.093
 mineral+0.092
 ruby+0.092
Neg logit contributions
 stations-0.194
adders-0.183
 basement-0.183
 ambul-0.181
chairs-0.179
Token
Token ID198
Feature activation-1.46e-02

Pos logit contributions
ory+0.030
va+0.029
 cuc+0.029
 pineapple+0.028
 mineral+0.027
Neg logit contributions
 stations-0.051
chairs-0.048
 ambul-0.047
adders-0.047
 rails-0.046
Token was
Token ID373
Feature activation-2.81e-03

Pos logit contributions
ory+0.079
 cuc+0.074
hire+0.073
abby+0.073
ka+0.069
Neg logit contributions
chairs-0.073
 stations-0.069
adders-0.069
 Michael-0.069
 ambul-0.065
Token happy
Token ID3772
Feature activation-5.10e-02

Pos logit contributions
 cuc+0.004
 che+0.004
 smile+0.003
ory+0.003
va+0.003
Neg logit contributions
chairs-0.005
 stations-0.005
 ambul-0.005
adders-0.004
 rails-0.004
Token to
Token ID284
Feature activation-5.52e-02

Pos logit contributions
ory+0.067
va+0.064
 cuc+0.061
 smile+0.061
 mineral+0.060
Neg logit contributions
adders-0.095
chairs-0.095
 stations-0.093
 ambul-0.090
 emergencies-0.087
Token put
Token ID1234
Feature activation-7.75e-02

Pos logit contributions
ory+0.055
va+0.048
hire+0.047
 cuc+0.047
abby+0.046
Neg logit contributions
chairs-0.121
 stations-0.115
 ambul-0.110
adders-0.109
 rails-0.106
Token a
Token ID257
Feature activation-4.13e-02

Pos logit contributions
ka+0.097
 mineral+0.092
ory+0.091
 che+0.085
 ruby+0.085
Neg logit contributions
 stations-0.140
 ambul-0.138
adders-0.132
chairs-0.128
 basement-0.127
Token smile
Token ID8212
Feature activation-1.64e-01

Pos logit contributions
ory+0.055
 cuc+0.051
 mineral+0.050
va+0.050
hire+0.049
Neg logit contributions
chairs-0.074
 stations-0.072
 themselves-0.072
adders-0.071
 ambul-0.069
Token on
Token ID319
Feature activation+4.00e-03

Pos logit contributions
ory+0.195
hire+0.193
ka+0.177
 jewel+0.175
 ruby+0.173
Neg logit contributions
 ambul-0.303
 stations-0.303
adders-0.302
 emergencies-0.300
chairs-0.298
Token her
Token ID607
Feature activation-3.67e-02

Pos logit contributions
 stations+0.007
chairs+0.007
adders+0.007
 ambul+0.007
 basement+0.006
Neg logit contributions
ory-0.005
hire-0.005
 cuc-0.005
 ruby-0.005
 mineral-0.005
Token friend
Token ID1545
Feature activation-4.42e-03

Pos logit contributions
ory+0.037
 cuc+0.034
 mineral+0.033
hire+0.032
 ruby+0.032
Neg logit contributions
chairs-0.080
 stations-0.078
 ambul-0.075
adders-0.075
 Michael-0.073
Tokenâ
Token ID22940
Feature activation+2.87e-02

Pos logit contributions
ory+0.006
 smile+0.005
 che+0.005
 peanut+0.005
 cuc+0.005
Neg logit contributions
 stations-0.009
adders-0.009
 ambul-0.008
 basement-0.008
chairs-0.008
Token€
Token ID26391
Feature activation-1.87e-02

Pos logit contributions
chairs+0.046
 stations+0.045
 themselves+0.044
 ambul+0.043
adders+0.043
Neg logit contributions
ory-0.042
 cuc-0.039
hire-0.039
abby-0.038
va-0.038
Token says
Token ID1139
Feature activation-2.46e-02

Pos logit contributions
ory+0.075
hire+0.075
 cuc+0.072
ka+0.071
 mineral+0.071
Neg logit contributions
chairs-0.082
adders-0.072
 stations-0.072
 themselves-0.069
 ambul-0.067
Token.
Token ID13
Feature activation-5.55e-02

Pos logit contributions
ory+0.033
va+0.031
 cuc+0.029
 smile+0.027
lder+0.027
Neg logit contributions
chairs-0.047
 stations-0.046
 basement-0.044
 cords-0.043
 cables-0.043
Token
Token ID198
Feature activation-4.59e-02

Pos logit contributions
va+0.063
ory+0.059
 cuc+0.056
 pineapple+0.055
 ruby+0.049
Neg logit contributions
 stations-0.112
chairs-0.106
 ambul-0.103
 basement-0.101
adders-0.101
Token
Token ID198
Feature activation-3.64e-02

Pos logit contributions
ory+0.067
va+0.065
abby+0.064
 cuc+0.061
hire+0.060
Neg logit contributions
chairs-0.072
 stations-0.070
 ambul-0.068
adders-0.066
 basement-0.063
TokenThey
Token ID2990
Feature activation-2.86e-02

Pos logit contributions
ory+0.045
va+0.043
abby+0.042
hire+0.041
 cuc+0.040
Neg logit contributions
chairs-0.067
 stations-0.064
 ambul-0.063
adders-0.063
 themselves-0.061
Token smile
Token ID8212
Feature activation-1.64e-01

Pos logit contributions
ory+0.038
abby+0.035
va+0.034
 cuc+0.034
hire+0.034
Neg logit contributions
chairs-0.049
 stations-0.048
 themselves-0.047
adders-0.047
 ambul-0.047
Token at
Token ID379
Feature activation+3.97e-03

Pos logit contributions
ory+0.264
hire+0.234
 cuc+0.228
ka+0.227
 smile+0.224
Neg logit contributions
chairs-0.268
adders-0.253
 ambul-0.236
 stations-0.234
 emergencies-0.231
Token each
Token ID1123
Feature activation-7.19e-02

Pos logit contributions
chairs+0.008
adders+0.007
 basement+0.007
 ambul+0.007
cases+0.006
Neg logit contributions
ory-0.005
hire-0.004
ka-0.004
 smile-0.004
 pineapple-0.004
Token other
Token ID584
Feature activation+8.06e-02

Pos logit contributions
ory+0.125
hire+0.120
 cuc+0.114
va+0.113
 lick+0.112
Neg logit contributions
chairs-0.102
adders-0.092
 ambul-0.088
 Michael-0.085
 stations-0.084
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
chairs+0.155
adders+0.154
 basement+0.149
 ambul+0.146
 emergencies+0.145
Neg logit contributions
 smile-0.094
ory-0.090
hire-0.076
 cuc-0.075
 lick-0.074
Token They
Token ID1119
Feature activation-3.34e-02

Pos logit contributions
ory+0.030
va+0.029
 pineapple+0.028
 cuc+0.026
Thank+0.024
Neg logit contributions
chairs-0.069
adders-0.064
 ambul-0.064
 stations-0.064
 cords-0.062
Token smiled
Token ID13541
Feature activation-8.09e-02

Pos logit contributions
 cuc+0.030
 smile+0.029
ory+0.028
 che+0.028
 mineral+0.027
Neg logit contributions
chairs-0.044
 stations-0.043
adders-0.043
 ambul-0.043
 emergencies-0.042
Token a
Token ID257
Feature activation-5.17e-02

Pos logit contributions
ory+0.132
hire+0.119
 cuc+0.115
ka+0.113
va+0.110
Neg logit contributions
chairs-0.142
adders-0.133
 stations-0.129
 ambul-0.126
 emergencies-0.124
Token big
Token ID1263
Feature activation-4.70e-02

Pos logit contributions
ory+0.079
hire+0.071
ka+0.069
abby+0.069
 cuc+0.069
Neg logit contributions
 until-0.069
 tracks-0.069
 Michael-0.067
 stations-0.067
 ambul-0.067
Token,
Token ID11
Feature activation-3.03e-02

Pos logit contributions
ory+0.044
abby+0.039
hire+0.039
 cuc+0.038
va+0.037
Neg logit contributions
chairs-0.097
 stations-0.096
 ambul-0.095
 themselves-0.095
 tracks-0.094
Token joyful
Token ID48977
Feature activation-1.31e-03

Pos logit contributions
 cuc+0.027
ory+0.027
 che+0.025
 mineral+0.025
 juicy+0.025
Neg logit contributions
chairs-0.068
adders-0.067
 stations-0.066
 themselves-0.063
 ambul-0.062
Token smile
Token ID8212
Feature activation-1.63e-01

Pos logit contributions
ory+0.002
va+0.001
 cuc+0.001
hire+0.001
 mineral+0.001
Neg logit contributions
chairs-0.003
adders-0.003
 stations-0.002
 ambul-0.002
 themselves-0.002
Token.
Token ID13
Feature activation-2.67e-02

Pos logit contributions
ory+0.257
hire+0.247
ka+0.237
 cuc+0.231
va+0.226
Neg logit contributions
chairs-0.263
adders-0.254
 stations-0.246
 emergencies-0.240
 ambul-0.240
Token
Token ID198
Feature activation-1.79e-02

Pos logit contributions
va+0.033
ory+0.033
abby+0.028
 cuc+0.028
aire+0.027
Neg logit contributions
 stations-0.054
adders-0.050
 ambul-0.050
chairs-0.049
 emergencies-0.049
Token
Token ID198
Feature activation-1.45e-02

Pos logit contributions
va+0.021
abby+0.020
ory+0.020
hire+0.017
 cuc+0.017
Neg logit contributions
chairs-0.034
 stations-0.034
 ambul-0.034
adders-0.032
 emergencies-0.032
TokenSoon
Token ID28093
Feature activation+6.06e-02

Pos logit contributions
va+0.020
abby+0.019
ory+0.019
hire+0.018
 cuc+0.017
Neg logit contributions
chairs-0.026
 stations-0.025
 ambul-0.024
adders-0.024
 tracks-0.024
Token they
Token ID484
Feature activation+3.10e-02

Pos logit contributions
chairs+0.110
adders+0.104
 stations+0.104
 rails+0.098
 basement+0.097
Neg logit contributions
ory-0.089
va-0.087
 cuc-0.081
 che-0.077
 ruby-0.076
Token cuc
Token ID38421
Feature activation-1.62e-01

Pos logit contributions
ory+0.007
abby+0.006
ka+0.006
hire+0.006
va+0.006
Neg logit contributions
 ambul-0.010
 stations-0.010
 themselves-0.010
 basement-0.010
 emergencies-0.010
Tokenumber
Token ID4494
Feature activation-1.42e-02

Pos logit contributions
va+0.290
ka+0.274
 smile+0.271
 swoop+0.271
lder+0.267
Neg logit contributions
adders-0.172
 together-0.136
 themselves-0.134
oses-0.134
chairs-0.129
Token.
Token ID13
Feature activation-2.79e-04

Pos logit contributions
ory+0.021
 cuc+0.020
hire+0.020
 pineapple+0.019
abby+0.018
Neg logit contributions
 stations-0.023
chairs-0.022
 basement-0.022
 ambul-0.021
 emergencies-0.021
Token Mama
Token ID40084
Feature activation+2.69e-02

Pos logit contributions
ory+0.000
 cuc+0.000
hire+0.000
 pineapple+0.000
 mineral+0.000
Neg logit contributions
chairs-0.000
 stations-0.000
 ambul-0.000
 Michael-0.000
adders-0.000
Token puts
Token ID7584
Feature activation-1.37e-02

Pos logit contributions
chairs+0.045
 stations+0.044
adders+0.043
 themselves+0.043
 ambul+0.042
Neg logit contributions
ory-0.037
 cuc-0.035
hire-0.034
abby-0.033
va-0.033
Token cuc
Token ID38421
Feature activation-1.62e-01

Pos logit contributions
ory+0.026
 cuc+0.025
hire+0.025
ka+0.024
abby+0.024
Neg logit contributions
chairs-0.017
 stations-0.016
 ambul-0.015
adders-0.015
 Michael-0.015
Tokenumbers
Token ID17024
Feature activation+1.39e-02

Pos logit contributions
va+0.298
ory+0.276
lder+0.273
 complexion+0.273
 mineral+0.272
Neg logit contributions
adders-0.189
 themselves-0.176
 together-0.158
chairs-0.154
oses-0.142
Token in
Token ID287
Feature activation-2.51e-02

Pos logit contributions
chairs+0.026
 Michael+0.024
 stations+0.024
 ambul+0.023
 basement+0.023
Neg logit contributions
ory-0.020
 cuc-0.018
hire-0.018
abby-0.017
 mineral-0.017
Token our
Token ID674
Feature activation+2.64e-02

Pos logit contributions
ory+0.046
 cuc+0.045
hire+0.043
abby+0.042
 mineral+0.041
Neg logit contributions
chairs-0.034
 stations-0.031
adders-0.030
 Michael-0.029
 ambul-0.029
Token salad
Token ID20698
Feature activation-5.62e-02

Pos logit contributions
 themselves+0.037
 tracks+0.037
chairs+0.037
 stations+0.036
 ambul+0.036
Neg logit contributions
ory-0.040
abby-0.039
va-0.039
hire-0.038
ka-0.037
Token.
Token ID13
Feature activation-7.39e-04

Pos logit contributions
ory+0.088
 cuc+0.084
abby+0.077
 mineral+0.075
hire+0.072
Neg logit contributions
chairs-0.095
 Michael-0.095
 stations-0.095
 ambul-0.095
 emergencies-0.091
Token Sam
Token ID3409
Feature activation+2.39e-02

Pos logit contributions
adders+0.001
chairs+0.001
 basement+0.000
 stations+0.000
 fault+0.000
Neg logit contributions
 che-0.001
 lick-0.001
 l-0.001
 pineapple-0.001
ory-0.001
Token said
Token ID531
Feature activation+6.97e-03

Pos logit contributions
 emergencies+0.048
 stations+0.047
adders+0.047
 basement+0.046
chairs+0.046
Neg logit contributions
ory-0.027
va-0.025
 cuc-0.024
hire-0.023
abby-0.023
Token,
Token ID11
Feature activation+5.94e-04

Pos logit contributions
 stations+0.012
chairs+0.012
 ambul+0.011
 basement+0.011
 emergencies+0.011
Neg logit contributions
ory-0.011
 cuc-0.010
va-0.009
ka-0.009
hire-0.009
Token pet
Token ID4273
Feature activation-4.00e-02

Pos logit contributions
chairs+0.001
 stations+0.001
adders+0.001
 themselves+0.001
 ambul+0.001
Neg logit contributions
ory-0.001
 cuc-0.001
hire-0.001
abby-0.001
va-0.001
Tokenting
Token ID889
Feature activation+6.53e-02

Pos logit contributions
ory+0.051
ka+0.046
 cuc+0.042
va+0.041
hire+0.041
Neg logit contributions
chairs-0.076
adders-0.074
 emergencies-0.074
 stations-0.073
 basement-0.073
Token Rex
Token ID17853
Feature activation+7.86e-02

Pos logit contributions
adders+0.081
chairs+0.076
 basement+0.074
 emergencies+0.073
 stations+0.066
Neg logit contributions
ory-0.111
 che-0.110
 l-0.110
ka-0.107
 cuc-0.107
Token.
Token ID13
Feature activation-5.36e-04

Pos logit contributions
chairs+0.132
 stations+0.125
 emergencies+0.124
 ambul+0.123
 basement+0.123
Neg logit contributions
ory-0.114
 cuc-0.104
hire-0.102
 che-0.100
 lick-0.100
Token "
Token ID366
Feature activation+1.64e-03

Pos logit contributions
 cuc+0.000
ory+0.000
va+0.000
 lick+0.000
 che+0.000
Neg logit contributions
 stations-0.001
chairs-0.001
adders-0.001
 basement-0.001
 supervision-0.001
TokenBut
Token ID1537
Feature activation+4.74e-02

Pos logit contributions
chairs+0.003
 stations+0.002
 ambul+0.002
adders+0.002
 themselves+0.002
Neg logit contributions
ory-0.002
hire-0.002
 cuc-0.002
abby-0.002
va-0.002
Token we
Token ID356
Feature activation+4.20e-02

Pos logit contributions
chairs+0.096
adders+0.087
 stations+0.086
 basement+0.083
 rails+0.078
Neg logit contributions
 mineral-0.059
 cuc-0.055
 che-0.053
 ruby-0.052
 pineapple-0.051
Token have
Token ID423
Feature activation+2.77e-03

Pos logit contributions
chairs+0.071
 stations+0.069
 themselves+0.069
adders+0.068
 ambul+0.067
Neg logit contributions
ory-0.056
abby-0.052
 cuc-0.052
va-0.052
hire-0.052
Token want
Token ID765
Feature activation-2.39e-02

Pos logit contributions
ory+0.054
 cuc+0.048
hire+0.048
abby+0.048
va+0.048
Neg logit contributions
chairs-0.086
 stations-0.083
 themselves-0.082
 ambul-0.081
adders-0.081
Token to
Token ID284
Feature activation-4.55e-02

Pos logit contributions
 taste+0.024
 pineapple+0.024
 mineral+0.022
ory+0.022
 lobster+0.021
Neg logit contributions
chairs-0.050
 stations-0.049
 basement-0.046
adders-0.046
 ambul-0.044
Token play
Token ID711
Feature activation+3.57e-03

Pos logit contributions
ory+0.053
hire+0.046
 cuc+0.045
va+0.045
abby+0.045
Neg logit contributions
chairs-0.091
adders-0.086
 stations-0.085
 ambul-0.083
 Michael-0.080
Token with
Token ID351
Feature activation-4.71e-02

Pos logit contributions
chairs+0.007
 ambul+0.007
adders+0.007
 Ali+0.007
 supervision+0.007
Neg logit contributions
 mineral-0.003
 trophy-0.003
 trade-0.003
 che-0.003
 score-0.003
Token Sue
Token ID26089
Feature activation-2.16e-02

Pos logit contributions
 mineral+0.045
 pineapple+0.038
 che+0.037
 nut+0.037
ory+0.036
Neg logit contributions
adders-0.089
 stations-0.089
chairs-0.087
 loft-0.084
 basement-0.084
Token anymore
Token ID7471
Feature activation+7.10e-02

Pos logit contributions
ory+0.030
by+0.025
 che+0.024
 beetle+0.024
 cuc+0.023
Neg logit contributions
adders-0.039
 stations-0.038
 emergencies-0.037
chairs-0.036
 basement-0.036
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
 stations+0.142
adders+0.135
chairs+0.130
 basement+0.129
 ambul+0.129
Neg logit contributions
ory-0.077
va-0.073
 che-0.071
 mineral-0.070
 cuc-0.070
Token He
Token ID679
Feature activation-5.27e-02

Pos logit contributions
ory+0.016
 pineapple+0.015
va+0.015
 cuc+0.015
abby+0.014
Neg logit contributions
 stations-0.029
chairs-0.027
 ambul-0.026
adders-0.026
 emergencies-0.025
Token took
Token ID1718
Feature activation-3.90e-02

Pos logit contributions
ory+0.066
 cuc+0.060
hire+0.059
va+0.058
abby+0.058
Neg logit contributions
chairs-0.095
 stations-0.093
 themselves-0.091
adders-0.091
 ambul-0.090
Token the
Token ID262
Feature activation-1.65e-02

Pos logit contributions
hire+0.060
ka+0.060
ory+0.058
 cuc+0.056
 pineapple+0.056
Neg logit contributions
chairs-0.058
 stations-0.057
 ambul-0.055
 basement-0.053
 emergencies-0.052
Token dish
Token ID9433
Feature activation-2.91e-02

Pos logit contributions
 cuc+0.022
ory+0.021
 mineral+0.021
 pineapple+0.020
hire+0.020
Neg logit contributions
chairs-0.029
 themselves-0.028
 stations-0.028
 Michael-0.027
adders-0.027
Token look
Token ID804
Feature activation-6.95e-02

Pos logit contributions
chairs+0.022
adders+0.021
 stations+0.019
 emergencies+0.019
 ambul+0.019
Neg logit contributions
 cuc-0.015
ory-0.015
hire-0.015
 swoop-0.015
 taste-0.014
Token up
Token ID510
Feature activation+3.18e-02

Pos logit contributions
ory+0.119
hire+0.106
 cuc+0.106
ka+0.106
 mineral+0.099
Neg logit contributions
chairs-0.098
 stations-0.094
 ambul-0.093
adders-0.090
 emergencies-0.088
Token.
Token ID13
Feature activation+3.31e-03

Pos logit contributions
chairs+0.044
 stations+0.042
adders+0.040
 ambul+0.040
 Michael+0.038
Neg logit contributions
ory-0.056
 cuc-0.051
hire-0.051
abby-0.050
ka-0.049
Token It
Token ID632
Feature activation-8.83e-03

Pos logit contributions
chairs+0.005
 stations+0.005
 ambul+0.005
adders+0.005
 rails+0.005
Neg logit contributions
ory-0.005
 cuc-0.005
va-0.005
 pineapple-0.004
 mineral-0.004
Token was
Token ID373
Feature activation+3.42e-02

Pos logit contributions
ory+0.013
hire+0.012
va+0.012
 cuc+0.012
abby+0.012
Neg logit contributions
chairs-0.014
adders-0.013
 stations-0.013
 themselves-0.013
 ambul-0.013
Token raining
Token ID43079
Feature activation+9.64e-02

Pos logit contributions
chairs+0.059
adders+0.052
cases+0.049
ird+0.049
 stations+0.048
Neg logit contributions
 taste-0.042
 smell-0.042
 cuc-0.041
 che-0.041
hire-0.039
Token hard
Token ID1327
Feature activation+1.57e-02

Pos logit contributions
adders+0.126
chairs+0.125
 emergencies+0.124
 themselves+0.124
 ambul+0.121
Neg logit contributions
ka-0.144
ory-0.140
abby-0.136
hire-0.135
 cuc-0.132
Token,
Token ID11
Feature activation+1.03e-04

Pos logit contributions
adders+0.026
 themselves+0.025
 stations+0.024
 911+0.024
 supervision+0.024
Neg logit contributions
ka-0.018
 che-0.018
 mineral-0.017
ory-0.017
 cuc-0.017
Token and
Token ID290
Feature activation+8.82e-03

Pos logit contributions
chairs+0.000
adders+0.000
ird+0.000
 emergencies+0.000
 911+0.000
Neg logit contributions
 smell-0.000
 taste-0.000
 che-0.000
 smile-0.000
 ate-0.000
Token the
Token ID262
Feature activation-3.34e-03

Pos logit contributions
ird+0.016
adders+0.016
chairs+0.015
 fault+0.014
oses+0.014
Neg logit contributions
 smell-0.009
 che-0.009
 taste-0.009
 smelled-0.009
 swoop-0.008
Token wind
Token ID2344
Feature activation-5.42e-02

Pos logit contributions
 mineral+0.004
 cuc+0.004
 beetle+0.004
 taste+0.004
 pineapple+0.004
Neg logit contributions
chairs-0.006
adders-0.005
 themselves-0.005
 Michael-0.005
 stations-0.005
Token away
Token ID1497
Feature activation-2.09e-02

Pos logit contributions
chairs+0.010
adders+0.010
 Michael+0.010
 ambul+0.009
 Ali+0.009
Neg logit contributions
ory-0.009
ka-0.008
hire-0.008
 cuc-0.007
 mineral-0.007
Token.
Token ID13
Feature activation+6.64e-03

Pos logit contributions
ory+0.021
 mineral+0.018
 cuc+0.018
hire+0.016
 caul+0.016
Neg logit contributions
chairs-0.045
 Michael-0.040
 Kenny-0.040
 Tyler-0.039
 rails-0.038
Token We
Token ID775
Feature activation+4.23e-02

Pos logit contributions
chairs+0.012
adders+0.011
cases+0.010
 stations+0.010
 keys+0.010
Neg logit contributions
ory-0.007
 cuc-0.007
 pineapple-0.006
 cabbage-0.006
 grant-0.006
Token can
Token ID460
Feature activation+2.32e-02

Pos logit contributions
chairs+0.067
 stations+0.066
 themselves+0.065
adders+0.064
 ambul+0.064
Neg logit contributions
ory-0.061
abby-0.057
va-0.057
 cuc-0.057
hire-0.056
Token see
Token ID766
Feature activation+1.94e-02

Pos logit contributions
chairs+0.042
 stations+0.040
adders+0.039
 ambul+0.038
 themselves+0.037
Neg logit contributions
ory-0.031
hire-0.028
 cuc-0.027
abby-0.027
 mineral-0.026
Token other
Token ID584
Feature activation+1.18e-01

Pos logit contributions
chairs+0.039
liv+0.038
 Michael+0.035
 Kenny+0.034
 Tyler+0.034
Neg logit contributions
 cuc-0.024
 caul-0.022
 jewel-0.021
 peach-0.020
ory-0.020
Token fish
Token ID5916
Feature activation+8.63e-03

Pos logit contributions
chairs+0.207
 Michael+0.190
 Chloe+0.178
adders+0.176
 Kenny+0.173
Neg logit contributions
ory-0.171
 cuc-0.171
 mineral-0.157
hire-0.152
 lobster-0.148
Token at
Token ID379
Feature activation+3.70e-02

Pos logit contributions
chairs+0.017
 Michael+0.015
 ambul+0.014
 Tyler+0.014
 stations+0.014
Neg logit contributions
ory-0.012
 cuc-0.011
 mineral-0.011
fish-0.010
 lobster-0.010
Token the
Token ID262
Feature activation+9.52e-04

Pos logit contributions
chairs+0.066
adders+0.059
 Michael+0.058
 ambul+0.056
cases+0.056
Neg logit contributions
 cuc-0.051
 mineral-0.051
ory-0.049
 ruby-0.046
 pineapple-0.045
Token aquarium
Token ID33868
Feature activation+1.59e-02

Pos logit contributions
chairs+0.002
adders+0.001
 Michael+0.001
 ambul+0.001
 themselves+0.001
Neg logit contributions
ory-0.001
 cuc-0.001
 mineral-0.001
hire-0.001
 pineapple-0.001
Token,
Token ID11
Feature activation+2.95e-03

Pos logit contributions
chairs+0.031
 Kenny+0.031
 Michael+0.030
 Kathy+0.030
 ambul+0.029
Neg logit contributions
ory-0.022
 cuc-0.019
abby-0.017
 pineapple-0.017
ater-0.016
Token They
Token ID1119
Feature activation-3.54e-03

Pos logit contributions
ory+0.006
va+0.005
 cuc+0.005
 pineapple+0.005
hire+0.005
Neg logit contributions
chairs-0.006
 stations-0.006
adders-0.006
 ambul-0.006
 emergencies-0.006
Token promise
Token ID6991
Feature activation-2.38e-02

Pos logit contributions
ory+0.005
va+0.005
abby+0.005
hire+0.005
 cuc+0.005
Neg logit contributions
 themselves-0.005
 stations-0.005
chairs-0.005
adders-0.005
 Rex-0.005
Token to
Token ID284
Feature activation-9.53e-03

Pos logit contributions
ory+0.033
by+0.030
lder+0.029
va+0.029
 taste+0.028
Neg logit contributions
chairs-0.043
adders-0.042
 stations-0.038
 emergencies-0.038
 cords-0.037
Token see
Token ID766
Feature activation+1.33e-02

Pos logit contributions
ory+0.012
hire+0.011
abby+0.011
 cuc+0.011
va+0.011
Neg logit contributions
chairs-0.017
 stations-0.016
adders-0.016
 ambul-0.016
 themselves-0.016
Token each
Token ID1123
Feature activation-4.26e-02

Pos logit contributions
chairs+0.026
 basement+0.024
adders+0.024
case+0.023
cases+0.023
Neg logit contributions
ory-0.016
 l-0.014
 c-0.014
 caul-0.014
 taste-0.013
Token other
Token ID584
Feature activation+1.15e-01

Pos logit contributions
ory+0.079
hire+0.074
 cuc+0.071
va+0.070
 lick+0.070
Neg logit contributions
chairs-0.056
adders-0.054
 Michael-0.048
 stations-0.047
 basement-0.046
Token again
Token ID757
Feature activation+6.10e-02

Pos logit contributions
chairs+0.213
adders+0.199
 ambul+0.190
 emergencies+0.189
 Michael+0.181
Neg logit contributions
ory-0.157
 smile-0.154
 taste-0.135
hire-0.134
by-0.134
Token.
Token ID13
Feature activation-4.93e-03

Pos logit contributions
chairs+0.145
adders+0.132
 stations+0.126
 basement+0.123
 ambul+0.123
Neg logit contributions
ory-0.068
va-0.052
by-0.051
 token-0.050
 che-0.050
Token They
Token ID1119
Feature activation-5.35e-03

Pos logit contributions
ory+0.007
va+0.006
 pineapple+0.006
 cuc+0.006
 mineral+0.006
Neg logit contributions
chairs-0.008
 stations-0.008
adders-0.008
 ambul-0.008
 cords-0.007
Token go
Token ID467
Feature activation+1.17e-02

Pos logit contributions
ory+0.008
va+0.007
abby+0.007
 cuc+0.007
hire+0.007
Neg logit contributions
 themselves-0.008
chairs-0.008
 stations-0.008
adders-0.008
 Rex-0.008
Token home
Token ID1363
Feature activation-5.44e-03

Pos logit contributions
chairs+0.017
 stations+0.016
 ambul+0.016
 emergencies+0.015
 Michael+0.015
Neg logit contributions
ory-0.020
abby-0.018
va-0.017
hire-0.017
 cuc-0.017
Token room
Token ID2119
Feature activation+1.17e-02

Pos logit contributions
chairs+0.008
 stations+0.008
adders+0.007
 ambul+0.007
 themselves+0.007
Neg logit contributions
ory-0.005
 cuc-0.004
 mineral-0.004
hire-0.004
 ruby-0.004
Token.
Token ID13
Feature activation+9.36e-03

Pos logit contributions
chairs+0.021
 ambul+0.021
 stations+0.021
 emergencies+0.021
adders+0.020
Neg logit contributions
ory-0.015
 cuc-0.014
 che-0.012
abby-0.012
hire-0.012
Token Now
Token ID2735
Feature activation+2.67e-02

Pos logit contributions
 stations+0.017
chairs+0.016
 ambul+0.016
adders+0.016
 emergencies+0.016
Neg logit contributions
ory-0.012
 cuc-0.011
va-0.011
hire-0.011
 mineral-0.011
Token he
Token ID339
Feature activation+2.16e-02

Pos logit contributions
chairs+0.049
 stations+0.047
adders+0.047
 basement+0.045
 ambul+0.044
Neg logit contributions
hire-0.033
ory-0.032
 che-0.032
 mineral-0.032
 cuc-0.032
Token could
Token ID714
Feature activation-1.17e-03

Pos logit contributions
chairs+0.035
 stations+0.034
 themselves+0.033
 ambul+0.033
adders+0.033
Neg logit contributions
ory-0.031
 cuc-0.029
hire-0.029
abby-0.029
va-0.028
Token see
Token ID766
Feature activation+1.34e-02

Pos logit contributions
ory+0.001
hire+0.001
 cuc+0.001
abby+0.001
 mineral+0.001
Neg logit contributions
chairs-0.002
 stations-0.002
 ambul-0.002
adders-0.002
 themselves-0.002
Token it
Token ID340
Feature activation+8.26e-03

Pos logit contributions
adders+0.025
 basement+0.024
chairs+0.023
 cinema+0.022
 ambul+0.022
Neg logit contributions
 mineral-0.015
 ruby-0.014
 cuc-0.013
 Crown-0.013
hire-0.013
Token all
Token ID477
Feature activation-2.20e-02

Pos logit contributions
adders+0.014
chairs+0.013
 ambul+0.013
 emergencies+0.013
 stations+0.013
Neg logit contributions
 taste-0.011
 mineral-0.011
ory-0.011
 che-0.011
hire-0.010
Token the
Token ID262
Feature activation+1.32e-03

Pos logit contributions
ory+0.033
abby+0.032
 mineral+0.031
 cuc+0.031
va+0.031
Neg logit contributions
adders-0.033
chairs-0.033
 stations-0.031
 ambul-0.031
 themselves-0.031
Token time
Token ID640
Feature activation-3.50e-02

Pos logit contributions
chairs+0.002
 stations+0.002
 themselves+0.002
adders+0.002
 ambul+0.002
Neg logit contributions
ory-0.002
 cuc-0.002
hire-0.002
abby-0.002
va-0.002
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
ory+0.038
 che+0.033
 mineral+0.032
by+0.031
 cuc+0.031
Neg logit contributions
 ambul-0.068
chairs-0.068
 basement-0.065
 stations-0.065
 911-0.064
Token,
Token ID11
Feature activation+7.35e-03

Pos logit contributions
ory+0.029
 cuc+0.027
hire+0.027
abby+0.027
va+0.027
Neg logit contributions
chairs-0.021
 stations-0.020
 themselves-0.019
 ambul-0.019
adders-0.019
Token what
Token ID644
Feature activation+4.33e-03

Pos logit contributions
chairs+0.010
 stations+0.009
 ambul+0.009
adders+0.009
 themselves+0.009
Neg logit contributions
ory-0.013
 cuc-0.012
hire-0.012
va-0.012
abby-0.012
Token is
Token ID318
Feature activation+4.39e-02

Pos logit contributions
chairs+0.006
 stations+0.005
 themselves+0.005
 ambul+0.005
adders+0.005
Neg logit contributions
ory-0.008
 cuc-0.007
va-0.007
hire-0.007
abby-0.007
Token this
Token ID428
Feature activation-1.61e-03

Pos logit contributions
 themselves+0.033
 stations+0.031
chairs+0.031
adders+0.031
 Rex+0.030
Neg logit contributions
ory-0.103
abby-0.099
va-0.097
hire-0.095
 cuc-0.094
Token?"
Token ID1701
Feature activation+3.60e-02

Pos logit contributions
ory+0.002
abby+0.001
hire+0.001
 cuc+0.001
va+0.001
Neg logit contributions
chairs-0.003
 stations-0.003
 themselves-0.003
adders-0.003
 ambul-0.003
Token Lily
Token ID20037
Feature activation+2.90e-02

Pos logit contributions
chairs+0.047
 stations+0.046
adders+0.044
 ambul+0.044
 themselves+0.043
Neg logit contributions
ory-0.064
 cuc-0.060
va-0.060
hire-0.059
abby-0.059
Token asked
Token ID1965
Feature activation-1.69e-02

Pos logit contributions
chairs+0.053
 stations+0.053
adders+0.051
 themselves+0.051
 ambul+0.051
Neg logit contributions
ory-0.036
 cuc-0.032
va-0.032
abby-0.032
hire-0.032
Token.
Token ID13
Feature activation+7.72e-03

Pos logit contributions
ory+0.023
 cuc+0.019
ka+0.019
 smile+0.018
va+0.018
Neg logit contributions
 stations-0.032
chairs-0.032
 ambul-0.030
 rails-0.029
 basement-0.028
Token
Token ID198
Feature activation+1.99e-02

Pos logit contributions
 stations+0.011
chairs+0.011
 ambul+0.011
adders+0.010
 themselves+0.010
Neg logit contributions
ory-0.013
 cuc-0.012
va-0.012
abby-0.011
hire-0.011
Token
Token ID198
Feature activation+2.00e-02

Pos logit contributions
 stations+0.034
chairs+0.033
 ambul+0.032
adders+0.030
 emergencies+0.030
Neg logit contributions
ory-0.029
va-0.028
abby-0.027
lder-0.025
hire-0.025
Token"
Token ID1
Feature activation-3.41e-02

Pos logit contributions
chairs+0.039
 stations+0.038
 ambul+0.037
adders+0.037
 themselves+0.037
Neg logit contributions
ory-0.023
va-0.020
abby-0.020
hire-0.020
 cuc-0.020
Token play
Token ID711
Feature activation+2.21e-02

Pos logit contributions
ory+0.020
abby+0.019
hire+0.018
va+0.018
 cuc+0.018
Neg logit contributions
chairs-0.026
 stations-0.026
 themselves-0.026
 ambul-0.025
adders-0.025
Token outside
Token ID2354
Feature activation+1.95e-02

Pos logit contributions
chairs+0.030
 stations+0.028
adders+0.028
 ambul+0.028
 themselves+0.028
Neg logit contributions
ory-0.038
 cuc-0.036
hire-0.035
va-0.035
abby-0.035
Token on
Token ID319
Feature activation+3.70e-02

Pos logit contributions
chairs+0.036
adders+0.036
 stations+0.034
 emergencies+0.034
 Michael+0.033
Neg logit contributions
 cuc-0.024
 mineral-0.022
 che-0.021
ory-0.021
ka-0.020
Token the
Token ID262
Feature activation-2.13e-03

Pos logit contributions
chairs+0.060
adders+0.060
 stations+0.057
oses+0.054
 ambul+0.053
Neg logit contributions
 mineral-0.052
 cuc-0.052
 che-0.047
 pineapple-0.046
ory-0.045
Token ground
Token ID2323
Feature activation+3.45e-03

Pos logit contributions
ory+0.003
 cuc+0.003
hire+0.003
 mineral+0.003
abby+0.003
Neg logit contributions
chairs-0.003
adders-0.003
 Michael-0.003
 stations-0.003
 ambul-0.003
Token.
Token ID13
Feature activation+6.23e-03

Pos logit contributions
 Michael+0.007
 emergencies+0.007
 ambul+0.007
 911+0.006
chairs+0.006
Neg logit contributions
 cuc-0.004
 mineral-0.003
 che-0.003
ory-0.003
 lobster-0.003
Token One
Token ID1881
Feature activation+3.02e-02

Pos logit contributions
chairs+0.009
 stations+0.008
 themselves+0.008
 Michael+0.008
 ambul+0.008
Neg logit contributions
ory-0.011
va-0.010
hire-0.010
abby-0.010
 cuc-0.010
Token day
Token ID1110
Feature activation-1.84e-03

Pos logit contributions
chairs+0.043
 stations+0.042
 themselves+0.041
 ambul+0.040
adders+0.040
Neg logit contributions
ory-0.050
 cuc-0.046
hire-0.046
abby-0.046
va-0.046
Token,
Token ID11
Feature activation+2.37e-03

Pos logit contributions
ory+0.004
abby+0.004
hire+0.004
 cuc+0.004
va+0.004
Neg logit contributions
chairs-0.002
 Michael-0.002
 themselves-0.002
 ambul-0.001
 stations-0.001
Token Robert
Token ID5199
Feature activation-6.03e-02

Pos logit contributions
chairs+0.003
 stations+0.003
 themselves+0.003
 ambul+0.003
 Michael+0.003
Neg logit contributions
ory-0.004
abby-0.004
 cuc-0.004
hire-0.004
va-0.004
Token was
Token ID373
Feature activation+3.18e-02

Pos logit contributions
ory+0.073
 cuc+0.066
hire+0.066
abby+0.066
va+0.065
Neg logit contributions
chairs-0.111
 stations-0.108
 themselves-0.106
 ambul-0.105
adders-0.105
Token to
Token ID284
Feature activation-2.36e-03

Pos logit contributions
chairs+0.117
 emergencies+0.111
 stations+0.110
 basement+0.109
 ambul+0.108
Neg logit contributions
ory-0.105
by-0.093
 cuc-0.092
 Crown-0.087
va-0.087
Token follow
Token ID1061
Feature activation+1.19e-02

Pos logit contributions
ory+0.002
 cuc+0.002
hire+0.002
 mineral+0.002
abby+0.002
Neg logit contributions
chairs-0.005
 stations-0.005
adders-0.005
 ambul-0.005
 rails-0.005
Token her
Token ID607
Feature activation-3.23e-03

Pos logit contributions
adders+0.023
chairs+0.022
 stations+0.022
 basement+0.021
 fault+0.021
Neg logit contributions
ory-0.015
hire-0.012
 cuc-0.012
 mor-0.012
 mineral-0.012
Token.
Token ID13
Feature activation+7.72e-03

Pos logit contributions
ory+0.003
 smile+0.003
by+0.003
 mineral+0.003
 Crown+0.003
Neg logit contributions
chairs-0.007
 ambul-0.007
adders-0.006
 stations-0.006
 emergencies-0.006
Token
Token ID220
Feature activation-3.84e-02

Pos logit contributions
 stations+0.016
adders+0.015
 emergencies+0.014
chairs+0.014
 ambul+0.014
Neg logit contributions
ory-0.009
va-0.009
 cuc-0.008
aire-0.008
abby-0.007
Token
Token ID198
Feature activation+1.84e-02

Pos logit contributions
abby+0.035
ory+0.034
va+0.030
app+0.028
are+0.027
Neg logit contributions
 stations-0.085
 ambul-0.079
 radios-0.075
 trains-0.074
 basement-0.074
Token
Token ID198
Feature activation+1.84e-02

Pos logit contributions
 stations+0.030
chairs+0.030
 ambul+0.028
adders+0.028
 emergencies+0.027
Neg logit contributions
va-0.027
ory-0.026
abby-0.026
 cuc-0.024
hire-0.024
TokenThe
Token ID464
Feature activation+2.50e-02

Pos logit contributions
chairs+0.030
 stations+0.028
adders+0.028
 ambul+0.027
 tracks+0.027
Neg logit contributions
ory-0.027
va-0.026
abby-0.026
hire-0.026
 cuc-0.025
Token nurse
Token ID15849
Feature activation+6.91e-02

Pos logit contributions
chairs+0.036
adders+0.036
 themselves+0.036
 stations+0.035
 Michael+0.032
Neg logit contributions
ory-0.038
va-0.037
hire-0.036
lder-0.035
ka-0.035
Token followed
Token ID3940
Feature activation+1.37e-02

Pos logit contributions
 stations+0.146
 emergencies+0.145
chairs+0.143
adders+0.141
 basement+0.141
Neg logit contributions
ory-0.073
va-0.069
 cuc-0.062
abby-0.061
ka-0.061
Token the
Token ID262
Feature activation+5.06e-03

Pos logit contributions
adders+0.025
chairs+0.024
 stations+0.024
 basement+0.023
 fault+0.022
Neg logit contributions
ory-0.019
hire-0.015
 cuc-0.015
 mor-0.015
va-0.015
Token'm
Token ID1101
Feature activation-9.38e-04

Pos logit contributions
ory+0.079
abby+0.071
va+0.071
 cuc+0.071
hire+0.070
Neg logit contributions
chairs-0.112
 stations-0.110
 themselves-0.107
adders-0.106
 ambul-0.106
Token always
Token ID1464
Feature activation-4.77e-02

Pos logit contributions
va+0.001
abby+0.001
 cuc+0.001
ory+0.001
hire+0.001
Neg logit contributions
 stations-0.002
chairs-0.002
 radios-0.002
 beds-0.002
 rails-0.002
Token here
Token ID994
Feature activation+8.65e-03

Pos logit contributions
ory+0.068
 cuc+0.066
hire+0.064
va+0.063
ka+0.062
Neg logit contributions
chairs-0.082
 stations-0.078
adders-0.073
 ambul-0.072
 themselves-0.071
Token to
Token ID284
Feature activation-2.40e-02

Pos logit contributions
chairs+0.017
 stations+0.016
adders+0.015
 basement+0.014
 ambul+0.014
Neg logit contributions
ory-0.011
va-0.010
 cuc-0.010
 token-0.010
 mineral-0.010
Token help
Token ID1037
Feature activation-2.80e-02

Pos logit contributions
ory+0.028
hire+0.027
 cuc+0.027
 mineral+0.026
va+0.026
Neg logit contributions
chairs-0.045
 stations-0.043
adders-0.042
 ambul-0.041
 themselves-0.041
Token".
Token ID1911
Feature activation-1.88e-02

Pos logit contributions
ory+0.033
va+0.031
hire+0.029
 mineral+0.029
 taste+0.029
Neg logit contributions
adders-0.052
chairs-0.052
 stations-0.049
 ambul-0.047
 basement-0.047
Token<|endoftext|>
Token ID50256
Feature activation-3.14e-02

Pos logit contributions
ory+0.025
va+0.024
abby+0.023
 cuc+0.022
 pineapple+0.021
Neg logit contributions
 stations-0.035
 ambul-0.034
chairs-0.033
 rails-0.032
adders-0.032
TokenOnce
Token ID7454
Feature activation-5.88e-02

Pos logit contributions
ory+0.039
va+0.036
 cuc+0.035
abby+0.035
 mineral+0.034
Neg logit contributions
 stations-0.057
chairs-0.056
 ambul-0.056
adders-0.055
 themselves-0.054
Token there
Token ID612
Feature activation-3.30e-03

Pos logit contributions
ory+0.055
abby+0.047
va+0.046
 cuc+0.042
hire+0.042
Neg logit contributions
 stations-0.130
chairs-0.127
 ambul-0.125
 themselves-0.123
adders-0.122
Token were
Token ID547
Feature activation+2.63e-02

Pos logit contributions
ory+0.004
va+0.004
 mineral+0.004
abby+0.004
 cuc+0.003
Neg logit contributions
 stations-0.006
adders-0.006
chairs-0.006
 emergencies-0.006
 rails-0.006
Token two
Token ID734
Feature activation+1.80e-02

Pos logit contributions
chairs+0.041
 stations+0.037
adders+0.035
 rails+0.035
 ambul+0.034
Neg logit contributions
ory-0.045
 cuc-0.041
va-0.040
ka-0.039
abby-0.038
Token hearing
Token ID4854
Feature activation-9.70e-03

Pos logit contributions
chairs+0.025
 stations+0.023
adders+0.023
 ambul+0.022
 rails+0.021
Neg logit contributions
ory-0.029
 cuc-0.029
hire-0.028
va-0.028
abby-0.028
Token her
Token ID607
Feature activation-2.82e-02

Pos logit contributions
 smile+0.012
hire+0.011
 grant+0.011
 cuc+0.011
va+0.011
Neg logit contributions
adders-0.017
chairs-0.017
 stations-0.017
 rails-0.016
 basement-0.016
Token prayers
Token ID15707
Feature activation-1.42e-02

Pos logit contributions
 smile+0.032
 taste+0.030
 token+0.030
 mineral+0.028
ory+0.028
Neg logit contributions
chairs-0.055
 stations-0.054
adders-0.053
 ambul-0.050
 themselves-0.049
Token and
Token ID290
Feature activation-7.55e-03

Pos logit contributions
ory+0.018
 smile+0.017
 token+0.016
hire+0.015
 taste+0.015
Neg logit contributions
adders-0.026
chairs-0.025
 basement-0.025
 stations-0.025
 emergencies-0.024
Token returned
Token ID4504
Feature activation-1.59e-02

Pos logit contributions
 che+0.007
 token+0.007
 smile+0.007
ory+0.007
 ate+0.007
Neg logit contributions
chairs-0.016
adders-0.015
 stations-0.015
 rails-0.014
 cables-0.014
Token home
Token ID1363
Feature activation-2.84e-02

Pos logit contributions
ory+0.016
by+0.016
 ruby+0.015
thank+0.015
 che+0.014
Neg logit contributions
chairs-0.032
adders-0.031
 stations-0.031
 ambul-0.029
 rails-0.029
Token with
Token ID351
Feature activation-3.82e-02

Pos logit contributions
ory+0.031
 smile+0.031
 token+0.031
va+0.030
 che+0.030
Neg logit contributions
 stations-0.053
chairs-0.051
adders-0.049
 ambul-0.048
 rails-0.048
Token a
Token ID257
Feature activation-3.11e-02

Pos logit contributions
 cuc+0.050
 ruby+0.050
ory+0.050
 jewel+0.049
 che+0.049
Neg logit contributions
chairs-0.065
 stations-0.063
adders-0.063
 rails-0.061
oses-0.059
Token smile
Token ID8212
Feature activation-1.57e-01

Pos logit contributions
ory+0.041
hire+0.038
abby+0.037
 cuc+0.037
ka+0.036
Neg logit contributions
chairs-0.053
 ambul-0.051
 stations-0.051
 Michael-0.050
 tracks-0.050
Token on
Token ID319
Feature activation+1.99e-02

Pos logit contributions
hire+0.240
ory+0.240
ka+0.223
 cuc+0.223
abby+0.220
Neg logit contributions
 stations-0.238
chairs-0.237
adders-0.228
 ambul-0.227
 emergencies-0.222
Token her
Token ID607
Feature activation-3.30e-02

Pos logit contributions
 themselves+0.019
chairs+0.019
 stations+0.018
 ambul+0.018
 Michael+0.018
Neg logit contributions
ory-0.042
abby-0.039
va-0.039
 cuc-0.039
hire-0.038
Token was
Token ID373
Feature activation+2.07e-02

Pos logit contributions
ory+0.000
 che+0.000
va+0.000
 cuc+0.000
 token+0.000
Neg logit contributions
chairs-0.001
adders-0.001
 stations-0.001
 emergencies-0.001
 ambul-0.001
Token brave
Token ID14802
Feature activation-7.28e-03

Pos logit contributions
 stations+0.039
chairs+0.039
 emergencies+0.036
 basement+0.035
 ambul+0.035
Neg logit contributions
abby-0.026
va-0.025
ory-0.025
hire-0.025
 che-0.024
Token and
Token ID290
Feature activation+5.54e-04

Pos logit contributions
ory+0.011
 che+0.010
by+0.010
 cuc+0.010
hire+0.010
Neg logit contributions
chairs-0.012
adders-0.012
 ambul-0.011
 basement-0.011
 stations-0.011
Token strong
Token ID1913
Feature activation-1.74e-02

Pos logit contributions
chairs+0.001
 stations+0.001
adders+0.001
 ambul+0.001
 Michael+0.001
Neg logit contributions
ory-0.001
 cuc-0.001
hire-0.001
va-0.001
 mineral-0.001
Token.
Token ID13
Feature activation-3.20e-03

Pos logit contributions
ory+0.022
 che+0.021
 mineral+0.020
 cuc+0.020
hire+0.019
Neg logit contributions
chairs-0.030
adders-0.030
 basement-0.030
 stations-0.029
 ambul-0.029
Token<|endoftext|>
Token ID50256
Feature activation-3.24e-02

Pos logit contributions
ory+0.004
va+0.004
abby+0.003
 cuc+0.003
 pineapple+0.003
Neg logit contributions
 stations-0.006
 ambul-0.006
adders-0.006
 emergencies-0.006
chairs-0.006
TokenOnce
Token ID7454
Feature activation-5.63e-02

Pos logit contributions
ory+0.045
abby+0.042
va+0.042
 mineral+0.041
 cuc+0.040
Neg logit contributions
 stations-0.053
 ambul-0.052
chairs-0.052
adders-0.049
 rails-0.049
Token upon
Token ID2402
Feature activation-7.76e-03

Pos logit contributions
ory+0.063
 cuc+0.057
hire+0.057
abby+0.056
va+0.056
Neg logit contributions
chairs-0.108
 stations-0.105
 themselves-0.104
adders-0.103
 ambul-0.103
Token a
Token ID257
Feature activation-1.29e-02

Pos logit contributions
ory+0.015
hire+0.015
 cuc+0.015
abby+0.015
va+0.015
Neg logit contributions
chairs-0.008
 stations-0.008
 ambul-0.007
adders-0.007
 themselves-0.007
Token time
Token ID640
Feature activation-3.95e-02

Pos logit contributions
ory+0.025
hire+0.024
 cuc+0.024
abby+0.024
va+0.024
Neg logit contributions
chairs-0.014
 stations-0.013
 themselves-0.013
adders-0.013
 ambul-0.013
Token,
Token ID11
Feature activation+1.64e-03

Pos logit contributions
ory+0.064
va+0.060
 cuc+0.059
abby+0.059
 mineral+0.056
Neg logit contributions
 stations-0.059
chairs-0.058
 ambul-0.053
 themselves-0.053
adders-0.052
Token come
Token ID1282
Feature activation-5.65e-02

Pos logit contributions
lder+0.035
 mineral+0.035
 taste+0.035
hire+0.034
 cuc+0.034
Neg logit contributions
chairs-0.034
 Michael-0.032
 Ali-0.031
 stations-0.031
 themselves-0.031
Token and
Token ID290
Feature activation-3.11e-02

Pos logit contributions
ory+0.083
 smile+0.074
abby+0.069
hire+0.067
 token+0.066
Neg logit contributions
chairs-0.108
 stations-0.096
 ambul-0.094
adders-0.093
 cables-0.092
Token help
Token ID1037
Feature activation-4.66e-02

Pos logit contributions
ory+0.027
 smile+0.026
 taste+0.026
 mineral+0.025
 cuc+0.025
Neg logit contributions
chairs-0.071
adders-0.067
 stations-0.066
 ambul-0.065
 emergencies-0.064
Token them
Token ID606
Feature activation-2.05e-02

Pos logit contributions
 taste+0.051
ory+0.048
 mineral+0.047
 cuc+0.043
 gu+0.043
Neg logit contributions
adders-0.091
chairs-0.088
 stations-0.084
cases-0.079
 cinema-0.078
Token.
Token ID13
Feature activation-2.95e-02

Pos logit contributions
 taste+0.024
ory+0.023
 cuc+0.023
 mineral+0.023
hire+0.022
Neg logit contributions
chairs-0.041
 stations-0.038
 Michael-0.038
 ambul-0.036
 Rex-0.036
Token They
Token ID1119
Feature activation-3.82e-02

Pos logit contributions
ory+0.034
va+0.031
 cuc+0.031
 pineapple+0.030
 mineral+0.028
Neg logit contributions
 stations-0.057
chairs-0.056
 ambul-0.053
adders-0.052
 basement-0.052
Token hugged
Token ID41130
Feature activation-4.38e-02

Pos logit contributions
ory+0.050
 cuc+0.047
hire+0.046
abby+0.045
va+0.044
Neg logit contributions
chairs-0.067
 stations-0.065
 themselves-0.063
 ambul-0.063
adders-0.063
Token each
Token ID1123
Feature activation-7.07e-02

Pos logit contributions
ory+0.060
 smile+0.050
ka+0.049
hire+0.048
 cuc+0.047
Neg logit contributions
adders-0.086
chairs-0.085
 basement-0.076
 emergencies-0.074
ird-0.071
Token other
Token ID584
Feature activation+8.77e-02

Pos logit contributions
ory+0.130
hire+0.130
 cuc+0.123
ka+0.119
 pineapple+0.119
Neg logit contributions
chairs-0.090
adders-0.083
 emergencies-0.077
 stations-0.076
 basement-0.076
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
 emergencies+0.172
chairs+0.171
adders+0.169
 basement+0.161
 ambul+0.154
Neg logit contributions
ory-0.104
 smile-0.102
hire-0.094
by-0.092
 cuc-0.085
Token waited
Token ID13488
Feature activation-1.25e-03

Pos logit contributions
ory+0.015
 che+0.014
 cuc+0.014
 smile+0.014
 ate+0.013
Neg logit contributions
chairs-0.040
adders-0.039
 stations-0.038
 emergencies-0.037
 ambul-0.037
Token blanket
Token ID18447
Feature activation+2.02e-02

Pos logit contributions
 cuc+0.007
ory+0.007
 mineral+0.007
va+0.006
 ruby+0.006
Neg logit contributions
chairs-0.012
 ambul-0.012
 themselves-0.012
adders-0.012
 stations-0.012
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
 ambul+0.042
 emergencies+0.041
 stations+0.040
chairs+0.040
 Michael+0.039
Neg logit contributions
 taste-0.019
 mineral-0.019
ory-0.019
 smile-0.018
hire-0.017
Token Her
Token ID2332
Feature activation+1.95e-03

Pos logit contributions
 stations+0.018
adders+0.017
 ambul+0.017
 emergencies+0.016
chairs+0.016
Neg logit contributions
ory-0.013
lder-0.012
va-0.012
abby-0.012
 pineapple-0.012
Token mother
Token ID2802
Feature activation+8.78e-03

Pos logit contributions
chairs+0.003
 stations+0.003
 ambul+0.003
adders+0.003
 themselves+0.003
Neg logit contributions
ory-0.003
 mineral-0.003
 beetle-0.003
 cuc-0.003
va-0.003
Token had
Token ID550
Feature activation-3.22e-02

Pos logit contributions
 stations+0.017
 themselves+0.017
chairs+0.017
adders+0.017
 basement+0.017
Neg logit contributions
ory-0.010
va-0.009
 cuc-0.008
abby-0.008
ka-0.008
Token been
Token ID587
Feature activation-3.88e-02

Pos logit contributions
 cuc+0.045
ory+0.043
va+0.042
hire+0.042
 che+0.041
Neg logit contributions
chairs-0.052
 stations-0.050
 themselves-0.048
 Michael-0.047
adders-0.047
Token showing
Token ID4478
Feature activation-9.26e-03

Pos logit contributions
ory+0.062
 cuc+0.059
va+0.058
abby+0.058
ka+0.057
Neg logit contributions
chairs-0.057
 stations-0.055
 themselves-0.054
adders-0.053
 ambul-0.051
Token her
Token ID607
Feature activation-1.07e-02

Pos logit contributions
hire+0.011
ory+0.011
ka+0.010
 taste+0.010
by+0.010
Neg logit contributions
 stations-0.016
adders-0.016
 basement-0.016
chairs-0.016
 ambul-0.016
Token how
Token ID703
Feature activation-7.06e-02

Pos logit contributions
 mineral+0.013
 taste+0.013
 cuc+0.013
 jewel+0.013
hire+0.013
Neg logit contributions
chairs-0.018
 stations-0.017
 ambul-0.017
adders-0.017
 emergencies-0.017
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
ory+0.127
 cuc+0.119
hire+0.119
abby+0.119
va+0.118
Neg logit contributions
chairs-0.089
 stations-0.086
 themselves-0.084
 ambul-0.082
adders-0.082
Token do
Token ID466
Feature activation-6.95e-03

Pos logit contributions
ory+0.011
hire+0.010
 cuc+0.010
 mineral+0.010
va+0.009
Neg logit contributions
chairs-0.027
 ambul-0.025
 stations-0.025
adders-0.025
 Michael-0.024
Token just
Token ID655
Feature activation+1.56e-02

Pos logit contributions
 stations+0.050
chairs+0.048
adders+0.047
 themselves+0.046
 beds+0.043
Neg logit contributions
 che-0.037
 cuc-0.033
ory-0.033
va-0.032
ka-0.032
Token curious
Token ID11040
Feature activation-2.20e-02

Pos logit contributions
chairs+0.027
 stations+0.025
adders+0.024
 basement+0.023
 rails+0.023
Neg logit contributions
 cuc-0.021
ory-0.021
ka-0.020
 che-0.020
 mineral-0.020
Token and
Token ID290
Feature activation+3.62e-03

Pos logit contributions
ory+0.022
 smile+0.022
ly+0.020
 che+0.020
 taste+0.019
Neg logit contributions
 basement-0.041
adders-0.041
 stations-0.040
 themselves-0.039
chairs-0.038
Token excited
Token ID6568
Feature activation+3.09e-03

Pos logit contributions
 stations+0.007
chairs+0.007
adders+0.007
 themselves+0.007
 basement+0.006
Neg logit contributions
ory-0.004
 cuc-0.004
va-0.004
 mineral-0.004
 che-0.004
Token.
Token ID13
Feature activation-4.15e-03

Pos logit contributions
 stations+0.005
chairs+0.005
adders+0.005
 ambul+0.005
 basement+0.005
Neg logit contributions
ory-0.005
va-0.004
by-0.004
ka-0.004
hire-0.004
Token He
Token ID679
Feature activation-4.42e-02

Pos logit contributions
ory+0.004
va+0.004
 cuc+0.004
 mineral+0.003
 pineapple+0.003
Neg logit contributions
 stations-0.009
chairs-0.008
adders-0.008
 rails-0.008
 themselves-0.008
Token likes
Token ID7832
Feature activation-4.24e-03

Pos logit contributions
ory+0.051
hire+0.046
va+0.045
 cuc+0.045
abby+0.044
Neg logit contributions
 stations-0.083
 themselves-0.083
chairs-0.083
adders-0.081
 ambul-0.079
Token to
Token ID284
Feature activation-1.72e-02

Pos logit contributions
 mineral+0.004
 che+0.004
 cuc+0.004
 pineapple+0.004
ory+0.004
Neg logit contributions
adders-0.008
chairs-0.008
 basement-0.008
 stations-0.008
 themselves-0.007
Token zoom
Token ID19792
Feature activation+5.59e-02

Pos logit contributions
 taste+0.014
 lick+0.014
 smile+0.014
 mineral+0.013
ory+0.013
Neg logit contributions
chairs-0.038
adders-0.036
 stations-0.035
 ambul-0.034
 emergencies-0.034
Token around
Token ID1088
Feature activation+2.31e-02

Pos logit contributions
 emergencies+0.095
 ambul+0.092
 basement+0.091
adders+0.090
chairs+0.088
Neg logit contributions
ka-0.077
ory-0.070
hire-0.069
 mineral-0.063
by-0.060
Token on
Token ID319
Feature activation+3.52e-02

Pos logit contributions
adders+0.042
chairs+0.042
 fault+0.038
cases+0.038
liv+0.037
Neg logit contributions
 mineral-0.026
 cuc-0.025
 che-0.024
ory-0.022
 ruby-0.022
Token old
Token ID1468
Feature activation-4.37e-02

Pos logit contributions
chairs+0.004
 stations+0.004
 themselves+0.004
 ambul+0.004
adders+0.004
Neg logit contributions
ory-0.002
abby-0.002
 cuc-0.002
hire-0.002
va-0.002
Token turtle
Token ID28699
Feature activation-2.65e-02

Pos logit contributions
ory+0.048
abby+0.045
hire+0.044
 cuc+0.043
ka+0.043
Neg logit contributions
 themselves-0.080
chairs-0.078
 tracks-0.077
 stations-0.077
 Rex-0.077
Token said
Token ID531
Feature activation+8.86e-03

Pos logit contributions
ory+0.031
 cuc+0.029
hire+0.028
va+0.028
abby+0.027
Neg logit contributions
 stations-0.051
chairs-0.050
 emergencies-0.050
adders-0.049
 basement-0.049
Token with
Token ID351
Feature activation-1.43e-02

Pos logit contributions
chairs+0.016
 stations+0.015
 basement+0.014
 ambul+0.014
 rails+0.014
Neg logit contributions
ory-0.013
 cuc-0.012
hire-0.012
va-0.011
 token-0.011
Token a
Token ID257
Feature activation-6.15e-03

Pos logit contributions
ory+0.025
 cuc+0.025
hire+0.025
 mineral+0.024
va+0.023
Neg logit contributions
chairs-0.019
 stations-0.017
 ambul-0.017
adders-0.017
 themselves-0.016
Token smile
Token ID8212
Feature activation-1.30e-01

Pos logit contributions
ory+0.007
abby+0.006
hire+0.006
ka+0.006
 cuc+0.006
Neg logit contributions
chairs-0.011
 ambul-0.011
 tracks-0.011
 stations-0.011
 Michael-0.011
Token.
Token ID13
Feature activation+1.60e-03

Pos logit contributions
ory+0.223
hire+0.215
 cuc+0.210
ka+0.203
abby+0.202
Neg logit contributions
chairs-0.181
 stations-0.176
adders-0.168
 ambul-0.166
 basement-0.163
Token
Token ID198
Feature activation+9.59e-03

Pos logit contributions
 stations+0.003
chairs+0.003
 ambul+0.002
adders+0.002
 beds+0.002
Neg logit contributions
ory-0.002
 cuc-0.002
va-0.002
abby-0.002
hire-0.002
Token
Token ID198
Feature activation+1.00e-02

Pos logit contributions
chairs+0.014
 stations+0.014
 ambul+0.013
adders+0.013
 Michael+0.012
Neg logit contributions
ory-0.016
abby-0.015
va-0.015
 cuc-0.015
hire-0.015
TokenHarry
Token ID18308
Feature activation-3.74e-02

Pos logit contributions
 themselves+0.016
 stations+0.015
chairs+0.015
 ambul+0.015
adders+0.015
Neg logit contributions
ory-0.015
 cuc-0.013
ka-0.013
abby-0.013
hire-0.013
Token thanked
Token ID26280
Feature activation-5.16e-02

Pos logit contributions
ory+0.053
 cuc+0.049
hire+0.049
va+0.049
abby+0.049
Neg logit contributions
chairs-0.062
 stations-0.060
adders-0.059
 ambul-0.058
 themselves-0.058
Token."
Token ID526
Feature activation+4.42e-04

Pos logit contributions
chairs+0.042
 stations+0.041
 ambul+0.040
adders+0.040
 rails+0.040
Neg logit contributions
ory-0.021
va-0.018
hire-0.018
abby-0.017
ka-0.017
Token The
Token ID383
Feature activation+1.62e-02

Pos logit contributions
 stations+0.001
chairs+0.001
adders+0.001
 ambul+0.001
 emergencies+0.001
Neg logit contributions
ory-0.001
va-0.001
 cuc-0.001
 pineapple-0.001
lder-0.000
Token balloon
Token ID21190
Feature activation-1.25e-02

Pos logit contributions
chairs+0.025
 stations+0.025
 themselves+0.024
 ambul+0.024
adders+0.023
Neg logit contributions
ory-0.024
abby-0.022
 cuc-0.022
va-0.022
hire-0.022
Token seems
Token ID2331
Feature activation+2.62e-02

Pos logit contributions
ory+0.019
abby+0.017
hire+0.017
 cuc+0.017
va+0.017
Neg logit contributions
chairs-0.019
 stations-0.019
 themselves-0.019
adders-0.018
 ambul-0.018
Token to
Token ID284
Feature activation-6.16e-03

Pos logit contributions
chairs+0.048
 stations+0.046
 ambul+0.044
adders+0.043
 basement+0.043
Neg logit contributions
ory-0.036
 cuc-0.033
hire-0.033
abby-0.031
 mineral-0.031
Token smile
Token ID8212
Feature activation-1.32e-01

Pos logit contributions
ory+0.008
 cuc+0.008
abby+0.008
va+0.008
hire+0.008
Neg logit contributions
 themselves-0.010
chairs-0.010
 stations-0.010
 ambul-0.010
adders-0.010
Token back
Token ID736
Feature activation-1.03e-02

Pos logit contributions
ory+0.183
hire+0.167
ka+0.147
 cuc+0.143
by+0.143
Neg logit contributions
chairs-0.244
adders-0.242
 stations-0.235
 ambul-0.230
 emergencies-0.226
Token at
Token ID379
Feature activation+3.47e-02

Pos logit contributions
ory+0.014
hire+0.011
by+0.011
ka+0.011
abby+0.010
Neg logit contributions
 stations-0.021
chairs-0.020
 ambul-0.019
 emergencies-0.019
adders-0.019
Token her
Token ID607
Feature activation-7.47e-03

Pos logit contributions
chairs+0.059
adders+0.055
 ambul+0.053
 rails+0.051
 stations+0.050
Neg logit contributions
ory-0.053
va-0.048
 ruby-0.047
abby-0.047
hire-0.047
Token.
Token ID13
Feature activation+2.16e-04

Pos logit contributions
ory+0.011
 smile+0.010
hire+0.009
 ruby+0.009
 mineral+0.009
Neg logit contributions
chairs-0.014
adders-0.013
 stations-0.013
 ambul-0.013
 emergencies-0.012
Token Anna
Token ID11735
Feature activation+1.88e-02

Pos logit contributions
 stations+0.000
 ambul+0.000
chairs+0.000
adders+0.000
 rails+0.000
Neg logit contributions
ory-0.000
va-0.000
 cuc-0.000
 pineapple-0.000
hire-0.000
Token and
Token ID290
Feature activation-2.16e-03

Pos logit contributions
ory+0.029
va+0.027
 cuc+0.025
 che+0.025
 taste+0.024
Neg logit contributions
adders-0.039
 stations-0.039
chairs-0.037
 ambul-0.035
 basement-0.035
Token she
Token ID673
Feature activation+4.36e-03

Pos logit contributions
ory+0.003
 cuc+0.003
va+0.003
hire+0.003
abby+0.003
Neg logit contributions
chairs-0.004
 stations-0.004
adders-0.003
 ambul-0.003
 themselves-0.003
Token smiled
Token ID13541
Feature activation-5.79e-02

Pos logit contributions
chairs+0.007
 themselves+0.007
 Michael+0.007
 stations+0.007
 Rex+0.007
Neg logit contributions
ory-0.006
hire-0.006
abby-0.006
 cuc-0.006
va-0.005
Token a
Token ID257
Feature activation-2.55e-02

Pos logit contributions
ory+0.117
abby+0.111
 cuc+0.111
hire+0.110
va+0.110
Neg logit contributions
chairs-0.059
 stations-0.057
 themselves-0.056
 ambul-0.054
adders-0.054
Token big
Token ID1263
Feature activation-2.80e-02

Pos logit contributions
rust+0.051
ory+0.050
roc+0.050
abby+0.049
ka+0.047
Neg logit contributions
 until-0.023
 no-0.021
 more-0.019
 so-0.019
 Little-0.019
Token smile
Token ID8212
Feature activation-1.52e-01

Pos logit contributions
abby+0.045
ory+0.041
hire+0.039
va+0.038
uc+0.038
Neg logit contributions
 until-0.038
 911-0.037
 horror-0.036
 shelter-0.036
 hospital-0.036
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
ory+0.286
hire+0.271
 cuc+0.270
abby+0.267
va+0.266
Neg logit contributions
chairs-0.181
 stations-0.175
 themselves-0.168
adders-0.167
 ambul-0.167
Token
Token ID198
Feature activation-6.81e-03

Pos logit contributions
ory+0.023
va+0.022
abby+0.022
 cuc+0.022
hire+0.021
Neg logit contributions
chairs-0.021
 stations-0.021
 ambul-0.020
adders-0.020
 emergencies-0.019
Token
Token ID198
Feature activation-6.69e-03

Pos logit contributions
ory+0.012
abby+0.011
va+0.011
hire+0.011
 cuc+0.011
Neg logit contributions
chairs-0.009
 stations-0.008
 ambul-0.008
adders-0.008
 themselves-0.008
TokenWhen
Token ID2215
Feature activation+2.81e-02

Pos logit contributions
ory+0.012
 cuc+0.011
hire+0.011
ka+0.011
abby+0.011
Neg logit contributions
chairs-0.008
 themselves-0.008
 stations-0.008
 ambul-0.008
adders-0.008
Token they
Token ID484
Feature activation+2.74e-02

Pos logit contributions
chairs+0.027
 stations+0.026
adders+0.024
 ambul+0.023
 tracks+0.022
Neg logit contributions
ory-0.061
va-0.058
 cuc-0.057
abby-0.056
hire-0.056
Token beautiful
Token ID4950
Feature activation-2.50e-02

Pos logit contributions
ory+0.034
abby+0.031
hire+0.031
 cuc+0.030
va+0.030
Neg logit contributions
 tracks-0.033
chairs-0.033
 Rex-0.032
 Michael-0.031
 stations-0.031
Token hat
Token ID6877
Feature activation-5.98e-03

Pos logit contributions
ory+0.036
hire+0.033
abby+0.033
 cuc+0.032
va+0.032
Neg logit contributions
chairs-0.040
 tracks-0.038
 stations-0.038
 ambul-0.037
 themselves-0.037
Token and
Token ID290
Feature activation-1.14e-03

Pos logit contributions
ory+0.007
hire+0.007
 che+0.007
 smile+0.007
 mineral+0.007
Neg logit contributions
 stations-0.011
 emergencies-0.011
adders-0.011
 ambul-0.011
 themselves-0.011
Token a
Token ID257
Feature activation-2.38e-02

Pos logit contributions
 cuc+0.002
 mineral+0.002
 che+0.002
 ruby+0.001
va+0.001
Neg logit contributions
adders-0.002
 stations-0.002
chairs-0.002
 basement-0.002
 ambul-0.002
Token bright
Token ID6016
Feature activation-3.00e-02

Pos logit contributions
ory+0.030
 cuc+0.027
hire+0.027
abby+0.027
va+0.027
Neg logit contributions
chairs-0.042
 stations-0.041
 themselves-0.041
 ambul-0.040
adders-0.040
Token smile
Token ID8212
Feature activation-1.49e-01

Pos logit contributions
ory+0.043
abby+0.039
hire+0.039
va+0.038
 cuc+0.038
Neg logit contributions
 tracks-0.047
chairs-0.047
 stations-0.045
 Michael-0.044
 themselves-0.044
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
ory+0.234
hire+0.228
ka+0.217
 cuc+0.216
va+0.210
Neg logit contributions
chairs-0.222
 stations-0.222
adders-0.221
 ambul-0.213
 themselves-0.212
Token
Token ID220
Feature activation-5.82e-02

Pos logit contributions
ory+0.016
va+0.016
 cuc+0.015
hire+0.014
abby+0.014
Neg logit contributions
 stations-0.023
 ambul-0.022
adders-0.022
chairs-0.022
 emergencies-0.022
Token
Token ID198
Feature activation-7.31e-04

Pos logit contributions
abby+0.059
ory+0.058
va+0.055
hire+0.051
 cuc+0.048
Neg logit contributions
 stations-0.118
 ambul-0.113
chairs-0.107
 rails-0.106
 radios-0.105
Token
Token ID198
Feature activation-7.08e-04

Pos logit contributions
ory+0.001
va+0.001
abby+0.001
hire+0.001
 cuc+0.001
Neg logit contributions
chairs-0.001
 stations-0.001
 ambul-0.001
adders-0.001
 emergencies-0.001
TokenThe
Token ID464
Feature activation+6.73e-03

Pos logit contributions
ory+0.001
 cuc+0.001
abby+0.001
hire+0.001
va+0.001
Neg logit contributions
chairs-0.001
 stations-0.001
 themselves-0.001
 ambul-0.001
adders-0.001
Token her
Token ID607
Feature activation+7.90e-04

Pos logit contributions
adders+0.067
chairs+0.066
 ambul+0.065
 emergencies+0.062
 stations+0.062
Neg logit contributions
ory-0.054
 mineral-0.050
hire-0.048
ka-0.048
 ruby-0.045
Token mom
Token ID1995
Feature activation+2.66e-02

Pos logit contributions
 stations+0.001
adders+0.001
chairs+0.001
 themselves+0.001
 ambul+0.001
Neg logit contributions
ory-0.001
 mineral-0.001
 cuc-0.001
ka-0.001
va-0.001
Token with
Token ID351
Feature activation-8.75e-03

Pos logit contributions
adders+0.045
 stations+0.043
chairs+0.041
 ambul+0.041
 basement+0.041
Neg logit contributions
ory-0.039
 smile-0.035
 cuc-0.034
hire-0.033
abby-0.033
Token a
Token ID257
Feature activation+2.25e-03

Pos logit contributions
ory+0.016
 cuc+0.015
hire+0.015
abby+0.015
va+0.015
Neg logit contributions
chairs-0.011
 stations-0.011
 themselves-0.010
 ambul-0.010
adders-0.010
Token big
Token ID1263
Feature activation+4.84e-04

Pos logit contributions
chairs+0.004
 tracks+0.004
 ambul+0.004
 Michael+0.004
 stations+0.004
Neg logit contributions
ory-0.003
hire-0.003
abby-0.003
va-0.002
 cuc-0.002
Token smile
Token ID8212
Feature activation-1.23e-01

Pos logit contributions
chairs+0.001
 Michael+0.001
 tracks+0.001
 ambul+0.001
 Rex+0.001
Neg logit contributions
ory-0.001
abby-0.000
hire-0.000
va-0.000
 cuc-0.000
Token and
Token ID290
Feature activation+2.00e-02

Pos logit contributions
ory+0.185
hire+0.180
 cuc+0.169
ka+0.168
abby+0.163
Neg logit contributions
 stations-0.196
chairs-0.194
adders-0.191
 ambul-0.186
 emergencies-0.181
Token said
Token ID531
Feature activation+1.78e-02

Pos logit contributions
 stations+0.043
chairs+0.042
adders+0.041
 ambul+0.041
 emergencies+0.040
Neg logit contributions
 cuc-0.020
ory-0.020
va-0.019
 che-0.018
hire-0.017
Token,
Token ID11
Feature activation+9.40e-03

Pos logit contributions
chairs+0.025
 stations+0.024
 ambul+0.022
adders+0.022
 themselves+0.022
Neg logit contributions
ory-0.031
va-0.029
 cuc-0.029
hire-0.028
abby-0.028
Token "
Token ID366
Feature activation+1.33e-02

Pos logit contributions
chairs+0.014
 stations+0.014
adders+0.013
 ambul+0.013
 themselves+0.013
Neg logit contributions
ory-0.015
 cuc-0.014
va-0.014
hire-0.013
abby-0.013
TokenMom
Token ID29252
Feature activation-1.39e-02

Pos logit contributions
chairs+0.022
 stations+0.021
 themselves+0.021
 ambul+0.020
adders+0.020
Neg logit contributions
ory-0.019
 cuc-0.018
hire-0.018
abby-0.018
va-0.017