TOP ACTIVATIONS
MAX = 0.226

not notice that the all igator toy was not a toy
day , a quiet all igator went for a walk .
. They realized the all igator was real . They were
fun , making the all igator and the bird fight .
It was a real all igator that had escaped from the
Suddenly , the all igator snapped its jaws and bit
! The all igator said " This key is
closer and opened the all igator 's mouth . Inside she
. They left the all igator pond and went to see
up . The quiet all igator did not understand and walked
the bird , the all igator came out of the pond
we take the cute all igator home with us ?"
fun playing with the all igator . They were happy .
a big , scary all igator who lived in the wild
. The all igator was so happy to have
was lucky . The all igator did not catch him .
?" The all igator smiled and said " Open
The quiet all igator did not know what a
!" Maria and the all igator set out on their journey
stayed away from the all igator . But then , he

MIN ACTIVATIONS
MIN = -0.327

a girl who loved sun sets . Every day she would
The people , the sun sets and the b owing all
adm iring the beautiful sun sets from afar . <|endoftext|> Once
big mountains and beautiful sun sets . He felt happy and
could look at the sun sets pieces every day and remember
big mountains and beautiful sun sets . But one day ,
. She watched the sun sets and fireworks for hours .
bear . He loved sun sets and he wanted to take
bakery to buy the y umm iest pastry he could find
made him an even y umm ier snack and they both
ice cream taste even y umm ier ! And every time
to make food look y umm ier than before . She
little boy had the y umm iest treat . He asked
it and found some y umm my food inside . Leo
and it was the y umm iest ice cream she ever
little cracks and cre v ices of the rocks and trees
. They had stopped qu arre ling . To celebrate ,
. It was the y umm iest thing he ever tasted
and it was the y umm iest dinner they ever had
, picking out the y umm iest treats and sharing them

INTERVAL 0.088 - 0.226
CONTAINS 5.296%

Another truck came . It did not see crab . It
store . She says he can bring a small toy ,
her wounds . She scared away the bees with a loud
Every night the old man would laugh while he was alone
It was a very special moment for them both

INTERVAL -0.050 - 0.088
CONTAINS 80.098%

knocked on the door . Jane said " come in !".
and saw a blue dress . She pulled it and found
him that it would be fun . As they were getting
, and they all tried their best . Whenever Sally missed
relieved that she gave a big smile . The

INTERVAL -0.189 - -0.050
CONTAINS 14.299%

â € ™ t lie ! â € she sc olded
time , in a small village , there lived a thin
" Sorry , Ben . No more . Come on ,
It was a magical rainbow ! She couldn 't believe it
Suddenly , Tim my 's mom came into the room

INTERVAL -0.327 - -0.189
CONTAINS 0.307%

full from all the y ummy snacks ! <|endoftext|> One day
The families decided to put some of the texture in an
One day , Tim spilled some juice on his shirt and
is so pretty and y ummy ." Mom said
's office , Dr . Smith told Lucy that Hop was
Token not
Token ID407
Feature activation-1.13e-02

Pos logit contributions
 can+0.295
anda+0.285
ig+0.279
ite+0.275
 slips+0.264
Neg logit contributions
 restaurants-0.239
sets-0.235
 Son-0.225
 veggies-0.205
 foods-0.202
Token notice
Token ID4003
Feature activation-1.31e-01

Pos logit contributions
 restaurants+0.017
sets+0.017
 Son+0.017
kids+0.015
 veggies+0.015
Neg logit contributions
 can-0.019
anda-0.018
ig-0.018
 slips-0.017
ite-0.017
Token that
Token ID326
Feature activation-3.62e-02

Pos logit contributions
sets+0.217
 restaurants+0.216
 Son+0.215
 Father+0.192
 vegetables+0.190
Neg logit contributions
 can-0.201
ig-0.200
anda-0.196
 slips-0.184
ite-0.183
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
sets+0.054
 restaurants+0.054
 Son+0.052
 veggies+0.046
 Father+0.046
Neg logit contributions
 can-0.063
anda-0.063
ig-0.062
 slips-0.058
ite-0.057
Token all
Token ID477
Feature activation-9.45e-02

Pos logit contributions
 can+0.018
ig+0.018
 slips+0.017
anda+0.017
 does+0.017
Neg logit contributions
sets-0.015
 restaurants-0.015
 Son-0.014
 veggies-0.013
 vegetables-0.013
Tokenigator
Token ID23823
Feature activation+2.26e-01

Pos logit contributions
 restaurants+0.173
sets+0.167
 Son+0.157
 vegetables+0.151
 everybody+0.147
Neg logit contributions
ig-0.127
anda-0.121
 can-0.117
uddle-0.116
 slips-0.115
Token toy
Token ID13373
Feature activation+6.72e-03

Pos logit contributions
arl+0.361
ig+0.340
anda+0.340
ite+0.336
br+0.331
Neg logit contributions
 restaurants-0.324
sets-0.318
 Son-0.300
 vegetables-0.298
 foods-0.296
Token was
Token ID373
Feature activation+2.68e-02

Pos logit contributions
anda+0.009
ig+0.009
 can+0.009
arl+0.009
ite+0.009
Neg logit contributions
sets-0.011
 restaurants-0.010
 vegetables-0.010
 foods-0.010
 veggies-0.010
Token not
Token ID407
Feature activation-1.13e-02

Pos logit contributions
ite+0.046
ig+0.045
anda+0.045
 can+0.045
ars+0.041
Neg logit contributions
 restaurants-0.040
sets-0.038
 Son-0.038
 Thanksgiving-0.037
 vegetables-0.034
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
 restaurants+0.018
sets+0.017
 Son+0.017
 veggies+0.016
 vegetables+0.016
Neg logit contributions
ite-0.019
anda-0.019
 can-0.018
ig-0.018
 slips-0.017
Token toy
Token ID13373
Feature activation+6.82e-03

Pos logit contributions
ig+0.018
ows+0.018
anda+0.018
 slips+0.018
ite+0.017
Neg logit contributions
 restaurants-0.014
 Son-0.013
sets-0.013
 Thanksgiving-0.013
 vegetables-0.013
Token day
Token ID1110
Feature activation+4.15e-02

Pos logit contributions
 restaurants+0.100
 Son+0.093
sets+0.089
 Thanksgiving+0.088
kids+0.082
Neg logit contributions
anda-0.146
 can-0.144
 dod-0.139
ig-0.139
 slips-0.136
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
 can+0.042
anda+0.042
ig+0.038
ite+0.037
 slips+0.035
Neg logit contributions
 restaurants-0.088
 Son-0.088
sets-0.086
 Thanksgiving-0.081
 Father-0.081
Token a
Token ID257
Feature activation+7.09e-03

Pos logit contributions
anda+0.033
ig+0.033
ite+0.029
br+0.029
uddle+0.029
Neg logit contributions
 Son-0.016
 Frankie-0.016
 Thanksgiving-0.016
 Joey-0.015
 Kevin-0.014
Token quiet
Token ID5897
Feature activation+7.70e-02

Pos logit contributions
anda+0.014
ig+0.014
 slips+0.014
ows+0.014
ocus+0.013
Neg logit contributions
 Son-0.008
 Thanksgiving-0.007
 restaurants-0.007
 Father-0.007
 couple-0.007
Token all
Token ID477
Feature activation-9.50e-02

Pos logit contributions
anda+0.136
 can+0.135
 else+0.133
 slips+0.131
ows+0.130
Neg logit contributions
 restaurants-0.105
 Son-0.104
sets-0.097
 Thanksgiving-0.097
 couple-0.096
Tokenigator
Token ID23823
Feature activation+2.26e-01

Pos logit contributions
 restaurants+0.133
sets+0.131
 Son+0.121
 Thanksgiving+0.112
 vegetables+0.111
Neg logit contributions
anda-0.170
ig-0.161
 can-0.160
 dod-0.155
 slips-0.154
Token went
Token ID1816
Feature activation+2.80e-02

Pos logit contributions
arl+0.344
ows+0.342
anda+0.339
ixie+0.325
as+0.312
Neg logit contributions
 restaurants-0.342
sets-0.315
 foods-0.296
 vegetables-0.291
 veggies-0.287
Token for
Token ID329
Feature activation-3.96e-03

Pos logit contributions
anda+0.041
 can+0.039
ite+0.037
ig+0.037
 else+0.036
Neg logit contributions
sets-0.050
 restaurants-0.049
 vegetables-0.042
 Thanksgiving-0.042
 Son-0.042
Token a
Token ID257
Feature activation+7.38e-03

Pos logit contributions
sets+0.007
 restaurants+0.007
 Son+0.007
kids+0.006
 veggies+0.006
Neg logit contributions
 can-0.006
anda-0.005
ig-0.005
 slips-0.005
 does-0.005
Token walk
Token ID2513
Feature activation-1.79e-02

Pos logit contributions
 can+0.013
 slips+0.013
anda+0.013
 does+0.013
ig+0.012
Neg logit contributions
 restaurants-0.010
 Thanksgiving-0.009
 Son-0.009
sets-0.009
 couple-0.009
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
sets+0.034
 restaurants+0.034
 Thanksgiving+0.031
 Son+0.031
ices+0.030
Neg logit contributions
anda-0.023
 can-0.023
ig-0.021
 slips-0.021
 dod-0.019
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
 can+0.020
anda+0.019
ite+0.018
 slips+0.018
ig+0.018
Neg logit contributions
sets-0.026
 Son-0.025
 restaurants-0.025
urg-0.024
 decisions-0.023
Token They
Token ID1119
Feature activation-3.82e-02

Pos logit contributions
 can+0.044
anda+0.041
ig+0.041
 slips+0.040
ite+0.039
Neg logit contributions
sets-0.030
 Son-0.030
 restaurants-0.029
 Daughter-0.025
 Father-0.025
Token realized
Token ID6939
Feature activation-9.72e-02

Pos logit contributions
 Son+0.061
 restaurants+0.058
sets+0.055
 Father+0.053
 veggies+0.053
Neg logit contributions
ig-0.060
ite-0.058
anda-0.057
br-0.052
 can-0.052
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
sets+0.167
 restaurants+0.164
 Son+0.162
 veggies+0.146
kids+0.145
Neg logit contributions
 can-0.150
anda-0.140
ig-0.137
 slips-0.133
ite-0.130
Token all
Token ID477
Feature activation-9.20e-02

Pos logit contributions
 can+0.016
anda+0.014
 high+0.014
 does+0.013
ig+0.013
Neg logit contributions
 Son-0.018
sets-0.018
 restaurants-0.018
urg-0.017
kids-0.017
Tokenigator
Token ID23823
Feature activation+2.26e-01

Pos logit contributions
 restaurants+0.175
sets+0.171
 vegetables+0.162
 Son+0.161
 veggies+0.158
Neg logit contributions
ig-0.120
anda-0.116
ider-0.110
 slips-0.107
 can-0.106
Token was
Token ID373
Feature activation+2.80e-02

Pos logit contributions
ig+0.310
anda+0.308
ite+0.301
br+0.293
 can+0.285
Neg logit contributions
sets-0.369
 restaurants-0.362
 Son-0.355
 vegetables-0.346
 foods-0.345
Token real
Token ID1103
Feature activation-2.68e-02

Pos logit contributions
ig+0.046
ite+0.046
 can+0.046
anda+0.045
 slips+0.043
Neg logit contributions
 Son-0.043
 restaurants-0.043
sets-0.042
 Thanksgiving-0.039
 veggies-0.039
Token.
Token ID13
Feature activation+2.27e-02

Pos logit contributions
sets+0.047
 restaurants+0.047
 foods+0.046
 vegetables+0.045
 veggies+0.044
Neg logit contributions
ig-0.035
anda-0.035
 slips-0.034
ite-0.033
 can-0.033
Token They
Token ID1119
Feature activation-3.85e-02

Pos logit contributions
 can+0.042
anda+0.039
ig+0.038
 slips+0.037
ite+0.036
Neg logit contributions
sets-0.032
 restaurants-0.032
 Son-0.031
kids-0.028
 veggies-0.028
Token were
Token ID547
Feature activation-2.70e-02

Pos logit contributions
 Son+0.056
 restaurants+0.052
 veggies+0.051
 vegetables+0.051
 Father+0.050
Neg logit contributions
ig-0.066
anda-0.061
ite-0.061
br-0.057
ider-0.055
Token fun
Token ID1257
Feature activation+8.50e-02

Pos logit contributions
anda+0.007
ig+0.007
ite+0.007
 can+0.007
 slips+0.006
Neg logit contributions
 restaurants-0.008
 foods-0.007
 vegetables-0.007
 veggies-0.007
kids-0.007
Token,
Token ID11
Feature activation+1.90e-02

Pos logit contributions
 can+0.107
anda+0.098
ig+0.090
 slips+0.090
ite+0.087
Neg logit contributions
 restaurants-0.172
sets-0.172
 Son-0.161
 veggies-0.155
 foods-0.154
Token making
Token ID1642
Feature activation-9.09e-02

Pos logit contributions
 can+0.029
anda+0.027
ig+0.026
 slips+0.025
ite+0.025
Neg logit contributions
sets-0.032
 restaurants-0.032
 Son-0.032
kids-0.029
 veggies-0.029
Token the
Token ID262
Feature activation+8.46e-03

Pos logit contributions
sets+0.144
 restaurants+0.143
 Son+0.140
kids+0.128
 veggies+0.126
Neg logit contributions
 can-0.152
anda-0.139
ig-0.136
 slips-0.133
 does-0.131
Token all
Token ID477
Feature activation-9.59e-02

Pos logit contributions
 can+0.013
ig+0.013
 slips+0.013
anda+0.012
ite+0.012
Neg logit contributions
 restaurants-0.014
sets-0.014
 Son-0.013
 veggies-0.012
 vegetables-0.012
Tokenigator
Token ID23823
Feature activation+2.26e-01

Pos logit contributions
 restaurants+0.185
sets+0.183
 Son+0.175
 Thanksgiving+0.164
 veggies+0.164
Neg logit contributions
 can-0.124
ig-0.116
anda-0.115
 slips-0.111
ite-0.103
Token and
Token ID290
Feature activation-7.41e-03

Pos logit contributions
 can+0.329
anda+0.293
ig+0.285
 slips+0.280
 does+0.275
Neg logit contributions
sets-0.409
 Son-0.405
 restaurants-0.404
kids-0.376
 Daughter-0.364
Token the
Token ID262
Feature activation+7.85e-03

Pos logit contributions
 restaurants+0.011
sets+0.011
 Son+0.010
 vegetables+0.010
 veggies+0.010
Neg logit contributions
ig-0.013
anda-0.013
 can-0.012
ite-0.012
br-0.012
Token bird
Token ID6512
Feature activation+2.93e-02

Pos logit contributions
ig+0.015
 can+0.015
anda+0.014
 slips+0.014
ite+0.014
Neg logit contributions
 restaurants-0.010
sets-0.010
 Son-0.009
 vegetables-0.009
 veggies-0.009
Token fight
Token ID1907
Feature activation-3.53e-02

Pos logit contributions
 can+0.039
 slips+0.036
anda+0.036
ig+0.035
ite+0.035
Neg logit contributions
sets-0.053
 restaurants-0.049
 veggies-0.046
 Son-0.046
 vegetables-0.045
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
sets+0.051
 decisions+0.049
 reminders+0.047
 restaurants+0.046
 foods+0.046
Neg logit contributions
ig-0.061
anda-0.060
ite-0.057
itty-0.052
ang-0.050
Token It
Token ID632
Feature activation+2.19e-02

Pos logit contributions
 can+0.040
anda+0.039
ig+0.039
 slips+0.036
ite+0.036
Neg logit contributions
 Son-0.031
 restaurants-0.031
sets-0.030
 Father-0.027
 Daughter-0.027
Token was
Token ID373
Feature activation+2.64e-02

Pos logit contributions
ite+0.041
ig+0.040
arl+0.039
ocus+0.036
oms+0.036
Neg logit contributions
 Son-0.025
 youngest-0.024
 foods-0.024
 vegetables-0.024
sets-0.023
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
ite+0.050
ig+0.048
anda+0.046
 can+0.045
uddle+0.042
Neg logit contributions
 Son-0.038
 Thanksgiving-0.036
 restaurants-0.036
sets-0.034
 Father-0.034
Token real
Token ID1103
Feature activation-2.87e-02

Pos logit contributions
ows+0.019
ig+0.017
anda+0.017
ite+0.016
ocus+0.016
Neg logit contributions
 Son-0.015
 restaurants-0.014
 Thanksgiving-0.014
 couple-0.014
 meal-0.014
Token all
Token ID477
Feature activation-9.45e-02

Pos logit contributions
sets+0.043
 restaurants+0.042
 Son+0.041
 veggies+0.038
kids+0.037
Neg logit contributions
 can-0.049
anda-0.047
 slips-0.046
ig-0.046
 does-0.044
Tokenigator
Token ID23823
Feature activation+2.26e-01

Pos logit contributions
 restaurants+0.159
sets+0.149
 Son+0.145
 Thanksgiving+0.141
 vegetables+0.141
Neg logit contributions
ig-0.148
anda-0.140
uddle-0.139
 slips-0.132
uffy-0.131
Token that
Token ID326
Feature activation-3.64e-02

Pos logit contributions
arl+0.327
anda+0.316
ig+0.312
ixie+0.301
br+0.295
Neg logit contributions
 restaurants-0.353
sets-0.333
 farmers-0.327
 vegetables-0.325
 foods-0.318
Token had
Token ID550
Feature activation+6.76e-02

Pos logit contributions
 restaurants+0.061
sets+0.059
 Son+0.058
 vegetables+0.056
 veggies+0.054
Neg logit contributions
ig-0.056
anda-0.053
ider-0.048
ite-0.048
arl-0.047
Token escaped
Token ID13537
Feature activation+9.34e-02

Pos logit contributions
ig+0.115
ite+0.114
anda+0.113
 can+0.107
uddle+0.101
Neg logit contributions
 restaurants-0.100
 Son-0.092
sets-0.092
 vegetables-0.091
 foods-0.089
Token from
Token ID422
Feature activation-3.02e-02

Pos logit contributions
anda+0.156
ig+0.153
ite+0.147
 can+0.141
 slips+0.139
Neg logit contributions
 restaurants-0.147
sets-0.145
 Son-0.140
 decisions-0.128
 foods-0.128
Token the
Token ID262
Feature activation+8.97e-03

Pos logit contributions
 restaurants+0.044
 Son+0.040
sets+0.039
 Thanksgiving+0.037
 avail+0.037
Neg logit contributions
anda-0.056
 can-0.055
ig-0.053
ite-0.053
br-0.050
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
 can+0.023
ig+0.021
anda+0.021
 slips+0.021
ite+0.020
Neg logit contributions
 restaurants-0.024
 Son-0.024
sets-0.023
kids-0.022
Father-0.022
TokenSuddenly
Token ID38582
Feature activation-6.25e-02

Pos logit contributions
 can+0.023
anda+0.022
ig+0.022
 slips+0.021
hy+0.020
Neg logit contributions
 Son-0.022
Father-0.022
 restaurants-0.021
Next-0.020
sets-0.019
Token,
Token ID11
Feature activation+2.14e-02

Pos logit contributions
 restaurants+0.109
 Son+0.105
sets+0.104
 Kevin+0.102
 Father+0.101
Neg logit contributions
anda-0.089
ite-0.077
hy-0.077
itty-0.076
ang-0.075
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
anda+0.040
ig+0.037
 can+0.037
ite+0.035
 slips+0.034
Neg logit contributions
 restaurants-0.031
sets-0.030
 Son-0.030
 Father-0.027
 veggies-0.026
Token all
Token ID477
Feature activation-9.26e-02

Pos logit contributions
 can+0.018
anda+0.016
ig+0.015
 high+0.015
 slips+0.015
Neg logit contributions
sets-0.016
 restaurants-0.016
 Son-0.016
kids-0.014
urg-0.014
Tokenigator
Token ID23823
Feature activation+2.26e-01

Pos logit contributions
 restaurants+0.180
sets+0.179
 Son+0.174
 vegetables+0.161
 Frankie+0.161
Neg logit contributions
 can-0.107
ig-0.106
anda-0.104
 slips-0.099
ider-0.096
Token snapped
Token ID20821
Feature activation+3.16e-02

Pos logit contributions
anda+0.309
ig+0.303
 can+0.300
ite+0.292
ows+0.270
Neg logit contributions
 restaurants-0.376
sets-0.372
 Son-0.364
 vegetables-0.343
 veggies-0.342
Token its
Token ID663
Feature activation+2.98e-03

Pos logit contributions
anda+0.048
 can+0.045
ite+0.045
ig+0.045
itty+0.044
Neg logit contributions
 Son-0.051
sets-0.050
urg-0.047
 vegetables-0.046
 Father-0.045
Token jaws
Token ID39940
Feature activation-6.21e-02

Pos logit contributions
 can+0.005
ig+0.005
 slips+0.005
ite+0.005
anda+0.005
Neg logit contributions
sets-0.004
 restaurants-0.004
 Son-0.004
 veggies-0.004
 vegetables-0.004
Token and
Token ID290
Feature activation-5.14e-03

Pos logit contributions
 Son+0.084
sets+0.083
 vegetables+0.078
 onions+0.076
 grains+0.076
Neg logit contributions
anda-0.110
ig-0.101
ite-0.097
uffy-0.092
itty-0.089
Token bit
Token ID1643
Feature activation+2.58e-02

Pos logit contributions
 restaurants+0.009
 Son+0.009
sets+0.009
 vegetables+0.008
 veggies+0.008
Neg logit contributions
anda-0.007
ig-0.007
ite-0.007
 can-0.007
uddle-0.006
Token!
Token ID0
Feature activation-6.00e-02

Pos logit contributions
 dod+0.132
anda+0.132
ig+0.127
 slips+0.126
 can+0.123
Neg logit contributions
sets-0.161
 Son-0.146
 Frankie-0.141
 restaurants-0.141
 asking-0.135
Token
Token ID198
Feature activation+7.06e-03

Pos logit contributions
sets+0.100
 restaurants+0.099
 Son+0.097
kids+0.089
 veggies+0.088
Neg logit contributions
 can-0.096
anda-0.087
ig-0.085
 slips-0.084
 does-0.082
Token
Token ID198
Feature activation+7.10e-03

Pos logit contributions
anda+0.012
ig+0.012
 can+0.012
 slips+0.012
ite+0.011
Neg logit contributions
 restaurants-0.010
sets-0.010
 Son-0.010
Father-0.009
kids-0.009
TokenThe
Token ID464
Feature activation-3.51e-02

Pos logit contributions
 can+0.011
anda+0.011
ig+0.011
 slips+0.010
 does+0.010
Neg logit contributions
sets-0.012
 restaurants-0.011
 Son-0.011
kids-0.010
 veggies-0.010
Token all
Token ID477
Feature activation-9.61e-02

Pos logit contributions
sets+0.055
 restaurants+0.055
 Son+0.055
urg+0.053
kids+0.051
Neg logit contributions
 can-0.061
anda-0.053
 high-0.052
 else-0.050
 does-0.050
Tokenigator
Token ID23823
Feature activation+2.25e-01

Pos logit contributions
sets+0.223
 restaurants+0.221
 Son+0.219
 veggies+0.204
kids+0.204
Neg logit contributions
 can-0.089
anda-0.077
ig-0.075
 slips-0.073
 does-0.067
Token said
Token ID531
Feature activation-1.65e-02

Pos logit contributions
 can+0.315
anda+0.296
ig+0.290
 slips+0.281
ite+0.280
Neg logit contributions
sets-0.395
 Son-0.388
 restaurants-0.383
 veggies-0.361
kids-0.354
Token "
Token ID366
Feature activation+1.53e-02

Pos logit contributions
sets+0.019
 Son+0.018
 Father+0.017
 Thanksgiving+0.017
 Sonia+0.017
Neg logit contributions
anda-0.033
ig-0.033
ite-0.032
 slips-0.032
ang-0.030
TokenThis
Token ID1212
Feature activation-1.07e-01

Pos logit contributions
 can+0.025
ite+0.022
anda+0.021
ig+0.021
 else+0.021
Neg logit contributions
 restaurants-0.026
sets-0.025
 Son-0.024
urg-0.023
kids-0.023
Token key
Token ID1994
Feature activation+9.66e-02

Pos logit contributions
sets+0.251
 restaurants+0.251
 Son+0.246
 veggies+0.231
kids+0.231
Neg logit contributions
 can-0.092
anda-0.083
ig-0.079
 slips-0.075
ite-0.071
Token is
Token ID318
Feature activation+7.50e-02

Pos logit contributions
ig+0.081
 dod+0.079
anda+0.076
 high+0.076
 slips+0.075
Neg logit contributions
sets-0.206
 Son-0.193
 restaurants-0.190
 decisions-0.185
 vegetables-0.183
Token closer
Token ID5699
Feature activation+7.31e-02

Pos logit contributions
anda+0.013
 can+0.012
 slips+0.012
 does+0.012
 else+0.011
Neg logit contributions
 Son-0.015
sets-0.015
 restaurants-0.014
 Daughter-0.014
 vegetables-0.014
Token and
Token ID290
Feature activation-1.44e-02

Pos logit contributions
 can+0.099
anda+0.096
ig+0.089
 slips+0.089
ite+0.089
Neg logit contributions
sets-0.136
 restaurants-0.132
 Son-0.131
 veggies-0.120
 vegetables-0.120
Token opened
Token ID4721
Feature activation+7.72e-02

Pos logit contributions
 Son+0.026
sets+0.026
 restaurants+0.025
 Father+0.024
 Kevin+0.024
Neg logit contributions
anda-0.019
 can-0.019
ig-0.019
ite-0.018
 slips-0.017
Token the
Token ID262
Feature activation+2.16e-03

Pos logit contributions
 can+0.106
 high+0.097
ig+0.091
anda+0.088
 does+0.085
Neg logit contributions
 restaurants-0.146
sets-0.146
kids-0.141
 Son-0.140
urg-0.132
Token all
Token ID477
Feature activation-9.64e-02

Pos logit contributions
 can+0.003
anda+0.003
ig+0.003
 slips+0.003
ite+0.003
Neg logit contributions
sets-0.004
 restaurants-0.004
 Son-0.004
 veggies-0.003
 vegetables-0.003
Tokenigator
Token ID23823
Feature activation+2.25e-01

Pos logit contributions
sets+0.186
 restaurants+0.180
 Son+0.172
 vegetables+0.160
 veggies+0.159
Neg logit contributions
 can-0.127
anda-0.120
 slips-0.117
ig-0.117
br-0.110
Token's
Token ID338
Feature activation-1.59e-01

Pos logit contributions
 can+0.342
anda+0.326
ig+0.307
 slips+0.300
ite+0.298
Neg logit contributions
sets-0.387
 restaurants-0.381
 Son-0.368
 veggies-0.344
 vegetables-0.340
Token mouth
Token ID5422
Feature activation-4.01e-02

Pos logit contributions
 decisions+0.179
 vegetables+0.176
 Son+0.174
 veggies+0.174
 foods+0.168
Neg logit contributions
 can-0.311
 else-0.302
ite-0.302
br-0.299
osh-0.293
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
sets+0.079
 Son+0.076
 restaurants+0.072
 veggies+0.069
 vegetables+0.069
Neg logit contributions
 can-0.050
anda-0.049
ig-0.047
 slips-0.044
ite-0.044
Token Inside
Token ID14384
Feature activation+4.34e-02

Pos logit contributions
 can+0.017
anda+0.016
ig+0.015
 slips+0.015
 does+0.015
Neg logit contributions
sets-0.017
 restaurants-0.016
 Son-0.016
kids-0.015
 veggies-0.015
Token she
Token ID673
Feature activation+2.62e-02

Pos logit contributions
anda+0.056
 can+0.054
 slips+0.050
 dod+0.050
ig+0.050
Neg logit contributions
sets-0.083
 restaurants-0.081
 Son-0.079
 Kevin-0.073
 Frankie-0.073
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
sets+0.074
 restaurants+0.073
 Son+0.071
 foods+0.067
kids+0.066
Neg logit contributions
 can-0.040
ig-0.038
 slips-0.038
anda-0.038
br-0.036
Token They
Token ID1119
Feature activation-3.98e-02

Pos logit contributions
 can+0.039
anda+0.036
ig+0.035
 slips+0.034
 does+0.034
Neg logit contributions
sets-0.029
 restaurants-0.029
 Son-0.028
kids-0.025
 veggies-0.025
Token left
Token ID1364
Feature activation+9.11e-02

Pos logit contributions
 Son+0.068
 restaurants+0.067
sets+0.067
kids+0.062
 veggies+0.061
Neg logit contributions
 can-0.058
anda-0.056
ig-0.055
 slips-0.053
ite-0.052
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 can+0.160
anda+0.159
ig+0.155
 slips+0.148
ite+0.147
Neg logit contributions
 restaurants-0.137
sets-0.133
 Son-0.130
 veggies-0.117
 vegetables-0.117
Token all
Token ID477
Feature activation-9.33e-02

Pos logit contributions
 can+0.018
anda+0.017
ig+0.017
 slips+0.016
 does+0.016
Neg logit contributions
 restaurants-0.017
sets-0.016
 Son-0.015
kids-0.014
 veggies-0.014
Tokenigator
Token ID23823
Feature activation+2.25e-01

Pos logit contributions
 restaurants+0.166
sets+0.163
 Son+0.156
 Thanksgiving+0.155
 vegetables+0.145
Neg logit contributions
ig-0.126
 can-0.126
anda-0.123
 slips-0.121
uddle-0.120
Token pond
Token ID16723
Feature activation+6.48e-02

Pos logit contributions
anda+0.275
 can+0.273
br+0.271
 slips+0.243
ig+0.242
Neg logit contributions
 restaurants-0.412
sets-0.394
 vegetables-0.386
 veggies-0.373
 foods-0.372
Token and
Token ID290
Feature activation-5.47e-03

Pos logit contributions
br+0.100
anda+0.099
 can+0.098
ig+0.093
 slips+0.092
Neg logit contributions
sets-0.093
 vegetables-0.092
 restaurants-0.089
 grains-0.088
 foods-0.087
Token went
Token ID1816
Feature activation+3.01e-02

Pos logit contributions
 restaurants+0.010
sets+0.010
 Son+0.009
 veggies+0.009
 vegetables+0.009
Neg logit contributions
 can-0.008
anda-0.008
ig-0.008
ite-0.007
 slips-0.007
Token to
Token ID284
Feature activation-1.15e-02

Pos logit contributions
 can+0.050
anda+0.047
 slips+0.047
 does+0.047
ig+0.046
Neg logit contributions
 restaurants-0.047
sets-0.042
 Son-0.039
 Thanksgiving-0.039
 couple-0.039
Token see
Token ID766
Feature activation-1.45e-02

Pos logit contributions
sets+0.023
 Son+0.022
 restaurants+0.021
umm+0.021
kids+0.021
Neg logit contributions
 can-0.016
 high-0.014
 does-0.013
 else-0.012
 slips-0.012
Token up
Token ID510
Feature activation-2.30e-02

Pos logit contributions
 restaurants+0.102
 Son+0.099
sets+0.099
 veggies+0.093
 vegetables+0.092
Neg logit contributions
 can-0.098
ig-0.094
anda-0.092
ite-0.091
 slips-0.090
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
 restaurants+0.047
sets+0.046
 Son+0.045
 veggies+0.042
 Thanksgiving+0.041
Neg logit contributions
 can-0.028
anda-0.026
ig-0.025
 slips-0.024
ite-0.024
Token The
Token ID383
Feature activation-3.45e-02

Pos logit contributions
 can+0.030
anda+0.028
ig+0.027
 slips+0.026
ite+0.026
Neg logit contributions
sets-0.028
 restaurants-0.028
 Son-0.028
kids-0.025
 Daughter-0.025
Token quiet
Token ID5897
Feature activation+7.72e-02

Pos logit contributions
sets+0.049
 restaurants+0.049
 Son+0.047
 veggies+0.043
kids+0.042
Neg logit contributions
 can-0.064
anda-0.060
ig-0.059
 slips-0.058
ite-0.056
Token all
Token ID477
Feature activation-9.23e-02

Pos logit contributions
 can+0.124
anda+0.106
 high+0.102
ig+0.102
 does+0.101
Neg logit contributions
 restaurants-0.130
 Son-0.130
sets-0.129
urg-0.118
kids-0.118
Tokenigator
Token ID23823
Feature activation+2.25e-01

Pos logit contributions
sets+0.161
 restaurants+0.159
 Son+0.149
heart+0.146
 vegetables+0.145
Neg logit contributions
anda-0.125
ig-0.125
uddle-0.121
ider-0.119
 can-0.116
Token did
Token ID750
Feature activation+1.66e-01

Pos logit contributions
arl+0.396
anda+0.351
ite+0.346
ig+0.338
och+0.336
Neg logit contributions
sets-0.297
 restaurants-0.296
 vegetables-0.288
 foods-0.287
 Son-0.287
Token not
Token ID407
Feature activation-9.49e-03

Pos logit contributions
anda+0.298
 can+0.298
ig+0.288
ite+0.286
 slips+0.274
Neg logit contributions
 restaurants-0.230
sets-0.228
 Son-0.216
 veggies-0.197
 foods-0.197
Token understand
Token ID1833
Feature activation-1.26e-01

Pos logit contributions
 restaurants+0.014
sets+0.014
 Son+0.014
kids+0.013
 veggies+0.013
Neg logit contributions
anda-0.016
ig-0.015
 can-0.015
 slips-0.015
ite-0.015
Token and
Token ID290
Feature activation-7.34e-03

Pos logit contributions
 restaurants+0.207
 Son+0.198
 foods+0.190
sets+0.189
 vegetables+0.182
Neg logit contributions
ite-0.202
anda-0.195
ig-0.193
 slips-0.183
ider-0.182
Token walked
Token ID6807
Feature activation+4.47e-02

Pos logit contributions
sets+0.011
 restaurants+0.011
 Son+0.010
 vegetables+0.010
 veggies+0.010
Neg logit contributions
ig-0.012
anda-0.012
ite-0.012
br-0.011
 can-0.011
Token the
Token ID262
Feature activation+4.48e-03

Pos logit contributions
 restaurants+0.119
sets+0.118
 Son+0.118
 veggies+0.108
kids+0.107
Neg logit contributions
 can-0.068
anda-0.066
ig-0.063
 slips-0.061
ite-0.060
Token bird
Token ID6512
Feature activation+2.94e-02

Pos logit contributions
 can+0.006
 high+0.005
anda+0.005
 else+0.005
ite+0.005
Neg logit contributions
 Son-0.009
 restaurants-0.009
sets-0.009
urg-0.009
kids-0.008
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
anda+0.035
ite+0.034
 can+0.034
 slips+0.033
suit+0.033
Neg logit contributions
sets-0.052
 Son-0.051
 restaurants-0.050
 veggies-0.049
 vegetables-0.049
Token the
Token ID262
Feature activation+3.89e-03

Pos logit contributions
 can+0.022
 high+0.018
 slips+0.018
 does+0.018
ig+0.018
Neg logit contributions
sets-0.027
 restaurants-0.027
 Son-0.026
kids-0.025
urg-0.024
Token all
Token ID477
Feature activation-9.66e-02

Pos logit contributions
 can+0.006
anda+0.005
ig+0.005
 slips+0.005
 does+0.005
Neg logit contributions
sets-0.007
 restaurants-0.007
 Son-0.007
kids-0.006
 veggies-0.006
Tokenigator
Token ID23823
Feature activation+2.25e-01

Pos logit contributions
sets+0.191
 restaurants+0.188
 Son+0.187
 veggies+0.173
 vegetables+0.173
Neg logit contributions
 can-0.111
anda-0.110
ig-0.103
 slips-0.102
ider-0.098
Token came
Token ID1625
Feature activation+2.35e-02

Pos logit contributions
anda+0.332
 can+0.328
ig+0.313
ite+0.309
arl+0.296
Neg logit contributions
sets-0.363
 restaurants-0.344
 Son-0.342
 veggies-0.326
 vegetables-0.321
Token out
Token ID503
Feature activation-2.59e-02

Pos logit contributions
 can+0.030
anda+0.028
ig+0.027
 slips+0.026
 does+0.026
Neg logit contributions
sets-0.046
 Son-0.045
 restaurants-0.044
urg-0.041
 veggies-0.041
Token of
Token ID286
Feature activation+2.02e-03

Pos logit contributions
sets+0.048
 Son+0.047
 restaurants+0.047
 Father+0.043
urg+0.042
Neg logit contributions
 can-0.035
anda-0.034
ig-0.032
 slips-0.032
ite-0.031
Token the
Token ID262
Feature activation+4.48e-03

Pos logit contributions
 can+0.003
anda+0.003
ig+0.003
 slips+0.003
 high+0.003
Neg logit contributions
sets-0.004
 restaurants-0.003
 Son-0.003
kids-0.003
 veggies-0.003
Token pond
Token ID16723
Feature activation+6.56e-02

Pos logit contributions
 can+0.007
anda+0.006
 slips+0.006
ig+0.006
 does+0.006
Neg logit contributions
 restaurants-0.008
sets-0.008
 Son-0.007
 veggies-0.007
 vegetables-0.007
Token we
Token ID356
Feature activation-4.26e-02

Pos logit contributions
 can+0.253
anda+0.189
 else+0.189
 does+0.188
 slips+0.179
Neg logit contributions
 restaurants-0.477
sets-0.475
 Son-0.467
kids-0.452
urg-0.434
Token take
Token ID1011
Feature activation-5.53e-02

Pos logit contributions
sets+0.073
 restaurants+0.073
 Son+0.071
kids+0.066
 veggies+0.065
Neg logit contributions
 can-0.067
anda-0.060
ig-0.058
 slips-0.057
 does-0.056
Token the
Token ID262
Feature activation+5.88e-04

Pos logit contributions
sets+0.095
 restaurants+0.093
 Son+0.092
kids+0.085
 veggies+0.083
Neg logit contributions
 can-0.087
anda-0.077
ig-0.075
 slips-0.074
 high-0.073
Token cute
Token ID13779
Feature activation+2.73e-02

Pos logit contributions
 can+0.001
 high+0.001
anda+0.001
ite+0.001
 else+0.001
Neg logit contributions
 Son-0.001
 restaurants-0.001
urg-0.001
kids-0.001
sets-0.001
Token all
Token ID477
Feature activation-9.54e-02

Pos logit contributions
 can+0.044
anda+0.040
ig+0.038
 high+0.037
 slips+0.037
Neg logit contributions
sets-0.045
 restaurants-0.045
 Son-0.045
kids-0.041
Father-0.040
Tokenigator
Token ID23823
Feature activation+2.25e-01

Pos logit contributions
sets+0.180
 restaurants+0.179
 Son+0.172
 Thanksgiving+0.158
 Daughter+0.157
Neg logit contributions
 can-0.127
anda-0.116
 slips-0.116
ig-0.113
 does-0.110
Token home
Token ID1363
Feature activation-6.38e-02

Pos logit contributions
anda+0.284
 can+0.281
 slips+0.264
ig+0.263
 dod+0.261
Neg logit contributions
 restaurants-0.426
sets-0.419
 Son-0.389
 Thanksgiving-0.368
 veggies-0.366
Token with
Token ID351
Feature activation-4.43e-02

Pos logit contributions
sets+0.124
 restaurants+0.124
 Son+0.119
 veggies+0.110
kids+0.110
Neg logit contributions
 can-0.083
anda-0.076
ig-0.074
 slips-0.074
 dod-0.070
Token us
Token ID514
Feature activation-1.29e-02

Pos logit contributions
sets+0.082
 restaurants+0.082
 Son+0.080
kids+0.074
 veggies+0.074
Neg logit contributions
 can-0.062
anda-0.057
ig-0.055
 slips-0.054
 does-0.052
Token?"
Token ID1701
Feature activation+9.36e-03

Pos logit contributions
sets+0.022
 restaurants+0.022
 Son+0.020
 Thanksgiving+0.019
 veggies+0.019
Neg logit contributions
 slips-0.019
 can-0.019
anda-0.018
ig-0.018
 dod-0.018
Token
Token ID198
Feature activation+6.55e-03

Pos logit contributions
 can+0.015
anda+0.013
ig+0.013
 does+0.013
 slips+0.013
Neg logit contributions
sets-0.016
 restaurants-0.016
 Son-0.015
kids-0.014
urg-0.014
Token fun
Token ID1257
Feature activation+9.02e-02

Pos logit contributions
 can+0.094
anda+0.089
ig+0.088
ite+0.084
 slips+0.082
Neg logit contributions
 restaurants-0.123
sets-0.121
 Son-0.119
 veggies-0.111
kids-0.110
Token playing
Token ID2712
Feature activation+4.46e-03

Pos logit contributions
anda+0.137
 can+0.136
ite+0.128
ig+0.124
 slips+0.123
Neg logit contributions
 restaurants-0.156
sets-0.152
 Thanksgiving-0.142
 foods-0.141
 veggies-0.140
Token with
Token ID351
Feature activation-4.55e-02

Pos logit contributions
anda+0.008
ite+0.007
ig+0.007
hy+0.007
 slips+0.006
Neg logit contributions
 foods-0.008
 restaurants-0.007
 Thanksgiving-0.007
 vegetables-0.007
 veggies-0.007
Token the
Token ID262
Feature activation+9.33e-03

Pos logit contributions
 restaurants+0.066
 Son+0.065
sets+0.064
 veggies+0.059
 vegetables+0.059
Neg logit contributions
 can-0.081
ig-0.077
anda-0.077
 slips-0.076
ite-0.074
Token all
Token ID477
Feature activation-9.36e-02

Pos logit contributions
 can+0.016
anda+0.015
ig+0.014
 high+0.014
ite+0.014
Neg logit contributions
sets-0.014
 restaurants-0.014
 Son-0.014
kids-0.013
urg-0.013
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
sets+0.190
 restaurants+0.189
 Son+0.187
 Thanksgiving+0.177
 vegetables+0.173
Neg logit contributions
 can-0.106
ig-0.098
anda-0.097
 slips-0.097
 does-0.087
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
anda+0.305
br+0.298
 can+0.296
ig+0.288
 slips+0.281
Neg logit contributions
 restaurants-0.372
sets-0.372
 vegetables-0.357
 foods-0.349
 veggies-0.347
Token They
Token ID1119
Feature activation-3.81e-02

Pos logit contributions
anda+0.037
ig+0.037
 can+0.036
ite+0.035
 slips+0.034
Neg logit contributions
sets-0.022
 restaurants-0.022
 Son-0.022
 Father-0.019
kids-0.019
Token were
Token ID547
Feature activation-2.91e-02

Pos logit contributions
 restaurants+0.057
 Son+0.056
sets+0.054
 veggies+0.053
 couple+0.052
Neg logit contributions
ig-0.063
anda-0.061
ite-0.057
 slips-0.053
 can-0.053
Token happy
Token ID3772
Feature activation+1.48e-02

Pos logit contributions
 restaurants+0.050
sets+0.049
 Son+0.048
 veggies+0.044
 Thanksgiving+0.044
Neg logit contributions
 can-0.045
ig-0.041
anda-0.041
ite-0.040
 slips-0.039
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
 can+0.018
anda+0.018
ig+0.017
 slips+0.017
ite+0.016
Neg logit contributions
sets-0.029
 restaurants-0.029
 Son-0.028
 veggies-0.026
 foods-0.026
Token a
Token ID257
Feature activation+6.54e-03

Pos logit contributions
 can+0.027
 high+0.024
anda+0.022
ig+0.022
 does+0.022
Neg logit contributions
sets-0.043
 restaurants-0.042
 Son-0.041
kids-0.039
urg-0.038
Token big
Token ID1263
Feature activation+4.17e-02

Pos logit contributions
 can+0.012
 slips+0.012
anda+0.012
ig+0.012
ows+0.011
Neg logit contributions
 Son-0.008
 restaurants-0.008
 Thanksgiving-0.008
sets-0.008
 couple-0.008
Token,
Token ID11
Feature activation+1.35e-02

Pos logit contributions
ows+0.068
anda+0.065
 else+0.064
 slips+0.063
ider+0.063
Neg logit contributions
 restaurants-0.068
sets-0.063
 Son-0.063
 Thanksgiving-0.061
 vegetables-0.059
Token scary
Token ID14343
Feature activation+5.99e-02

Pos logit contributions
anda+0.021
 can+0.021
ig+0.021
 slips+0.020
ite+0.019
Neg logit contributions
 Son-0.021
 restaurants-0.021
sets-0.020
 Thanksgiving-0.020
 Daughter-0.019
Token all
Token ID477
Feature activation-9.50e-02

Pos logit contributions
 can+0.099
anda+0.095
 slips+0.095
 else+0.094
ite+0.094
Neg logit contributions
 restaurants-0.091
sets-0.086
 Son-0.085
 Thanksgiving-0.082
kids-0.080
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
 restaurants+0.192
sets+0.189
 Son+0.180
 Thanksgiving+0.171
 vegetables+0.166
Neg logit contributions
 can-0.110
ig-0.109
anda-0.106
 slips-0.101
uddle-0.095
Token who
Token ID508
Feature activation+6.02e-02

Pos logit contributions
 can+0.286
 slips+0.276
arl+0.273
ig+0.269
anda+0.267
Neg logit contributions
sets-0.373
 restaurants-0.373
 Thanksgiving-0.346
 Son-0.342
itive-0.342
Token lived
Token ID5615
Feature activation+1.30e-01

Pos logit contributions
ig+0.088
anda+0.081
ite+0.078
 slips+0.075
oms+0.074
Neg logit contributions
 restaurants-0.105
sets-0.099
kids-0.093
 Son-0.093
 foods-0.091
Token in
Token ID287
Feature activation+1.17e-01

Pos logit contributions
anda+0.157
 can+0.156
ig+0.142
br+0.141
ite+0.140
Neg logit contributions
sets-0.231
 restaurants-0.229
 youngest-0.217
 Thanksgiving-0.217
 farmers-0.215
Token the
Token ID262
Feature activation+2.92e-03

Pos logit contributions
 can+0.154
 high+0.133
 does+0.122
 slips+0.119
 far+0.117
Neg logit contributions
sets-0.243
 Son-0.232
 restaurants-0.227
kids-0.226
 veggies-0.216
Token wild
Token ID4295
Feature activation+4.49e-02

Pos logit contributions
 can+0.005
anda+0.004
ig+0.004
 slips+0.004
 does+0.004
Neg logit contributions
 restaurants-0.005
sets-0.005
 Son-0.005
 veggies-0.004
 vegetables-0.004
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
 can+0.102
anda+0.096
 slips+0.088
ig+0.087
ite+0.084
Neg logit contributions
 restaurants-0.191
sets-0.190
 foods-0.176
 Son-0.175
 veggies-0.173
Token
Token ID198
Feature activation+8.96e-03

Pos logit contributions
 can+0.025
anda+0.023
ig+0.023
 slips+0.022
 does+0.022
Neg logit contributions
sets-0.023
 restaurants-0.023
 Son-0.022
kids-0.020
 veggies-0.020
Token
Token ID198
Feature activation+9.01e-03

Pos logit contributions
 can+0.014
anda+0.012
ig+0.012
 slips+0.012
ite+0.012
Neg logit contributions
 restaurants-0.015
sets-0.015
 Son-0.015
kids-0.014
 veggies-0.014
TokenThe
Token ID464
Feature activation-3.36e-02

Pos logit contributions
 can+0.015
anda+0.014
ig+0.014
 slips+0.013
 does+0.013
Neg logit contributions
sets-0.014
 restaurants-0.014
 Son-0.014
kids-0.013
 veggies-0.012
Token all
Token ID477
Feature activation-9.45e-02

Pos logit contributions
sets+0.053
 restaurants+0.053
 Son+0.052
kids+0.048
urg+0.047
Neg logit contributions
 can-0.058
anda-0.051
 high-0.051
ig-0.049
 does-0.048
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
sets+0.218
 restaurants+0.217
 Son+0.213
 veggies+0.199
kids+0.199
Neg logit contributions
 can-0.088
anda-0.077
ig-0.074
 slips-0.072
 does-0.067
Token was
Token ID373
Feature activation+2.39e-02

Pos logit contributions
anda+0.325
 can+0.324
ig+0.308
arl+0.303
ite+0.300
Neg logit contributions
sets-0.359
 restaurants-0.354
 Son-0.346
kids-0.322
 veggies-0.320
Token so
Token ID523
Feature activation-3.86e-02

Pos logit contributions
 can+0.034
anda+0.031
ig+0.031
 slips+0.030
ite+0.029
Neg logit contributions
sets-0.044
 restaurants-0.044
 Son-0.042
kids-0.039
 Thanksgiving-0.039
Token happy
Token ID3772
Feature activation+1.56e-02

Pos logit contributions
 restaurants+0.057
sets+0.055
 Son+0.055
 Thanksgiving+0.050
 Daughter+0.049
Neg logit contributions
 can-0.065
ig-0.064
anda-0.061
 slips-0.059
 does-0.056
Token to
Token ID284
Feature activation-1.34e-02

Pos logit contributions
 can+0.019
anda+0.018
ig+0.017
 slips+0.016
ite+0.016
Neg logit contributions
sets-0.032
 restaurants-0.031
 Son-0.030
 veggies-0.028
kids-0.028
Token have
Token ID423
Feature activation+1.88e-02

Pos logit contributions
sets+0.027
 restaurants+0.026
 Son+0.026
 veggies+0.024
kids+0.024
Neg logit contributions
 can-0.018
anda-0.015
ig-0.015
 high-0.015
 slips-0.015
Token was
Token ID373
Feature activation+2.68e-02

Pos logit contributions
ite+0.079
ig+0.077
anda+0.075
arl+0.070
 can+0.069
Neg logit contributions
 Son-0.071
 foods-0.068
 vegetables-0.068
 french-0.068
 fries-0.068
Token lucky
Token ID9670
Feature activation-2.23e-02

Pos logit contributions
ite+0.046
ig+0.046
 can+0.045
anda+0.043
br+0.039
Neg logit contributions
 restaurants-0.037
 Son-0.037
sets-0.034
aging-0.034
 Thanksgiving-0.034
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
sets+0.038
 restaurants+0.038
 Son+0.037
 vegetables+0.036
 foods+0.035
Neg logit contributions
anda-0.032
ig-0.031
ite-0.030
 can-0.030
ars-0.029
Token The
Token ID383
Feature activation-3.76e-02

Pos logit contributions
ig+0.032
 can+0.032
anda+0.032
 slips+0.030
ite+0.029
Neg logit contributions
 Son-0.028
 restaurants-0.028
sets-0.028
 Father-0.025
 Daughter-0.025
Token all
Token ID477
Feature activation-9.44e-02

Pos logit contributions
sets+0.058
 restaurants+0.058
 Son+0.058
kids+0.053
urg+0.052
Neg logit contributions
 can-0.067
anda-0.058
 high-0.058
 does-0.056
ig-0.055
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
 restaurants+0.196
sets+0.193
 Son+0.190
 veggies+0.178
 vegetables+0.177
Neg logit contributions
 can-0.102
ig-0.098
anda-0.097
 slips-0.091
ite-0.085
Token did
Token ID750
Feature activation+1.65e-01

Pos logit contributions
arl+0.363
anda+0.334
ite+0.333
ig+0.332
och+0.329
Neg logit contributions
 restaurants-0.308
 vegetables-0.302
 foods-0.298
 veggies-0.295
 Son-0.291
Token not
Token ID407
Feature activation-1.19e-02

Pos logit contributions
 can+0.280
anda+0.259
ig+0.254
 slips+0.247
ite+0.243
Neg logit contributions
sets-0.258
 restaurants-0.257
 Son-0.250
kids-0.227
 veggies-0.226
Token catch
Token ID4929
Feature activation-2.70e-02

Pos logit contributions
 restaurants+0.019
sets+0.018
 Son+0.017
 veggies+0.017
 foods+0.016
Neg logit contributions
anda-0.019
ig-0.019
 can-0.018
ite-0.018
 slips-0.017
Token him
Token ID683
Feature activation-2.59e-02

Pos logit contributions
 restaurants+0.030
sets+0.029
 Son+0.027
 foods+0.026
 veggies+0.025
Neg logit contributions
 can-0.055
ig-0.054
anda-0.054
ite-0.053
 slips-0.052
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
 restaurants+0.045
sets+0.045
 Son+0.043
 decisions+0.042
 foods+0.042
Neg logit contributions
 can-0.033
anda-0.033
ig-0.033
ite-0.032
 slips-0.031
Token?"
Token ID1701
Feature activation+8.37e-03

Pos logit contributions
sets+0.015
 restaurants+0.014
 Son+0.014
 foods+0.012
 Thanksgiving+0.012
Neg logit contributions
anda-0.012
ig-0.012
 slips-0.011
 can-0.011
 dod-0.011
Token
Token ID198
Feature activation+4.59e-03

Pos logit contributions
 can+0.015
anda+0.015
ig+0.014
 slips+0.014
ite+0.013
Neg logit contributions
sets-0.012
 Son-0.012
 restaurants-0.012
 Daughter-0.011
 veggies-0.011
Token
Token ID198
Feature activation+4.84e-03

Pos logit contributions
 can+0.008
anda+0.008
ig+0.008
 slips+0.008
 does+0.008
Neg logit contributions
 Son-0.006
sets-0.006
 restaurants-0.006
Father-0.006
 Daughter-0.006
TokenThe
Token ID464
Feature activation-3.66e-02

Pos logit contributions
 can+0.009
anda+0.009
ig+0.009
 slips+0.009
 does+0.008
Neg logit contributions
sets-0.006
 Son-0.006
 restaurants-0.006
Father-0.006
 Daughter-0.006
Token all
Token ID477
Feature activation-9.75e-02

Pos logit contributions
 restaurants+0.057
sets+0.057
 Son+0.056
urg+0.053
kids+0.052
Neg logit contributions
 can-0.064
anda-0.056
 high-0.055
 does-0.053
 else-0.052
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
sets+0.197
 restaurants+0.192
 Son+0.192
 Frankie+0.179
 Daughter+0.178
Neg logit contributions
 can-0.108
anda-0.106
ig-0.103
 slips-0.103
ider-0.094
Token smiled
Token ID13541
Feature activation+6.06e-02

Pos logit contributions
 can+0.340
anda+0.329
ig+0.321
ite+0.317
arl+0.315
Neg logit contributions
 Son-0.350
sets-0.348
 restaurants-0.327
 Father-0.320
 veggies-0.306
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
anda+0.087
 can+0.084
ig+0.082
 slips+0.080
ite+0.079
Neg logit contributions
sets-0.108
 Son-0.104
 restaurants-0.094
 Father-0.094
 veggies-0.091
Token said
Token ID531
Feature activation-1.76e-02

Pos logit contributions
sets+0.029
 restaurants+0.029
 Son+0.029
kids+0.026
 veggies+0.026
Neg logit contributions
 can-0.021
anda-0.019
ig-0.019
 slips-0.018
ite-0.018
Token "
Token ID366
Feature activation+1.45e-02

Pos logit contributions
sets+0.022
 Son+0.021
 restaurants+0.019
 Father+0.019
 Kevin+0.018
Neg logit contributions
anda-0.034
ite-0.033
ig-0.033
 slips-0.032
 can-0.031
TokenOpen
Token ID11505
Feature activation+1.96e-02

Pos logit contributions
 can+0.023
anda+0.020
ite+0.020
ig+0.020
 slips+0.019
Neg logit contributions
 restaurants-0.025
sets-0.024
 Son-0.024
kids-0.022
 veggies-0.022
Token
Token ID198
Feature activation+9.97e-03

Pos logit contributions
sets+0.081
 Son+0.079
 restaurants+0.078
 Daughter+0.071
 veggies+0.070
Neg logit contributions
 can-0.107
anda-0.104
ig-0.102
 slips-0.098
 does-0.095
Token
Token ID198
Feature activation+1.01e-02

Pos logit contributions
 can+0.016
anda+0.015
ig+0.015
 slips+0.015
 does+0.014
Neg logit contributions
 restaurants-0.016
sets-0.016
 Son-0.016
kids-0.014
Father-0.014
TokenThe
Token ID464
Feature activation-3.40e-02

Pos logit contributions
 can+0.018
anda+0.017
ig+0.017
 slips+0.016
 does+0.016
Neg logit contributions
sets-0.015
 restaurants-0.015
 Son-0.014
Father-0.013
 veggies-0.013
Token quiet
Token ID5897
Feature activation+7.44e-02

Pos logit contributions
sets+0.050
 restaurants+0.050
 Son+0.050
kids+0.045
urg+0.044
Neg logit contributions
 can-0.062
anda-0.056
ig-0.054
 high-0.053
 does-0.053
Token all
Token ID477
Feature activation-9.53e-02

Pos logit contributions
 can+0.122
anda+0.107
 high+0.102
ig+0.102
 does+0.100
Neg logit contributions
 restaurants-0.123
 Son-0.121
sets-0.121
kids-0.110
urg-0.110
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
sets+0.165
 restaurants+0.161
 Son+0.159
 Daughter+0.153
 vegetables+0.151
Neg logit contributions
ig-0.122
 slips-0.122
 can-0.120
ider-0.120
uddle-0.119
Token did
Token ID750
Feature activation+1.63e-01

Pos logit contributions
arl+0.373
anda+0.346
 can+0.336
ig+0.335
ite+0.332
Neg logit contributions
sets-0.310
 Son-0.309
 veggies-0.297
 restaurants-0.296
 vegetables-0.295
Token not
Token ID407
Feature activation-1.22e-02

Pos logit contributions
 can+0.199
anda+0.175
ig+0.170
 slips+0.165
 does+0.159
Neg logit contributions
sets-0.338
 restaurants-0.336
 Son-0.330
kids-0.308
 veggies-0.305
Token know
Token ID760
Feature activation-1.30e-01

Pos logit contributions
sets+0.021
 restaurants+0.021
 Son+0.021
kids+0.019
 veggies+0.019
Neg logit contributions
 can-0.019
anda-0.017
ig-0.017
 slips-0.016
 does-0.016
Token what
Token ID644
Feature activation-3.92e-02

Pos logit contributions
 restaurants+0.168
sets+0.164
 vegetables+0.155
 foods+0.149
 Son+0.149
Neg logit contributions
ite-0.245
suit-0.244
tted-0.230
 slips-0.228
ars-0.228
Token a
Token ID257
Feature activation+9.93e-03

Pos logit contributions
 Thanksgiving+0.053
 vegetables+0.049
 Farmer+0.048
 restaurants+0.048
 foods+0.047
Neg logit contributions
ite-0.076
ig-0.075
anda-0.075
uddle-0.071
suit-0.069
Token!"
Token ID2474
Feature activation-1.14e-01

Pos logit contributions
 can+0.055
anda+0.053
 slips+0.050
ig+0.049
 does+0.048
Neg logit contributions
 restaurants-0.066
sets-0.066
 Son-0.063
kids-0.058
 veggies-0.057
Token Maria
Token ID14200
Feature activation-7.69e-02

Pos logit contributions
 restaurants+0.189
sets+0.188
 Son+0.184
kids+0.170
 veggies+0.167
Neg logit contributions
 can-0.186
anda-0.163
ig-0.162
 slips-0.159
 does-0.159
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
sets+0.127
 Son+0.124
kids+0.115
 restaurants+0.115
 youngest+0.112
Neg logit contributions
 can-0.114
 slips-0.109
anda-0.105
ite-0.101
ig-0.099
Token the
Token ID262
Feature activation+4.85e-03

Pos logit contributions
 restaurants+0.023
sets+0.023
 Son+0.022
kids+0.021
umm+0.021
Neg logit contributions
 can-0.017
 high-0.014
anda-0.014
ig-0.014
 does-0.014
Token all
Token ID477
Feature activation-9.71e-02

Pos logit contributions
 can+0.008
 high+0.007
anda+0.007
 else+0.007
 does+0.007
Neg logit contributions
 restaurants-0.008
sets-0.008
 Son-0.008
urg-0.007
kids-0.007
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
sets+0.169
 Son+0.167
 restaurants+0.163
 Daughter+0.155
 Frankie+0.153
Neg logit contributions
 can-0.136
 slips-0.129
anda-0.127
ider-0.121
ig-0.118
Token set
Token ID900
Feature activation-3.32e-02

Pos logit contributions
 can+0.343
anda+0.322
 slips+0.309
ig+0.303
ite+0.295
Neg logit contributions
sets-0.375
 restaurants-0.355
 Son-0.352
kids-0.327
 Father-0.324
Token out
Token ID503
Feature activation-2.55e-02

Pos logit contributions
 restaurants+0.059
sets+0.058
 Son+0.057
kids+0.053
 veggies+0.052
Neg logit contributions
 can-0.050
ig-0.044
 high-0.044
anda-0.044
 slips-0.043
Token on
Token ID319
Feature activation+7.87e-02

Pos logit contributions
sets+0.044
 restaurants+0.044
 Son+0.039
 anticipation+0.038
 vegetables+0.037
Neg logit contributions
anda-0.037
 can-0.037
 slips-0.035
ite-0.034
 does-0.033
Token their
Token ID511
Feature activation-3.85e-02

Pos logit contributions
 can+0.128
anda+0.117
ig+0.114
 slips+0.112
 does+0.109
Neg logit contributions
sets-0.130
 restaurants-0.129
 Son-0.126
kids-0.116
 veggies-0.114
Token journey
Token ID7002
Feature activation-2.85e-02

Pos logit contributions
 restaurants+0.051
sets+0.050
 Son+0.048
kids+0.045
 veggies+0.043
Neg logit contributions
 can-0.074
anda-0.069
 slips-0.067
ig-0.067
ite-0.067
Token stayed
Token ID9658
Feature activation+7.79e-02

Pos logit contributions
sets+0.034
 restaurants+0.033
 Son+0.033
kids+0.030
 veggies+0.030
Neg logit contributions
 can-0.027
anda-0.024
ig-0.024
 slips-0.023
 does-0.023
Token away
Token ID1497
Feature activation+1.29e-01

Pos logit contributions
 can+0.094
anda+0.086
 slips+0.079
ig+0.078
ite+0.077
Neg logit contributions
 restaurants-0.161
sets-0.160
 Son-0.158
 avail-0.146
 veggies-0.145
Token from
Token ID422
Feature activation-3.35e-02

Pos logit contributions
 can+0.185
anda+0.166
ig+0.162
 slips+0.159
 does+0.154
Neg logit contributions
sets-0.237
 restaurants-0.236
 Son-0.231
kids-0.214
 veggies-0.212
Token the
Token ID262
Feature activation+2.82e-03

Pos logit contributions
sets+0.056
 restaurants+0.056
 Son+0.054
kids+0.050
 veggies+0.049
Neg logit contributions
 can-0.054
anda-0.049
ig-0.048
 slips-0.047
 does-0.046
Token all
Token ID477
Feature activation-1.00e-01

Pos logit contributions
 can+0.005
anda+0.005
ig+0.005
 slips+0.004
 does+0.004
Neg logit contributions
sets-0.004
 restaurants-0.004
 Son-0.004
 veggies-0.004
kids-0.004
Tokenigator
Token ID23823
Feature activation+2.24e-01

Pos logit contributions
 restaurants+0.215
sets+0.214
 Son+0.209
kids+0.193
 veggies+0.192
Neg logit contributions
 can-0.112
anda-0.100
ig-0.097
 slips-0.096
 does-0.090
Token.
Token ID13
Feature activation+7.62e-03

Pos logit contributions
 can+0.279
anda+0.255
 slips+0.241
ig+0.237
br+0.234
Neg logit contributions
sets-0.443
 restaurants-0.441
 Son-0.423
 veggies-0.398
 decisions-0.394
Token But
Token ID887
Feature activation+6.80e-03

Pos logit contributions
 can+0.012
anda+0.010
 slips+0.010
 does+0.010
 high+0.010
Neg logit contributions
 restaurants-0.014
sets-0.014
 Son-0.013
 avail-0.012
 veggies-0.012
Token then
Token ID788
Feature activation+5.65e-02

Pos logit contributions
 can+0.011
anda+0.011
ig+0.010
 slips+0.010
ite+0.010
Neg logit contributions
sets-0.011
 restaurants-0.011
 Son-0.011
 veggies-0.009
kids-0.009
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 can+0.065
ig+0.053
anda+0.052
 slips+0.050
 does+0.050
Neg logit contributions
sets-0.122
 restaurants-0.121
 Son-0.119
kids-0.114
 veggies-0.110
Token he
Token ID339
Feature activation+1.80e-02

Pos logit contributions
 can+0.018
anda+0.016
ig+0.015
 high+0.015
 slips+0.015
Neg logit contributions
sets-0.020
 restaurants-0.019
 Son-0.019
kids-0.018
 veggies-0.017
Token a
Token ID257
Feature activation-3.94e-03

Pos logit contributions
 can+0.021
anda+0.019
ig+0.018
 slips+0.018
 does+0.018
Neg logit contributions
sets-0.025
 restaurants-0.025
 Son-0.024
kids-0.022
 veggies-0.022
Token girl
Token ID2576
Feature activation+4.75e-02

Pos logit contributions
 Son+0.005
 restaurants+0.004
 couple+0.004
 Thanksgiving+0.004
 Father+0.004
Neg logit contributions
 can-0.008
 slips-0.008
ig-0.007
ows-0.007
anda-0.007
Token who
Token ID508
Feature activation+5.24e-02

Pos logit contributions
 can+0.059
 slips+0.058
anda+0.052
 does+0.052
 dod+0.052
Neg logit contributions
 restaurants-0.087
 Son-0.085
 Thanksgiving-0.082
sets-0.081
 Father-0.081
Token loved
Token ID6151
Feature activation+2.48e-02

Pos logit contributions
ig+0.096
ite+0.090
anda+0.090
oms+0.089
 slips+0.087
Neg logit contributions
 restaurants-0.064
 Son-0.061
 veggies-0.058
 foods-0.057
 Thanksgiving-0.057
Token sun
Token ID4252
Feature activation-7.80e-03

Pos logit contributions
 can+0.031
anda+0.027
ig+0.026
 high+0.026
 slips+0.026
Neg logit contributions
sets-0.049
 Son-0.048
 restaurants-0.048
kids-0.045
 avail-0.044
Tokensets
Token ID28709
Feature activation-3.11e-01

Pos logit contributions
 restaurants+0.018
 Son+0.018
sets+0.017
kids+0.017
Father+0.017
Neg logit contributions
 can-0.007
anda-0.006
ig-0.006
 does-0.005
ite-0.005
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
sets+0.674
 restaurants+0.642
 Son+0.625
 veggies+0.591
 Thanksgiving+0.580
Neg logit contributions
 can-0.348
anda-0.320
 slips-0.315
ig-0.312
 does-0.284
Token Every
Token ID3887
Feature activation-1.04e-01

Pos logit contributions
 can+0.009
anda+0.008
ig+0.008
 slips+0.008
 does+0.008
Neg logit contributions
sets-0.011
 restaurants-0.011
 Son-0.010
kids-0.010
 veggies-0.009
Token day
Token ID1110
Feature activation+3.23e-02

Pos logit contributions
sets+0.151
 restaurants+0.150
 Son+0.147
 Thanksgiving+0.134
kids+0.132
Neg logit contributions
 can-0.180
anda-0.177
ig-0.166
 slips-0.164
 dod-0.159
Token she
Token ID673
Feature activation+2.09e-02

Pos logit contributions
anda+0.041
 can+0.041
ig+0.040
 slips+0.038
ite+0.036
Neg logit contributions
 restaurants-0.061
sets-0.061
 Son-0.060
 veggies-0.055
 Daughter-0.055
Token would
Token ID561
Feature activation+1.10e-01

Pos logit contributions
anda+0.026
ig+0.025
 slips+0.024
ite+0.024
ider+0.024
Neg logit contributions
 restaurants-0.038
sets-0.037
 veggies-0.036
 vegetables-0.036
 Son-0.035
Token The
Token ID383
Feature activation-4.18e-02

Pos logit contributions
 can+0.020
anda+0.018
ig+0.018
 slips+0.018
 does+0.017
Neg logit contributions
sets-0.021
 restaurants-0.021
 Son-0.021
kids-0.019
 veggies-0.019
Token people
Token ID661
Feature activation-6.45e-02

Pos logit contributions
 Son+0.048
 vegetables+0.046
sets+0.046
 restaurants+0.046
 veggies+0.043
Neg logit contributions
anda-0.082
ig-0.081
 slips-0.081
ocus-0.080
ows-0.078
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
sets+0.105
 restaurants+0.102
 Son+0.102
 couple+0.095
kids+0.092
Neg logit contributions
 slips-0.087
anda-0.086
 dod-0.084
br-0.084
ig-0.084
Token the
Token ID262
Feature activation+3.67e-03

Pos logit contributions
 can+0.020
anda+0.020
ig+0.019
 slips+0.019
ite+0.018
Neg logit contributions
 restaurants-0.022
sets-0.022
 Son-0.021
kids-0.020
 veggies-0.020
Token sun
Token ID4252
Feature activation-1.05e-03

Pos logit contributions
ig+0.007
 slips+0.007
ocus+0.007
anda+0.007
ite+0.007
Neg logit contributions
 Son-0.004
 vegetables-0.004
 restaurants-0.004
 couple-0.004
sets-0.004
Tokensets
Token ID28709
Feature activation-3.09e-01

Pos logit contributions
sets+0.002
 Son+0.002
 restaurants+0.002
kids+0.002
 Father+0.002
Neg logit contributions
 can-0.001
anda-0.001
 slips-0.001
 else-0.001
ig-0.001
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
sets+0.493
 restaurants+0.419
 Son+0.416
 couple+0.385
 Frankie+0.371
Neg logit contributions
anda-0.511
 can-0.472
ig-0.466
 slips-0.465
 does-0.441
Token the
Token ID262
Feature activation+4.38e-03

Pos logit contributions
sets+0.025
 restaurants+0.025
 Son+0.024
kids+0.023
 veggies+0.022
Neg logit contributions
 can-0.023
anda-0.020
ig-0.020
 slips-0.019
 does-0.019
Token b
Token ID275
Feature activation-6.54e-02

Pos logit contributions
ig+0.009
anda+0.009
 slips+0.008
ocus+0.008
 can+0.008
Neg logit contributions
sets-0.005
 Son-0.005
 restaurants-0.005
 vegetables-0.005
 veggies-0.005
Tokenowing
Token ID7855
Feature activation-5.49e-02

Pos logit contributions
sets+0.097
 Son+0.095
 restaurants+0.092
kids+0.088
urg+0.085
Neg logit contributions
 can-0.118
 does-0.105
 slips-0.102
 high-0.101
 else-0.100
Token all
Token ID477
Feature activation-9.67e-02

Pos logit contributions
sets+0.097
 restaurants+0.087
 Son+0.085
 Frankie+0.078
kids+0.078
Neg logit contributions
 can-0.081
anda-0.079
ig-0.077
 slips-0.075
 else-0.072
Token adm
Token ID6178
Feature activation-1.58e-01

Pos logit contributions
 restaurants+0.026
sets+0.026
 Son+0.025
 veggies+0.023
 decisions+0.022
Neg logit contributions
anda-0.025
 can-0.025
ig-0.024
ite-0.022
 slips-0.022
Tokeniring
Token ID3428
Feature activation-1.27e-01

Pos logit contributions
sets+0.097
urg+0.079
 decisions+0.075
 Son+0.075
 restaurants+0.069
Neg logit contributions
 can-0.432
 slips-0.394
 slammed-0.388
 dod-0.387
 does-0.386
Token the
Token ID262
Feature activation+5.06e-03

Pos logit contributions
sets+0.163
 Son+0.162
 restaurants+0.158
 vegetables+0.147
 Frankie+0.143
Neg logit contributions
anda-0.246
ite-0.232
 can-0.228
 slips-0.226
ig-0.224
Token beautiful
Token ID4950
Feature activation+1.69e-02

Pos logit contributions
anda+0.009
 can+0.009
ig+0.009
 slips+0.009
 else+0.008
Neg logit contributions
sets-0.007
 restaurants-0.007
 Son-0.007
 vegetables-0.006
 veggies-0.006
Token sun
Token ID4252
Feature activation+1.12e-03

Pos logit contributions
 can+0.033
anda+0.032
 slips+0.031
ig+0.031
 else+0.030
Neg logit contributions
sets-0.022
 restaurants-0.021
 Son-0.020
 veggies-0.018
kids-0.018
Tokensets
Token ID28709
Feature activation-3.05e-01

Pos logit contributions
 can+0.001
anda+0.001
 slips+0.001
ig+0.001
 else+0.001
Neg logit contributions
sets-0.002
 restaurants-0.002
 Son-0.002
 veggies-0.002
kids-0.002
Token from
Token ID422
Feature activation-3.29e-02

Pos logit contributions
sets+0.620
 restaurants+0.548
 Son+0.545
 veggies+0.510
 vegetables+0.507
Neg logit contributions
anda-0.395
 can-0.368
 slips-0.359
ig-0.358
 does-0.324
Token afar
Token ID44246
Feature activation-5.50e-02

Pos logit contributions
 restaurants+0.046
 Son+0.046
sets+0.046
 avail+0.043
 Frankie+0.042
Neg logit contributions
anda-0.062
 can-0.058
ig-0.057
br-0.055
ite-0.055
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
sets+0.104
 restaurants+0.098
 Son+0.097
 vegetables+0.093
 foods+0.092
Neg logit contributions
anda-0.075
ig-0.070
 can-0.069
 slips-0.067
ite-0.067
Token<|endoftext|>
Token ID50256
Feature activation+7.63e-02

Pos logit contributions
 can+0.018
anda+0.016
ig+0.016
 slips+0.015
ite+0.015
Neg logit contributions
sets-0.021
 restaurants-0.021
 Son-0.020
kids-0.019
 veggies-0.018
TokenOnce
Token ID7454
Feature activation-3.20e-02

Pos logit contributions
anda+0.141
ig+0.129
 slips+0.128
 else+0.127
 dod+0.127
Neg logit contributions
sets-0.106
 restaurants-0.106
 Son-0.096
 Kevin-0.086
 Frankie-0.085
Token big
Token ID1263
Feature activation+3.73e-02

Pos logit contributions
 can+0.001
anda+0.001
ig+0.001
 slips+0.001
ite+0.001
Neg logit contributions
 restaurants-0.001
sets-0.001
 Son-0.001
kids-0.001
 veggies-0.001
Token mountains
Token ID12269
Feature activation-4.27e-02

Pos logit contributions
 can+0.062
anda+0.055
ig+0.054
 slips+0.053
 does+0.053
Neg logit contributions
sets-0.060
 Son-0.060
 restaurants-0.059
kids-0.054
 Daughter-0.054
Token and
Token ID290
Feature activation-1.49e-02

Pos logit contributions
sets+0.080
 restaurants+0.077
 Son+0.071
 foods+0.066
 vegetables+0.066
Neg logit contributions
 can-0.059
anda-0.058
ig-0.057
 slips-0.054
 dod-0.051
Token beautiful
Token ID4950
Feature activation+1.67e-02

Pos logit contributions
 Son+0.028
sets+0.027
kids+0.026
urg+0.026
 Daughter+0.026
Neg logit contributions
 can-0.022
 high-0.019
 else-0.019
 far-0.018
 did-0.017
Token sun
Token ID4252
Feature activation+7.24e-04

Pos logit contributions
 can+0.032
anda+0.029
ig+0.029
 slips+0.028
 does+0.028
Neg logit contributions
sets-0.023
 restaurants-0.023
 Son-0.022
kids-0.020
 veggies-0.019
Tokensets
Token ID28709
Feature activation-3.05e-01

Pos logit contributions
 can+0.001
anda+0.001
ig+0.001
 slips+0.001
 does+0.001
Neg logit contributions
 restaurants-0.001
sets-0.001
 Son-0.001
kids-0.001
 veggies-0.001
Token.
Token ID13
Feature activation+6.26e-03

Pos logit contributions
sets+0.657
 restaurants+0.614
 Son+0.595
 veggies+0.560
kids+0.550
Neg logit contributions
 can-0.346
anda-0.338
ig-0.330
 slips-0.309
ite-0.289
Token He
Token ID679
Feature activation+4.45e-02

Pos logit contributions
 can+0.008
 does+0.006
anda+0.006
 slips+0.006
 high+0.006
Neg logit contributions
sets-0.013
 restaurants-0.013
 avail-0.012
 Son-0.012
kids-0.012
Token felt
Token ID2936
Feature activation-7.53e-02

Pos logit contributions
 can+0.067
anda+0.061
ig+0.061
 slips+0.060
ite+0.059
Neg logit contributions
sets-0.074
 restaurants-0.074
 Son-0.074
kids-0.069
 veggies-0.067
Token happy
Token ID3772
Feature activation+1.18e-02

Pos logit contributions
 restaurants+0.139
 Son+0.137
sets+0.136
 veggies+0.126
 vegetables+0.122
Neg logit contributions
 can-0.101
anda-0.100
ig-0.099
 slips-0.093
ite-0.092
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
 can+0.015
anda+0.013
ig+0.013
 slips+0.012
 does+0.012
Neg logit contributions
sets-0.024
 restaurants-0.024
 Son-0.024
kids-0.022
 veggies-0.022
Token could
Token ID714
Feature activation+1.76e-01

Pos logit contributions
suit+0.032
ider+0.032
arl+0.032
anda+0.031
hy+0.031
Neg logit contributions
sets-0.049
 Son-0.046
 veggies-0.044
 restaurants-0.044
 vegetables-0.043
Token look
Token ID804
Feature activation-1.69e-02

Pos logit contributions
anda+0.294
 can+0.272
ig+0.264
 slips+0.263
suit+0.258
Neg logit contributions
sets-0.265
 restaurants-0.255
 Son-0.250
 veggies-0.232
 perfection-0.228
Token at
Token ID379
Feature activation-7.25e-03

Pos logit contributions
 restaurants+0.027
 Son+0.026
sets+0.026
 vegetables+0.025
 veggies+0.025
Neg logit contributions
anda-0.026
 slips-0.025
ig-0.025
ite-0.025
itty-0.024
Token the
Token ID262
Feature activation+5.86e-03

Pos logit contributions
sets+0.012
 restaurants+0.012
 Son+0.012
kids+0.011
 veggies+0.011
Neg logit contributions
 can-0.012
anda-0.010
ig-0.010
 slips-0.010
 high-0.010
Token sun
Token ID4252
Feature activation+5.29e-03

Pos logit contributions
 can+0.012
 high+0.010
anda+0.010
ite+0.010
 slips+0.010
Neg logit contributions
 restaurants-0.008
sets-0.007
 Son-0.007
kids-0.007
urg-0.007
Tokensets
Token ID28709
Feature activation-3.04e-01

Pos logit contributions
 can+0.007
anda+0.007
ig+0.007
 slips+0.007
 does+0.006
Neg logit contributions
sets-0.010
 restaurants-0.010
 Son-0.009
 veggies-0.009
kids-0.008
Token pieces
Token ID5207
Feature activation-1.43e-01

Pos logit contributions
sets+0.545
 restaurants+0.483
 Son+0.477
 veggies+0.439
umm+0.435
Neg logit contributions
anda-0.454
 can-0.433
 slips-0.412
ig-0.407
 does-0.386
Token every
Token ID790
Feature activation-9.19e-02

Pos logit contributions
sets+0.236
 restaurants+0.221
 Son+0.210
 vegetables+0.201
 Kevin+0.200
Neg logit contributions
anda-0.215
 can-0.192
 dod-0.190
 slips-0.186
 does-0.182
Token day
Token ID1110
Feature activation+4.25e-02

Pos logit contributions
 restaurants+0.116
 Thanksgiving+0.114
sets+0.113
 Son+0.110
 avail+0.103
Neg logit contributions
anda-0.177
 can-0.167
ows-0.166
 slips-0.159
 dod-0.157
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
anda+0.059
 can+0.056
ig+0.054
 slips+0.051
ite+0.051
Neg logit contributions
sets-0.078
 restaurants-0.075
 Son-0.074
 Kevin-0.069
 Thanksgiving-0.068
Token remember
Token ID3505
Feature activation-1.37e-03

Pos logit contributions
sets+0.016
 restaurants+0.016
 Son+0.015
 veggies+0.014
 vegetables+0.014
Neg logit contributions
anda-0.018
 can-0.017
ig-0.017
ite-0.016
 slips-0.016
Token big
Token ID1263
Feature activation+3.93e-02

Pos logit contributions
 can+0.003
anda+0.003
 high+0.003
 does+0.003
 else+0.003
Neg logit contributions
sets-0.003
 Son-0.002
 restaurants-0.002
 Daughter-0.002
 avail-0.002
Token mountains
Token ID12269
Feature activation-4.20e-02

Pos logit contributions
 can+0.063
anda+0.053
 high+0.052
 does+0.052
ig+0.052
Neg logit contributions
 Son-0.068
sets-0.066
 restaurants-0.063
 Daughter-0.062
Father-0.060
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
sets+0.082
 restaurants+0.080
 Son+0.075
kids+0.070
 foods+0.069
Neg logit contributions
 can-0.055
anda-0.053
ig-0.052
 slips-0.049
 does-0.046
Token beautiful
Token ID4950
Feature activation+1.88e-02

Pos logit contributions
 Son+0.033
ainer+0.032
kids+0.032
urg+0.031
umm+0.031
Neg logit contributions
 can-0.023
 high-0.021
 far-0.020
 else-0.019
 did-0.018
Token sun
Token ID4252
Feature activation+2.59e-03

Pos logit contributions
 can+0.034
anda+0.031
ig+0.030
 high+0.029
 does+0.029
Neg logit contributions
 Son-0.028
sets-0.027
 restaurants-0.027
 Daughter-0.025
kids-0.025
Tokensets
Token ID28709
Feature activation-3.04e-01

Pos logit contributions
 can+0.003
anda+0.002
ig+0.002
 high+0.002
 does+0.002
Neg logit contributions
 restaurants-0.005
 Son-0.005
sets-0.005
kids-0.005
 Daughter-0.005
Token.
Token ID13
Feature activation+4.61e-03

Pos logit contributions
sets+0.679
 restaurants+0.653
 Son+0.638
 veggies+0.598
kids+0.594
Neg logit contributions
 can-0.320
anda-0.294
ig-0.285
 slips-0.270
ite-0.253
Token But
Token ID887
Feature activation+9.89e-03

Pos logit contributions
 can+0.006
 does+0.005
 high+0.005
anda+0.005
 slips+0.004
Neg logit contributions
 restaurants-0.010
sets-0.010
 Son-0.009
 avail-0.009
urg-0.009
Token one
Token ID530
Feature activation-1.79e-01

Pos logit contributions
 can+0.013
anda+0.011
ig+0.011
 does+0.011
 high+0.011
Neg logit contributions
sets-0.020
 restaurants-0.019
 Son-0.019
kids-0.018
 veggies-0.018
Token day
Token ID1110
Feature activation+3.94e-02

Pos logit contributions
 restaurants+0.263
 Son+0.259
sets+0.256
 Thanksgiving+0.234
kids+0.234
Neg logit contributions
 can-0.309
anda-0.295
 slips-0.292
ig-0.287
 dod-0.275
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
 can+0.029
anda+0.021
ig+0.021
 slips+0.020
 does+0.019
Neg logit contributions
sets-0.102
 restaurants-0.101
 Son-0.099
kids-0.095
 veggies-0.093
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 can+0.015
anda+0.014
ig+0.013
 slips+0.013
ite+0.012
Neg logit contributions
sets-0.023
 restaurants-0.023
 Son-0.022
kids-0.021
 veggies-0.021
Token She
Token ID1375
Feature activation+2.07e-02

Pos logit contributions
 can+0.022
anda+0.020
ig+0.020
 slips+0.019
 does+0.019
Neg logit contributions
sets-0.020
 restaurants-0.020
 Son-0.019
kids-0.017
 Daughter-0.017
Token watched
Token ID7342
Feature activation-2.63e-02

Pos logit contributions
anda+0.032
ig+0.031
 can+0.031
ite+0.030
 slips+0.030
Neg logit contributions
 Son-0.031
sets-0.030
 restaurants-0.029
kids-0.028
 Father-0.028
Token the
Token ID262
Feature activation+4.75e-03

Pos logit contributions
sets+0.044
 restaurants+0.043
 Son+0.042
 veggies+0.039
kids+0.038
Neg logit contributions
 can-0.042
anda-0.039
ig-0.038
 slips-0.037
ite-0.037
Token sun
Token ID4252
Feature activation-1.05e-03

Pos logit contributions
 can+0.009
anda+0.008
ig+0.008
 slips+0.008
ite+0.008
Neg logit contributions
sets-0.007
 restaurants-0.007
 Son-0.007
 veggies-0.006
kids-0.006
Tokensets
Token ID28709
Feature activation-3.04e-01

Pos logit contributions
sets+0.002
 restaurants+0.002
 Son+0.002
kids+0.002
 veggies+0.002
Neg logit contributions
 can-0.001
anda-0.001
ig-0.001
 slips-0.001
 does-0.001
Token and
Token ID290
Feature activation-1.56e-02

Pos logit contributions
sets+0.578
 restaurants+0.535
 Son+0.527
 veggies+0.490
 vegetables+0.479
Neg logit contributions
 can-0.414
anda-0.405
ig-0.372
 slips-0.372
 does-0.352
Token fireworks
Token ID26056
Feature activation-7.62e-02

Pos logit contributions
sets+0.028
 restaurants+0.028
 Son+0.028
kids+0.026
 veggies+0.025
Neg logit contributions
 can-0.023
anda-0.020
ig-0.019
 slips-0.019
 does-0.019
Token for
Token ID329
Feature activation-7.87e-03

Pos logit contributions
sets+0.151
 restaurants+0.139
 Son+0.135
 veggies+0.125
urg+0.124
Neg logit contributions
anda-0.100
 can-0.099
ig-0.094
 slips-0.093
ite-0.087
Token hours
Token ID2250
Feature activation-4.04e-02

Pos logit contributions
sets+0.012
 Son+0.012
 restaurants+0.012
kids+0.011
 Daughter+0.011
Neg logit contributions
 can-0.013
 high-0.012
ig-0.011
 else-0.011
 does-0.011
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
sets+0.082
 restaurants+0.081
 Son+0.078
kids+0.075
 Thanksgiving+0.073
Neg logit contributions
 can-0.049
anda-0.045
 slips-0.043
 does-0.041
ig-0.041
Token bear
Token ID6842
Feature activation+1.06e-01

Pos logit contributions
 slips+0.027
 can+0.027
 does+0.025
 cannot+0.025
ite+0.025
Neg logit contributions
 Son-0.020
 restaurants-0.018
 couple-0.018
 Father-0.017
kids-0.017
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
 can+0.124
 slips+0.114
 does+0.104
anda+0.103
ig+0.103
Neg logit contributions
sets-0.214
 restaurants-0.209
 Son-0.207
kids-0.193
 Father-0.192
Token He
Token ID679
Feature activation+4.51e-02

Pos logit contributions
 can+0.017
anda+0.016
ig+0.016
 slips+0.015
 does+0.015
Neg logit contributions
 restaurants-0.016
sets-0.016
 Son-0.015
kids-0.014
 veggies-0.014
Token loved
Token ID6151
Feature activation+3.57e-02

Pos logit contributions
arl+0.087
oms+0.083
ite+0.083
ows+0.081
ig+0.080
Neg logit contributions
 restaurants-0.049
 veggies-0.047
 vegetables-0.047
 Son-0.046
kids-0.045
Token sun
Token ID4252
Feature activation+3.68e-03

Pos logit contributions
 can+0.040
 high+0.032
anda+0.032
 does+0.031
ig+0.030
Neg logit contributions
 Son-0.077
sets-0.077
 restaurants-0.075
 avail-0.072
kids-0.071
Tokensets
Token ID28709
Feature activation-3.03e-01

Pos logit contributions
 can+0.007
anda+0.006
ig+0.006
 high+0.006
 does+0.006
Neg logit contributions
 restaurants-0.005
 Son-0.005
sets-0.005
kids-0.005
 Daughter-0.005
Token and
Token ID290
Feature activation-1.43e-02

Pos logit contributions
sets+0.659
 restaurants+0.637
 Son+0.626
 veggies+0.585
kids+0.578
Neg logit contributions
 can-0.332
anda-0.301
ig-0.291
 slips-0.291
 does-0.265
Token he
Token ID339
Feature activation+1.41e-02

Pos logit contributions
 Son+0.031
 decisions+0.030
sets+0.030
umm+0.030
 restaurants+0.030
Neg logit contributions
 can-0.018
 high-0.013
 does-0.013
 did-0.013
 has-0.012
Token wanted
Token ID2227
Feature activation-4.42e-02

Pos logit contributions
 can+0.017
anda+0.017
 slips+0.017
ig+0.016
ite+0.016
Neg logit contributions
sets-0.025
 restaurants-0.025
 Son-0.025
 veggies-0.023
kids-0.023
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
 Son+0.104
sets+0.104
 restaurants+0.100
umm+0.100
kids+0.099
Neg logit contributions
 can-0.041
 high-0.034
 else-0.033
 does-0.028
anda-0.026
Token take
Token ID1011
Feature activation-5.65e-02

Pos logit contributions
sets+0.027
 Son+0.027
 restaurants+0.026
kids+0.025
umm+0.024
Neg logit contributions
 can-0.020
 high-0.017
 else-0.016
 does-0.016
anda-0.016
Token bakery
Token ID38164
Feature activation-3.69e-02

Pos logit contributions
 restaurants+0.010
 Son+0.009
sets+0.009
 vegetables+0.009
 Father+0.008
Neg logit contributions
ig-0.013
 can-0.012
anda-0.012
 slips-0.012
ite-0.012
Token to
Token ID284
Feature activation-3.14e-02

Pos logit contributions
sets+0.081
 restaurants+0.081
 Son+0.080
kids+0.075
 veggies+0.074
Neg logit contributions
 can-0.039
anda-0.034
ig-0.033
 slips-0.032
 does-0.030
Token buy
Token ID2822
Feature activation-1.50e-01

Pos logit contributions
sets+0.053
 Son+0.048
 restaurants+0.047
urg+0.045
kids+0.045
Neg logit contributions
 can-0.053
 high-0.045
 else-0.045
 does-0.044
 slips-0.043
Token the
Token ID262
Feature activation-5.55e-03

Pos logit contributions
sets+0.240
 restaurants+0.230
 Son+0.225
kids+0.208
urg+0.206
Neg logit contributions
 can-0.257
anda-0.222
 high-0.219
 does-0.218
 else-0.216
Token y
Token ID331
Feature activation-1.53e-02

Pos logit contributions
 restaurants+0.007
sets+0.007
 Son+0.007
 veggies+0.007
 vegetables+0.006
Neg logit contributions
 can-0.010
ig-0.010
anda-0.010
 slips-0.010
ite-0.010
Tokenumm
Token ID13929
Feature activation-3.03e-01

Pos logit contributions
sets+0.016
 restaurants+0.015
 Son+0.015
Father+0.013
 Father+0.013
Neg logit contributions
 can-0.034
anda-0.032
ig-0.032
ite-0.031
 slips-0.030
Tokeniest
Token ID6386
Feature activation-1.17e-01

Pos logit contributions
sets+0.454
 restaurants+0.452
 Son+0.440
kids+0.402
 veggies+0.389
Neg logit contributions
 can-0.541
anda-0.491
ig-0.488
 slips-0.468
ite-0.467
Token pastry
Token ID41738
Feature activation-2.54e-02

Pos logit contributions
 restaurants+0.145
sets+0.144
 Son+0.140
 veggies+0.127
kids+0.126
Neg logit contributions
 can-0.231
anda-0.218
ig-0.214
 slips-0.212
 does-0.206
Token he
Token ID339
Feature activation+1.12e-02

Pos logit contributions
sets+0.049
 restaurants+0.048
 Son+0.047
 veggies+0.045
 vegetables+0.045
Neg logit contributions
 can-0.032
ig-0.031
anda-0.031
 slips-0.030
 dod-0.028
Token could
Token ID714
Feature activation+1.64e-01

Pos logit contributions
ig+0.017
anda+0.016
ite+0.016
 slips+0.015
uddle+0.015
Neg logit contributions
 restaurants-0.018
 veggies-0.018
 Son-0.018
kids-0.017
 vegetables-0.017
Token find
Token ID1064
Feature activation-1.21e-01

Pos logit contributions
 can+0.263
anda+0.241
ig+0.236
 slips+0.231
 does+0.225
Neg logit contributions
sets-0.271
 restaurants-0.269
 Son-0.263
kids-0.241
 veggies-0.239
Token made
Token ID925
Feature activation-4.97e-02

Pos logit contributions
ite+0.005
ig+0.005
anda+0.005
br+0.005
arl+0.004
Neg logit contributions
 Son-0.004
 restaurants-0.004
 Father-0.004
kids-0.004
sets-0.004
Token him
Token ID683
Feature activation-3.56e-02

Pos logit contributions
sets+0.074
 restaurants+0.074
 Son+0.072
kids+0.065
 veggies+0.065
Neg logit contributions
 can-0.087
anda-0.081
ig-0.080
 slips-0.078
ite-0.077
Token an
Token ID281
Feature activation-8.20e-02

Pos logit contributions
 restaurants+0.056
sets+0.056
 Son+0.054
 veggies+0.050
 decisions+0.049
Neg logit contributions
 can-0.057
anda-0.054
ig-0.054
 slips-0.051
ite-0.051
Token even
Token ID772
Feature activation-1.45e-01

Pos logit contributions
sets+0.142
 restaurants+0.138
 Son+0.133
 veggies+0.121
kids+0.120
Neg logit contributions
 can-0.129
anda-0.120
ig-0.115
 high-0.110
 slips-0.110
Token y
Token ID331
Feature activation-1.46e-02

Pos logit contributions
 vegetables+0.188
 veggies+0.185
 restaurants+0.184
 Son+0.181
sets+0.181
Neg logit contributions
anda-0.252
ite-0.247
ig-0.245
uffy-0.232
 can-0.225
Tokenumm
Token ID13929
Feature activation-3.00e-01

Pos logit contributions
sets+0.017
 restaurants+0.016
 Son+0.014
kids+0.014
Father+0.014
Neg logit contributions
 can-0.031
ig-0.031
anda-0.030
ite-0.029
 slips-0.028
Tokenier
Token ID959
Feature activation-5.36e-02

Pos logit contributions
sets+0.395
 restaurants+0.390
 Son+0.383
kids+0.339
 veggies+0.336
Neg logit contributions
 can-0.569
anda-0.532
 slips-0.517
ig-0.510
 does-0.504
Token snack
Token ID26906
Feature activation-8.17e-02

Pos logit contributions
sets+0.098
 restaurants+0.096
 Son+0.094
kids+0.087
 veggies+0.085
Neg logit contributions
 can-0.079
anda-0.069
ig-0.068
 slips-0.066
 does-0.065
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
sets+0.146
 restaurants+0.140
 Son+0.138
 vegetables+0.134
 veggies+0.133
Neg logit contributions
anda-0.111
ig-0.109
 can-0.103
 slips-0.100
ite-0.099
Token they
Token ID484
Feature activation-5.75e-02

Pos logit contributions
sets+0.033
 restaurants+0.033
 Son+0.032
 veggies+0.029
kids+0.029
Neg logit contributions
 can-0.033
anda-0.031
ig-0.030
 slips-0.029
ite-0.029
Token both
Token ID1111
Feature activation-5.17e-02

Pos logit contributions
sets+0.086
 Son+0.085
 restaurants+0.084
 veggies+0.080
 vegetables+0.078
Neg logit contributions
anda-0.091
ig-0.091
 can-0.087
 slips-0.087
ite-0.086
Token ice
Token ID4771
Feature activation-2.48e-02

Pos logit contributions
 can+0.015
anda+0.013
ig+0.013
 slips+0.013
 does+0.012
Neg logit contributions
sets-0.013
 restaurants-0.013
 Son-0.013
kids-0.012
 veggies-0.012
Token cream
Token ID8566
Feature activation+3.20e-02

Pos logit contributions
sets+0.046
 restaurants+0.046
 Son+0.045
kids+0.041
 veggies+0.041
Neg logit contributions
 can-0.035
anda-0.031
ig-0.030
 slips-0.030
 does-0.029
Token taste
Token ID6938
Feature activation-1.81e-01

Pos logit contributions
 can+0.039
anda+0.034
ig+0.033
 slips+0.032
 does+0.031
Neg logit contributions
sets-0.066
 restaurants-0.065
 Son-0.064
kids-0.060
 veggies-0.059
Token even
Token ID772
Feature activation-1.42e-01

Pos logit contributions
sets+0.373
 restaurants+0.370
 Son+0.365
 veggies+0.343
 vegetables+0.334
Neg logit contributions
 can-0.211
anda-0.197
ig-0.189
 slips-0.183
ite-0.179
Token y
Token ID331
Feature activation-7.56e-03

Pos logit contributions
sets+0.261
 restaurants+0.260
 Son+0.257
 veggies+0.242
 vegetables+0.235
Neg logit contributions
 can-0.194
anda-0.185
ig-0.180
 slips-0.172
ite-0.172
Tokenumm
Token ID13929
Feature activation-2.95e-01

Pos logit contributions
 restaurants+0.008
sets+0.008
 Son+0.007
kids+0.007
Father+0.006
Neg logit contributions
 can-0.017
anda-0.016
ig-0.016
ite-0.015
 high-0.015
Tokenier
Token ID959
Feature activation-5.11e-02

Pos logit contributions
sets+0.493
 restaurants+0.492
 Son+0.478
kids+0.441
 veggies+0.431
Neg logit contributions
 can-0.472
anda-0.427
ig-0.421
 slips-0.407
 does-0.399
Token!
Token ID0
Feature activation-7.00e-02

Pos logit contributions
sets+0.114
 restaurants+0.114
 Son+0.112
kids+0.105
 veggies+0.104
Neg logit contributions
 can-0.052
anda-0.044
ig-0.043
 slips-0.041
 does-0.040
Token And
Token ID843
Feature activation+5.00e-02

Pos logit contributions
sets+0.109
 Son+0.108
 restaurants+0.107
 Daughter+0.097
kids+0.097
Neg logit contributions
 can-0.117
anda-0.109
ig-0.108
 slips-0.107
 does-0.103
Token every
Token ID790
Feature activation-9.71e-02

Pos logit contributions
 can+0.081
anda+0.076
ig+0.074
 slips+0.074
ite+0.071
Neg logit contributions
sets-0.080
 restaurants-0.080
 Son-0.079
 veggies-0.071
kids-0.071
Token time
Token ID640
Feature activation+4.77e-02

Pos logit contributions
 restaurants+0.166
 Son+0.164
sets+0.158
 Thanksgiving+0.151
kids+0.148
Neg logit contributions
 can-0.145
anda-0.140
ig-0.131
 slips-0.129
ite-0.128
Token to
Token ID284
Feature activation-2.50e-02

Pos logit contributions
sets+0.158
 restaurants+0.150
 Son+0.148
 vegetables+0.146
 veggies+0.144
Neg logit contributions
anda-0.099
 can-0.099
ite-0.096
ig-0.093
 slips-0.092
Token make
Token ID787
Feature activation-7.42e-02

Pos logit contributions
sets+0.043
 restaurants+0.040
 Son+0.039
kids+0.038
 Daughter+0.036
Neg logit contributions
 can-0.042
ig-0.036
 high-0.036
 slips-0.035
 does-0.035
Token food
Token ID2057
Feature activation-1.20e-01

Pos logit contributions
sets+0.125
 restaurants+0.122
 Son+0.121
kids+0.116
Father+0.109
Neg logit contributions
 can-0.118
 high-0.103
 else-0.102
ig-0.100
 does-0.100
Token look
Token ID804
Feature activation-2.74e-02

Pos logit contributions
sets+0.243
 restaurants+0.241
 Son+0.232
 veggies+0.223
 vegetables+0.221
Neg logit contributions
anda-0.140
 can-0.137
ig-0.133
ite-0.128
 slips-0.126
Token y
Token ID331
Feature activation-5.73e-03

Pos logit contributions
sets+0.052
 restaurants+0.052
 Son+0.051
 veggies+0.048
 vegetables+0.046
Neg logit contributions
 can-0.036
anda-0.034
ig-0.033
 slips-0.032
ite-0.032
Tokenumm
Token ID13929
Feature activation-2.93e-01

Pos logit contributions
 restaurants+0.007
sets+0.006
 Son+0.006
kids+0.006
Father+0.006
Neg logit contributions
 can-0.012
ig-0.011
anda-0.011
 high-0.011
ite-0.011
Tokenier
Token ID959
Feature activation-5.00e-02

Pos logit contributions
 restaurants+0.551
sets+0.547
 Son+0.545
kids+0.509
Father+0.505
Neg logit contributions
 can-0.408
ig-0.366
anda-0.356
 slips-0.336
 does-0.332
Token than
Token ID621
Feature activation-1.73e-02

Pos logit contributions
 restaurants+0.114
sets+0.114
 Son+0.113
kids+0.109
 Daughter+0.106
Neg logit contributions
 can-0.050
anda-0.038
ig-0.038
 does-0.036
 slips-0.036
Token before
Token ID878
Feature activation-3.85e-02

Pos logit contributions
 restaurants+0.024
sets+0.024
 Son+0.024
 veggies+0.022
 vegetables+0.022
Neg logit contributions
anda-0.031
 can-0.031
ig-0.031
ite-0.030
 slips-0.029
Token.
Token ID13
Feature activation+6.23e-03

Pos logit contributions
sets+0.088
 restaurants+0.087
 Son+0.086
kids+0.081
 veggies+0.080
Neg logit contributions
 can-0.038
anda-0.032
ig-0.031
 slips-0.030
 does-0.029
Token She
Token ID1375
Feature activation+1.24e-02

Pos logit contributions
 can+0.010
anda+0.009
ig+0.009
 slips+0.009
 does+0.008
Neg logit contributions
sets-0.010
 restaurants-0.010
 Son-0.010
kids-0.009
urg-0.009
Token little
Token ID1310
Feature activation-3.96e-02

Pos logit contributions
 can+0.000
ig+0.000
anda+0.000
 slips+0.000
ite+0.000
Neg logit contributions
 restaurants-0.000
 Son-0.000
sets-0.000
 veggies-0.000
kids-0.000
Token boy
Token ID2933
Feature activation+5.71e-02

Pos logit contributions
sets+0.064
 restaurants+0.064
 Son+0.063
 veggies+0.057
kids+0.057
Neg logit contributions
 can-0.065
anda-0.060
ig-0.059
 slips-0.057
 does-0.055
Token had
Token ID550
Feature activation+6.23e-02

Pos logit contributions
 can+0.085
anda+0.081
ig+0.079
ite+0.077
 slips+0.076
Neg logit contributions
 Son-0.097
 restaurants-0.095
sets-0.093
kids-0.088
 Father-0.087
Token the
Token ID262
Feature activation+3.84e-04

Pos logit contributions
 can+0.097
anda+0.085
ig+0.083
 slips+0.083
 does+0.081
Neg logit contributions
sets-0.109
 restaurants-0.107
 Son-0.104
kids-0.097
 veggies-0.095
Token y
Token ID331
Feature activation+3.65e-04

Pos logit contributions
 can+0.001
anda+0.001
ig+0.001
 slips+0.001
ite+0.001
Neg logit contributions
 restaurants-0.001
 Son-0.001
sets-0.001
 veggies-0.001
kids-0.001
Tokenumm
Token ID13929
Feature activation-2.92e-01

Pos logit contributions
 can+0.001
ig+0.001
anda+0.001
ite+0.001
 slips+0.001
Neg logit contributions
sets-0.000
 restaurants-0.000
 Son-0.000
Father-0.000
 Father-0.000
Tokeniest
Token ID6386
Feature activation-1.06e-01

Pos logit contributions
sets+0.395
 restaurants+0.393
 Son+0.383
kids+0.344
 veggies+0.339
Neg logit contributions
 can-0.553
anda-0.513
ig-0.500
 slips-0.496
 does-0.484
Token treat
Token ID2190
Feature activation-5.45e-02

Pos logit contributions
 restaurants+0.158
sets+0.157
 Son+0.153
kids+0.139
 veggies+0.138
Neg logit contributions
 can-0.187
anda-0.172
ig-0.169
 slips-0.166
 does-0.162
Token.
Token ID13
Feature activation+3.09e-03

Pos logit contributions
 Son+0.089
sets+0.088
 restaurants+0.087
 veggies+0.081
 vegetables+0.079
Neg logit contributions
 can-0.082
anda-0.082
ig-0.081
 slips-0.079
 dod-0.075
Token He
Token ID679
Feature activation+4.01e-02

Pos logit contributions
 can+0.005
 slips+0.004
anda+0.004
 does+0.004
ite+0.003
Neg logit contributions
sets-0.006
 restaurants-0.006
 Son-0.005
 avail-0.005
kids-0.005
Token asked
Token ID1965
Feature activation-9.32e-02

Pos logit contributions
 can+0.056
anda+0.054
ig+0.054
ite+0.053
 slips+0.052
Neg logit contributions
 Son-0.070
 restaurants-0.068
sets-0.066
 veggies-0.064
kids-0.064
Token it
Token ID340
Feature activation-3.85e-02

Pos logit contributions
 can+0.097
 high+0.089
ig+0.078
 does+0.076
 slips+0.073
Neg logit contributions
sets-0.150
 restaurants-0.147
kids-0.144
 Son-0.140
umm-0.138
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
sets+0.077
 restaurants+0.076
 Son+0.075
 veggies+0.069
kids+0.069
Neg logit contributions
 can-0.048
anda-0.044
ig-0.042
 slips-0.041
ite-0.040
Token found
Token ID1043
Feature activation-2.94e-02

Pos logit contributions
sets+0.036
 restaurants+0.036
 Son+0.035
kids+0.033
 veggies+0.032
Neg logit contributions
 can-0.019
anda-0.016
ig-0.015
 slips-0.015
 does-0.015
Token some
Token ID617
Feature activation-1.94e-01

Pos logit contributions
sets+0.058
 restaurants+0.056
kids+0.053
 Son+0.053
 Daughter+0.051
Neg logit contributions
 can-0.043
 high-0.036
 else-0.035
anda-0.035
 does-0.034
Token y
Token ID331
Feature activation+9.69e-04

Pos logit contributions
 restaurants+0.263
sets+0.252
 Son+0.251
 veggies+0.245
 vegetables+0.241
Neg logit contributions
 can-0.356
ig-0.352
anda-0.350
 slips-0.341
ite-0.338
Tokenumm
Token ID13929
Feature activation-2.91e-01

Pos logit contributions
 can+0.002
ig+0.002
anda+0.002
 high+0.002
 slips+0.002
Neg logit contributions
sets-0.001
 Son-0.001
 restaurants-0.001
Father-0.001
kids-0.001
Tokenmy
Token ID1820
Feature activation-3.53e-02

Pos logit contributions
sets+0.470
 restaurants+0.465
 Son+0.456
kids+0.417
 veggies+0.409
Neg logit contributions
 can-0.485
anda-0.442
ig-0.436
 slips-0.424
 does-0.414
Token food
Token ID2057
Feature activation-1.14e-01

Pos logit contributions
sets+0.061
 restaurants+0.060
 Son+0.059
kids+0.055
 avail+0.054
Neg logit contributions
 can-0.055
anda-0.050
ig-0.048
 slips-0.046
 does-0.046
Token inside
Token ID2641
Feature activation+1.54e-02

Pos logit contributions
 restaurants+0.213
sets+0.209
 Son+0.206
 vegetables+0.201
 veggies+0.200
Neg logit contributions
ig-0.147
anda-0.145
 can-0.137
ite-0.136
 slips-0.135
Token.
Token ID13
Feature activation+7.12e-03

Pos logit contributions
 can+0.016
anda+0.015
ig+0.015
 slips+0.014
ite+0.013
Neg logit contributions
 restaurants-0.034
sets-0.034
 Son-0.033
 veggies-0.031
kids-0.031
Token Leo
Token ID19632
Feature activation+5.59e-02

Pos logit contributions
 can+0.009
 does+0.008
 slips+0.008
 high+0.008
anda+0.008
Neg logit contributions
sets-0.015
 restaurants-0.014
 Son-0.014
kids-0.013
 avail-0.013
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
anda+0.048
 can+0.048
ig+0.042
 does+0.042
 slips+0.042
Neg logit contributions
sets-0.066
 Son-0.065
 restaurants-0.063
 veggies-0.059
 vegetables-0.059
Token it
Token ID340
Feature activation-3.87e-02

Pos logit contributions
sets+0.030
 restaurants+0.029
 Son+0.029
kids+0.027
 avail+0.026
Neg logit contributions
 can-0.022
anda-0.019
ig-0.019
 slips-0.019
 does-0.018
Token was
Token ID373
Feature activation+2.27e-02

Pos logit contributions
sets+0.073
 restaurants+0.071
 Son+0.070
 veggies+0.065
kids+0.064
Neg logit contributions
 can-0.052
anda-0.049
ig-0.047
 slips-0.046
ite-0.046
Token the
Token ID262
Feature activation+4.37e-03

Pos logit contributions
 can+0.035
anda+0.034
ig+0.033
 slips+0.032
ite+0.032
Neg logit contributions
sets-0.038
 restaurants-0.038
 Son-0.037
 veggies-0.034
 Thanksgiving-0.034
Token y
Token ID331
Feature activation-4.04e-03

Pos logit contributions
 can+0.008
anda+0.007
ig+0.007
 slips+0.007
ite+0.007
Neg logit contributions
 Son-0.007
sets-0.006
 restaurants-0.006
 veggies-0.006
 vegetables-0.006
Tokenumm
Token ID13929
Feature activation-2.91e-01

Pos logit contributions
sets+0.003
 restaurants+0.003
 Son+0.003
kids+0.003
Father+0.003
Neg logit contributions
 can-0.010
ig-0.009
anda-0.009
ite-0.009
 slips-0.009
Tokeniest
Token ID6386
Feature activation-1.06e-01

Pos logit contributions
sets+0.414
 restaurants+0.410
 Son+0.400
kids+0.361
 veggies+0.356
Neg logit contributions
 can-0.533
anda-0.493
ig-0.481
 slips-0.476
 does-0.465
Token ice
Token ID4771
Feature activation-1.93e-02

Pos logit contributions
sets+0.163
 restaurants+0.162
 Son+0.158
kids+0.144
 veggies+0.142
Neg logit contributions
 can-0.183
anda-0.168
ig-0.164
 slips-0.162
 does-0.158
Token cream
Token ID8566
Feature activation+3.68e-02

Pos logit contributions
sets+0.030
 restaurants+0.029
 Son+0.029
 veggies+0.027
 flavours+0.027
Neg logit contributions
 can-0.032
anda-0.029
 slips-0.029
ig-0.028
 does-0.028
Token she
Token ID673
Feature activation+2.78e-02

Pos logit contributions
anda+0.051
 can+0.049
ig+0.048
 slips+0.047
 does+0.045
Neg logit contributions
sets-0.067
 Son-0.065
 restaurants-0.063
 veggies-0.059
umm-0.058
Token ever
Token ID1683
Feature activation+3.30e-02

Pos logit contributions
anda+0.039
ig+0.039
 slams+0.037
ite+0.036
ows+0.036
Neg logit contributions
 Son-0.046
 veggies-0.045
kids-0.044
 restaurants-0.044
 french-0.043
Token little
Token ID1310
Feature activation-3.50e-02

Pos logit contributions
anda+0.007
ig+0.007
ows+0.006
as+0.006
ite+0.006
Neg logit contributions
 restaurants-0.005
 vegetables-0.005
 Son-0.005
 veggies-0.005
 foods-0.005
Token cracks
Token ID23217
Feature activation-5.51e-02

Pos logit contributions
 restaurants+0.051
sets+0.048
 Son+0.048
 vegetables+0.046
 veggies+0.045
Neg logit contributions
anda-0.060
 can-0.060
ig-0.057
 does-0.056
 slips-0.055
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
sets+0.097
 restaurants+0.094
 Son+0.090
 vegetables+0.089
 veggies+0.085
Neg logit contributions
anda-0.079
ig-0.070
 can-0.069
ite-0.067
 dod-0.067
Token cre
Token ID1126
Feature activation-1.20e-02

Pos logit contributions
 restaurants+0.022
sets+0.022
 Son+0.021
 veggies+0.020
 vegetables+0.020
Neg logit contributions
anda-0.023
ig-0.022
 can-0.022
ite-0.021
 slips-0.020
Tokenv
Token ID85
Feature activation-6.38e-02

Pos logit contributions
sets+0.021
 restaurants+0.021
 Son+0.021
kids+0.019
 veggies+0.019
Neg logit contributions
 can-0.018
anda-0.017
ig-0.016
 slips-0.016
ite-0.015
Tokenices
Token ID1063
Feature activation-2.90e-01

Pos logit contributions
sets+0.073
 restaurants+0.072
 Son+0.070
kids+0.062
 veggies+0.061
Neg logit contributions
 can-0.135
anda-0.126
ig-0.124
 slips-0.122
 does-0.120
Token of
Token ID286
Feature activation+6.66e-04

Pos logit contributions
sets+0.521
 Son+0.492
 restaurants+0.482
 Kevin+0.451
 Father+0.445
Neg logit contributions
anda-0.374
ig-0.367
 can-0.346
 slips-0.341
 dod-0.333
Token the
Token ID262
Feature activation+3.34e-03

Pos logit contributions
anda+0.001
 can+0.001
ig+0.001
ite+0.001
 slips+0.001
Neg logit contributions
 restaurants-0.001
 Son-0.001
sets-0.001
 vegetables-0.001
 veggies-0.001
Token rocks
Token ID12586
Feature activation+5.20e-03

Pos logit contributions
anda+0.006
ite+0.006
ig+0.006
 can+0.006
ows+0.006
Neg logit contributions
 restaurants-0.004
 vegetables-0.004
 veggies-0.004
 foods-0.004
 Son-0.004
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
anda+0.007
ig+0.006
 dod+0.006
itty+0.006
 can+0.006
Neg logit contributions
sets-0.009
 restaurants-0.009
 vegetables-0.009
 veggies-0.008
 Son-0.008
Token trees
Token ID7150
Feature activation+1.61e-02

Pos logit contributions
 restaurants+0.020
sets+0.019
 Son+0.019
 veggies+0.018
 vegetables+0.018
Neg logit contributions
anda-0.023
 can-0.023
ig-0.023
ite-0.022
 slips-0.021
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
sets+0.004
 restaurants+0.004
 Son+0.004
 foods+0.003
 veggies+0.003
Neg logit contributions
 can-0.003
anda-0.003
ig-0.003
 slips-0.002
ite-0.002
Token They
Token ID1119
Feature activation-5.27e-02

Pos logit contributions
 can+0.024
ig+0.023
anda+0.023
 slips+0.022
ite+0.021
Neg logit contributions
 Son-0.019
 restaurants-0.018
sets-0.018
 Father-0.016
 Daughter-0.016
Token had
Token ID550
Feature activation+6.01e-02

Pos logit contributions
 Son+0.079
 restaurants+0.075
sets+0.071
 Father+0.071
kids+0.068
Neg logit contributions
ig-0.089
anda-0.086
ite-0.082
br-0.081
 slips-0.080
Token stopped
Token ID5025
Feature activation+3.25e-02

Pos logit contributions
 can+0.091
anda+0.082
ig+0.080
 slips+0.079
 does+0.076
Neg logit contributions
sets-0.106
 restaurants-0.106
 Son-0.103
kids-0.095
 veggies-0.094
Token qu
Token ID627
Feature activation-4.91e-02

Pos logit contributions
 can+0.053
anda+0.048
ig+0.047
 slips+0.046
 does+0.045
Neg logit contributions
sets-0.054
 restaurants-0.053
 Son-0.052
kids-0.048
 veggies-0.047
Tokenarre
Token ID9624
Feature activation-2.90e-01

Pos logit contributions
sets+0.047
 restaurants+0.047
 Son+0.045
kids+0.039
 veggies+0.038
Neg logit contributions
 can-0.113
anda-0.107
ig-0.105
 slips-0.103
ite-0.102
Tokenling
Token ID1359
Feature activation-1.25e-01

Pos logit contributions
 Son+0.458
 restaurants+0.453
sets+0.447
 Father+0.412
 Daughter+0.411
Neg logit contributions
 can-0.488
anda-0.436
 does-0.432
ig-0.431
 high-0.424
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
sets+0.218
 restaurants+0.208
 Son+0.203
 foods+0.192
 vegetables+0.191
Neg logit contributions
anda-0.190
ig-0.184
 can-0.178
ite-0.173
 slips-0.171
Token To
Token ID1675
Feature activation-8.08e-03

Pos logit contributions
 can+0.020
ig+0.020
anda+0.020
 slips+0.019
ite+0.018
Neg logit contributions
 Son-0.016
 restaurants-0.016
sets-0.016
 Daughter-0.014
 Father-0.014
Token celebrate
Token ID10648
Feature activation-7.43e-02

Pos logit contributions
 restaurants+0.013
sets+0.013
 Son+0.012
kids+0.011
 veggies+0.011
Neg logit contributions
 can-0.014
anda-0.013
ig-0.012
 slips-0.012
ite-0.012
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
 restaurants+0.161
sets+0.159
 Son+0.158
 foods+0.146
 veggies+0.145
Neg logit contributions
 can-0.079
anda-0.077
ig-0.072
ite-0.070
 slips-0.069
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
anda+0.027
 can+0.027
 slips+0.026
ig+0.026
 does+0.024
Neg logit contributions
 Son-0.036
sets-0.035
 restaurants-0.035
 vegetables-0.034
 veggies-0.034
Token It
Token ID632
Feature activation+1.77e-02

Pos logit contributions
 can+0.026
anda+0.023
ig+0.023
 slips+0.022
 does+0.022
Neg logit contributions
sets-0.027
 restaurants-0.027
 Son-0.026
kids-0.024
 veggies-0.024
Token was
Token ID373
Feature activation+2.40e-02

Pos logit contributions
 can+0.024
anda+0.023
ig+0.023
 slips+0.022
ite+0.022
Neg logit contributions
sets-0.032
 Son-0.032
 restaurants-0.032
 veggies-0.029
kids-0.029
Token the
Token ID262
Feature activation+6.14e-03

Pos logit contributions
 can+0.039
anda+0.037
ig+0.037
ite+0.036
 slips+0.035
Neg logit contributions
 Son-0.039
sets-0.038
 restaurants-0.038
 veggies-0.035
 Thanksgiving-0.034
Token y
Token ID331
Feature activation-2.08e-03

Pos logit contributions
 can+0.011
ig+0.011
anda+0.011
 slips+0.010
ite+0.010
Neg logit contributions
sets-0.009
 Son-0.009
 restaurants-0.009
 veggies-0.008
 vegetables-0.008
Tokenumm
Token ID13929
Feature activation-2.89e-01

Pos logit contributions
sets+0.002
 restaurants+0.002
 Son+0.002
kids+0.002
Father+0.002
Neg logit contributions
 can-0.005
ig-0.005
anda-0.005
ite-0.004
 slips-0.004
Tokeniest
Token ID6386
Feature activation-1.03e-01

Pos logit contributions
sets+0.377
 restaurants+0.371
 Son+0.363
kids+0.322
 veggies+0.321
Neg logit contributions
 can-0.560
anda-0.524
ig-0.508
 slips-0.506
 does-0.493
Token thing
Token ID1517
Feature activation-4.48e-02

Pos logit contributions
 restaurants+0.151
sets+0.148
 Son+0.143
kids+0.134
 veggies+0.133
Neg logit contributions
 can-0.183
anda-0.172
ig-0.168
 slips-0.168
ite-0.163
Token he
Token ID339
Feature activation+1.82e-02

Pos logit contributions
sets+0.066
 Son+0.065
 restaurants+0.061
 Kevin+0.060
 vegetables+0.058
Neg logit contributions
ig-0.076
anda-0.074
 slips-0.071
 can-0.070
ars-0.068
Token ever
Token ID1683
Feature activation+3.43e-02

Pos logit contributions
ig+0.026
anda+0.025
ite+0.024
 slams+0.023
 slips+0.023
Neg logit contributions
 Son-0.031
 restaurants-0.031
 veggies-0.031
kids-0.030
 vegetables-0.030
Token tasted
Token ID29187
Feature activation-1.77e-01

Pos logit contributions
 can+0.042
anda+0.037
ig+0.036
 slips+0.035
ite+0.033
Neg logit contributions
sets-0.070
 restaurants-0.070
 Son-0.069
kids-0.064
 veggies-0.064
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
 restaurants+0.004
sets+0.004
 Son+0.004
 veggies+0.003
 Thanksgiving+0.003
Neg logit contributions
anda-0.003
 can-0.003
 slips-0.002
ite-0.002
ig-0.002
Token it
Token ID340
Feature activation-3.63e-02

Pos logit contributions
sets+0.023
 restaurants+0.023
 Son+0.022
kids+0.021
 veggies+0.020
Neg logit contributions
 can-0.019
anda-0.017
ig-0.016
 slips-0.016
 does-0.016
Token was
Token ID373
Feature activation+2.58e-02

Pos logit contributions
sets+0.077
 restaurants+0.076
 Son+0.075
kids+0.070
 veggies+0.070
Neg logit contributions
 can-0.042
anda-0.036
ig-0.035
 slips-0.034
 does-0.033
Token the
Token ID262
Feature activation+8.39e-03

Pos logit contributions
 can+0.038
anda+0.033
ig+0.033
 slips+0.032
 does+0.032
Neg logit contributions
sets-0.047
 restaurants-0.046
 Son-0.045
kids-0.042
 veggies-0.041
Token y
Token ID331
Feature activation+1.85e-03

Pos logit contributions
 can+0.014
anda+0.013
ig+0.013
 slips+0.013
ite+0.012
Neg logit contributions
sets-0.013
 restaurants-0.013
 Son-0.013
 veggies-0.012
kids-0.012
Tokenumm
Token ID13929
Feature activation-2.89e-01

Pos logit contributions
 can+0.004
ig+0.004
anda+0.004
uddle+0.003
 high+0.003
Neg logit contributions
 restaurants-0.002
sets-0.002
 Son-0.002
Father-0.002
kids-0.002
Tokeniest
Token ID6386
Feature activation-1.04e-01

Pos logit contributions
 restaurants+0.452
sets+0.451
 Son+0.433
kids+0.401
 veggies+0.389
Neg logit contributions
 can-0.500
ig-0.451
anda-0.445
 slips-0.426
 does-0.423
Token dinner
Token ID8073
Feature activation-1.83e-01

Pos logit contributions
sets+0.141
 restaurants+0.134
 Son+0.128
kids+0.117
Father+0.116
Neg logit contributions
 can-0.208
ig-0.183
anda-0.181
 high-0.176
 else-0.176
Token they
Token ID484
Feature activation-5.52e-02

Pos logit contributions
sets+0.323
 restaurants+0.321
 Son+0.314
 veggies+0.290
kids+0.288
Neg logit contributions
anda-0.259
 can-0.258
ig-0.247
 slips-0.242
ite-0.234
Token ever
Token ID1683
Feature activation+3.47e-02

Pos logit contributions
 restaurants+0.107
 Son+0.105
sets+0.103
 veggies+0.100
kids+0.098
Neg logit contributions
anda-0.070
ig-0.069
 slips-0.065
ite-0.062
 can-0.062
Token had
Token ID550
Feature activation+6.71e-02

Pos logit contributions
 can+0.045
anda+0.038
ig+0.037
 slips+0.037
 does+0.036
Neg logit contributions
sets-0.070
 restaurants-0.069
 Son-0.068
kids-0.063
 veggies-0.062
Token,
Token ID11
Feature activation+1.35e-02

Pos logit contributions
 restaurants+0.002
sets+0.002
 Son+0.002
 veggies+0.002
 Thanksgiving+0.002
Neg logit contributions
anda-0.001
 can-0.001
ig-0.001
 slips-0.001
ite-0.001
Token picking
Token ID10868
Feature activation-1.35e-01

Pos logit contributions
 can+0.024
 high+0.021
anda+0.021
 slips+0.021
ig+0.020
Neg logit contributions
sets-0.021
 restaurants-0.020
 Son-0.020
 avail-0.018
kids-0.018
Token out
Token ID503
Feature activation-2.60e-02

Pos logit contributions
sets+0.210
 restaurants+0.207
 Son+0.203
kids+0.187
 veggies+0.181
Neg logit contributions
 can-0.231
anda-0.210
ig-0.206
 slips-0.201
 does-0.198
Token the
Token ID262
Feature activation+3.31e-03

Pos logit contributions
sets+0.037
 restaurants+0.037
 Son+0.036
kids+0.033
 veggies+0.032
Neg logit contributions
 can-0.048
anda-0.044
ig-0.043
 slips-0.042
 does-0.041
Token y
Token ID331
Feature activation+3.04e-03

Pos logit contributions
 can+0.005
anda+0.005
ig+0.005
 slips+0.005
ite+0.004
Neg logit contributions
 restaurants-0.006
sets-0.006
 Son-0.006
 veggies-0.005
kids-0.005
Tokenumm
Token ID13929
Feature activation-2.88e-01

Pos logit contributions
 can+0.007
ig+0.006
anda+0.006
 high+0.006
 slips+0.006
Neg logit contributions
sets-0.003
 Son-0.003
 restaurants-0.003
Father-0.003
 Father-0.003
Tokeniest
Token ID6386
Feature activation-1.04e-01

Pos logit contributions
sets+0.426
 restaurants+0.420
 Son+0.414
kids+0.376
 Father+0.367
Neg logit contributions
 can-0.519
anda-0.473
ig-0.472
 slips-0.454
 does-0.447
Token treats
Token ID18432
Feature activation-6.45e-02

Pos logit contributions
sets+0.188
 restaurants+0.184
 Son+0.183
kids+0.167
 Father+0.165
Neg logit contributions
 can-0.156
anda-0.138
ig-0.135
 does-0.129
 slips-0.129
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
sets+0.108
 restaurants+0.106
 Son+0.105
 veggies+0.098
 vegetables+0.097
Neg logit contributions
anda-0.096
 can-0.093
ig-0.093
 slips-0.090
ite-0.085
Token sharing
Token ID7373
Feature activation-1.01e-01

Pos logit contributions
sets+0.021
 restaurants+0.021
 Son+0.021
kids+0.019
 avail+0.018
Neg logit contributions
 can-0.022
anda-0.020
 does-0.019
ig-0.019
 slips-0.019
Token them
Token ID606
Feature activation-6.58e-02

Pos logit contributions
sets+0.157
 restaurants+0.157
 Son+0.151
 veggies+0.139
kids+0.137
Neg logit contributions
 can-0.171
anda-0.161
ig-0.158
 slips-0.155
ite-0.150
Token Another
Token ID6023
Feature activation-2.53e-02

Pos logit contributions
 can+0.034
ig+0.033
anda+0.033
ite+0.031
 slips+0.031
Neg logit contributions
 Son-0.030
 restaurants-0.030
sets-0.028
 Daughter-0.026
 Father-0.026
Token truck
Token ID7779
Feature activation+1.12e-01

Pos logit contributions
 restaurants+0.044
 Son+0.042
sets+0.041
kids+0.040
 foods+0.038
Neg logit contributions
 can-0.037
ig-0.036
anda-0.035
 slips-0.034
ite-0.033
Token came
Token ID1625
Feature activation+2.79e-02

Pos logit contributions
ig+0.222
ite+0.192
ixie+0.192
ang+0.191
anda+0.191
Neg logit contributions
 miners-0.113
sets-0.111
 restaurants-0.111
kids-0.111
 foods-0.108
Token.
Token ID13
Feature activation+2.05e-02

Pos logit contributions
anda+0.039
 can+0.038
ig+0.038
ite+0.038
 slips+0.036
Neg logit contributions
sets-0.047
 Son-0.043
 restaurants-0.041
urg-0.041
ices-0.039
Token It
Token ID632
Feature activation+1.98e-02

Pos logit contributions
 can+0.035
ig+0.034
anda+0.033
ite+0.031
 slips+0.031
Neg logit contributions
 Son-0.031
 restaurants-0.031
sets-0.029
 Father-0.027
 Daughter-0.027
Token did
Token ID750
Feature activation+1.65e-01

Pos logit contributions
ig+0.035
ite+0.032
och+0.031
ocus+0.031
anda+0.030
Neg logit contributions
sets-0.024
 restaurants-0.024
 youngest-0.023
 Son-0.023
 hamb-0.022
Token not
Token ID407
Feature activation-1.05e-02

Pos logit contributions
ite+0.349
anda+0.343
ig+0.343
 can+0.328
itty+0.323
Neg logit contributions
 restaurants-0.175
sets-0.160
 Son-0.156
 vegetables-0.145
 foods-0.144
Token see
Token ID766
Feature activation-1.60e-02

Pos logit contributions
 restaurants+0.014
sets+0.013
 Son+0.013
 avail+0.012
 foods+0.012
Neg logit contributions
anda-0.019
ig-0.019
ite-0.018
uddle-0.017
 slips-0.017
Token crab
Token ID32202
Feature activation+1.25e-02

Pos logit contributions
 restaurants+0.023
 Son+0.023
sets+0.022
kids+0.020
 Father+0.020
Neg logit contributions
 can-0.028
ig-0.027
ite-0.026
anda-0.026
 slips-0.025
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
 can+0.018
anda+0.016
ig+0.016
ite+0.016
 slips+0.016
Neg logit contributions
 restaurants-0.022
sets-0.022
 Son-0.019
urg-0.019
 foods-0.019
Token It
Token ID632
Feature activation+2.01e-02

Pos logit contributions
 can+0.033
anda+0.031
ig+0.030
 slips+0.029
ite+0.028
Neg logit contributions
 restaurants-0.034
 Son-0.034
sets-0.033
kids-0.030
 veggies-0.030
Token store
Token ID3650
Feature activation+1.71e-03

Pos logit contributions
anda+0.007
ig+0.007
 can+0.007
 slips+0.007
ocus+0.006
Neg logit contributions
 restaurants-0.008
 Son-0.008
sets-0.008
kids-0.007
 veggies-0.007
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
ig+0.003
 slips+0.002
br+0.002
anda+0.002
 dod+0.002
Neg logit contributions
 restaurants-0.003
sets-0.003
 decisions-0.003
 Son-0.003
kids-0.003
Token She
Token ID1375
Feature activation+2.19e-02

Pos logit contributions
 can+0.014
anda+0.012
 slips+0.012
 does+0.012
ig+0.011
Neg logit contributions
sets-0.023
 restaurants-0.023
 Son-0.022
 veggies-0.021
kids-0.021
Token says
Token ID1139
Feature activation+4.10e-03

Pos logit contributions
ig+0.033
br+0.032
ite+0.032
anda+0.031
suit+0.031
Neg logit contributions
 Son-0.032
 restaurants-0.032
 vegetables-0.030
kids-0.029
 veggies-0.029
Token he
Token ID339
Feature activation+1.80e-02

Pos logit contributions
anda+0.007
ig+0.007
 can+0.007
ite+0.007
 slips+0.007
Neg logit contributions
 restaurants-0.006
 Son-0.006
sets-0.006
 Father-0.005
 Thanksgiving-0.005
Token can
Token ID460
Feature activation+2.17e-01

Pos logit contributions
ig+0.013
 high+0.012
br+0.012
anda+0.011
hog+0.010
Neg logit contributions
kids-0.039
 restaurants-0.039
 Son-0.039
sets-0.038
 veggies-0.037
Token bring
Token ID2222
Feature activation-5.92e-02

Pos logit contributions
 can+0.368
anda+0.358
ig+0.351
 slips+0.341
 does+0.329
Neg logit contributions
 restaurants-0.316
 Son-0.315
sets-0.315
kids-0.280
 Daughter-0.277
Token a
Token ID257
Feature activation+6.65e-03

Pos logit contributions
 restaurants+0.084
sets+0.083
 Son+0.082
kids+0.074
 veggies+0.073
Neg logit contributions
 can-0.107
anda-0.102
ig-0.100
 slips-0.098
ite-0.096
Token small
Token ID1402
Feature activation+2.74e-02

Pos logit contributions
anda+0.011
ig+0.011
 slips+0.010
 does+0.010
ocus+0.010
Neg logit contributions
 Son-0.010
 restaurants-0.010
 couple-0.009
sets-0.008
 Father-0.008
Token toy
Token ID13373
Feature activation+3.74e-03

Pos logit contributions
anda+0.039
 slips+0.038
ows+0.037
 does+0.037
 can+0.037
Neg logit contributions
 restaurants-0.046
 Son-0.044
sets-0.042
 foods-0.041
 couple-0.041
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
anda+0.006
ig+0.006
 slips+0.005
ows+0.005
ighed+0.005
Neg logit contributions
 restaurants-0.005
 Son-0.005
 vegetables-0.005
 son-0.005
sets-0.005
Token her
Token ID607
Feature activation+1.44e-02

Pos logit contributions
 can+0.009
anda+0.009
ig+0.008
 slips+0.008
ite+0.008
Neg logit contributions
sets-0.007
 Son-0.007
 restaurants-0.007
 Daughter-0.007
 veggies-0.007
Token wounds
Token ID14129
Feature activation+3.87e-04

Pos logit contributions
 can+0.025
anda+0.023
ig+0.022
 slips+0.022
 does+0.021
Neg logit contributions
sets-0.023
 restaurants-0.023
 Son-0.022
kids-0.020
 veggies-0.020
Token.
Token ID13
Feature activation+1.82e-02

Pos logit contributions
 can+0.000
anda+0.000
 slips+0.000
ig+0.000
ite+0.000
Neg logit contributions
sets-0.001
 restaurants-0.001
 Son-0.001
kids-0.001
 veggies-0.001
Token She
Token ID1375
Feature activation+2.63e-02

Pos logit contributions
 can+0.027
 slips+0.022
anda+0.022
 does+0.022
 high+0.021
Neg logit contributions
sets-0.035
 restaurants-0.034
 Son-0.033
kids-0.032
 avail-0.031
Token scared
Token ID12008
Feature activation-1.72e-04

Pos logit contributions
 can+0.037
ig+0.036
anda+0.035
ite+0.035
 slips+0.034
Neg logit contributions
 Son-0.045
sets-0.044
 restaurants-0.044
kids-0.041
 veggies-0.041
Token away
Token ID1497
Feature activation+1.30e-01

Pos logit contributions
sets+0.000
 restaurants+0.000
 Son+0.000
kids+0.000
 veggies+0.000
Neg logit contributions
 can-0.000
anda-0.000
ig-0.000
 slips-0.000
ite-0.000
Token the
Token ID262
Feature activation+9.48e-03

Pos logit contributions
 can+0.175
anda+0.164
ite+0.161
ig+0.160
 slips+0.155
Neg logit contributions
sets-0.244
 Son-0.238
 restaurants-0.238
 Father-0.217
 Daughter-0.217
Token bees
Token ID17002
Feature activation-1.29e-02

Pos logit contributions
 can+0.019
anda+0.017
ig+0.017
 slips+0.017
ite+0.017
Neg logit contributions
 restaurants-0.012
sets-0.012
 Son-0.012
kids-0.011
 veggies-0.010
Token with
Token ID351
Feature activation-4.50e-02

Pos logit contributions
sets+0.020
 restaurants+0.019
 vegetables+0.018
 Son+0.018
 decisions+0.017
Neg logit contributions
anda-0.019
ite-0.018
ig-0.017
 can-0.017
suit-0.017
Token a
Token ID257
Feature activation+1.26e-02

Pos logit contributions
 restaurants+0.067
sets+0.067
 Son+0.066
 veggies+0.059
kids+0.059
Neg logit contributions
 can-0.078
anda-0.074
ig-0.073
 slips-0.071
ite-0.070
Token loud
Token ID7812
Feature activation+6.86e-02

Pos logit contributions
anda+0.019
ig+0.018
 can+0.018
ite+0.018
ocus+0.017
Neg logit contributions
 Son-0.019
 restaurants-0.019
sets-0.018
 couple-0.018
 Thanksgiving-0.018
Token Every
Token ID3887
Feature activation-1.01e-01

Pos logit contributions
 can+0.007
 high+0.005
 does+0.005
anda+0.005
 slips+0.005
Neg logit contributions
sets-0.009
 restaurants-0.009
 Son-0.008
kids-0.008
 veggies-0.008
Token night
Token ID1755
Feature activation+5.41e-03

Pos logit contributions
 restaurants+0.134
sets+0.131
 Son+0.130
 Thanksgiving+0.122
kids+0.117
Neg logit contributions
anda-0.186
 can-0.185
 dod-0.171
 slips-0.171
ig-0.170
Token the
Token ID262
Feature activation-1.31e-03

Pos logit contributions
 can+0.007
anda+0.006
ig+0.006
 slips+0.006
 does+0.006
Neg logit contributions
sets-0.011
 restaurants-0.011
 Son-0.011
kids-0.010
 veggies-0.010
Token old
Token ID1468
Feature activation+6.08e-02

Pos logit contributions
sets+0.002
 restaurants+0.002
 Son+0.002
urg+0.002
kids+0.002
Neg logit contributions
 can-0.002
 high-0.002
 does-0.002
anda-0.002
 slips-0.002
Token man
Token ID582
Feature activation+4.94e-02

Pos logit contributions
 can+0.074
 high+0.061
 does+0.060
 did+0.058
 slips+0.057
Neg logit contributions
sets-0.126
 restaurants-0.123
 Son-0.122
urg-0.119
umm-0.118
Token would
Token ID561
Feature activation+1.12e-01

Pos logit contributions
anda+0.059
ig+0.056
ite+0.054
ider+0.054
suit+0.054
Neg logit contributions
sets-0.091
 restaurants-0.090
 veggies-0.087
 vegetables-0.086
 Son-0.085
Token laugh
Token ID6487
Feature activation+2.45e-02

Pos logit contributions
 can+0.185
anda+0.185
ig+0.179
ite+0.173
 slips+0.170
Neg logit contributions
 restaurants-0.171
sets-0.167
 Son-0.165
 veggies-0.149
kids-0.146
Token while
Token ID981
Feature activation-8.28e-02

Pos logit contributions
anda+0.039
ig+0.036
ite+0.036
 slips+0.036
 can+0.034
Neg logit contributions
sets-0.043
 restaurants-0.039
 Son-0.039
 Thanksgiving-0.037
 veggies-0.037
Token he
Token ID339
Feature activation+1.37e-02

Pos logit contributions
sets+0.143
 restaurants+0.140
 Son+0.139
kids+0.131
 veggies+0.126
Neg logit contributions
 can-0.128
ig-0.109
 high-0.108
 slips-0.108
anda-0.108
Token was
Token ID373
Feature activation+1.97e-02

Pos logit contributions
anda+0.020
 can+0.019
ig+0.019
ite+0.019
 high+0.018
Neg logit contributions
sets-0.022
 restaurants-0.021
 Son-0.020
 veggies-0.020
 foods-0.019
Token alone
Token ID3436
Feature activation+1.78e-01

Pos logit contributions
 can+0.036
ite+0.035
anda+0.035
ig+0.034
 slips+0.032
Neg logit contributions
sets-0.028
 restaurants-0.028
 Son-0.025
 Thanksgiving-0.024
 veggies-0.023
Token
Token ID198
Feature activation+3.70e-03

Pos logit contributions
sets+0.101
 restaurants+0.099
 Son+0.098
kids+0.089
 veggies+0.088
Neg logit contributions
 can-0.093
anda-0.085
ig-0.085
 slips-0.082
 does-0.080
Token
Token ID198
Feature activation+3.92e-03

Pos logit contributions
 can+0.005
anda+0.004
ig+0.004
 slips+0.004
 high+0.004
Neg logit contributions
sets-0.008
 restaurants-0.007
 Son-0.007
 veggies-0.007
 avail-0.007
TokenIt
Token ID1026
Feature activation-4.35e-03

Pos logit contributions
 can+0.005
anda+0.005
 slips+0.005
ig+0.005
ite+0.005
Neg logit contributions
sets-0.007
 restaurants-0.007
 Son-0.007
 veggies-0.007
 avail-0.007
Token was
Token ID373
Feature activation+2.53e-02

Pos logit contributions
 restaurants+0.010
sets+0.010
 Son+0.010
kids+0.009
 veggies+0.009
Neg logit contributions
 can-0.005
anda-0.004
ig-0.004
 does-0.004
 slips-0.004
Token a
Token ID257
Feature activation+5.89e-03

Pos logit contributions
 can+0.042
anda+0.041
ig+0.040
 slips+0.039
ite+0.039
Neg logit contributions
sets-0.040
 Son-0.040
 restaurants-0.039
 Thanksgiving-0.035
 veggies-0.035
Token very
Token ID845
Feature activation+1.17e-01

Pos logit contributions
ows+0.011
anda+0.010
ig+0.010
 slips+0.010
ocus+0.010
Neg logit contributions
 Son-0.009
 Thanksgiving-0.008
 couple-0.008
 Father-0.008
 restaurants-0.008
Token special
Token ID2041
Feature activation-1.76e-03

Pos logit contributions
ig+0.209
 can+0.204
ite+0.201
 slips+0.195
anda+0.190
Neg logit contributions
 Son-0.163
 Father-0.149
 Thanksgiving-0.149
 restaurants-0.148
 couple-0.143
Token moment
Token ID2589
Feature activation-9.25e-02

Pos logit contributions
 restaurants+0.002
 Son+0.002
sets+0.002
kids+0.002
 couple+0.002
Neg logit contributions
 can-0.003
 slips-0.003
anda-0.003
 dod-0.003
ig-0.003
Token for
Token ID329
Feature activation-8.72e-03

Pos logit contributions
sets+0.179
 Son+0.178
 restaurants+0.171
 Thanksgiving+0.164
 Father+0.163
Neg logit contributions
anda-0.117
 can-0.114
 slips-0.110
ig-0.103
 does-0.101
Token them
Token ID606
Feature activation-6.37e-02

Pos logit contributions
 restaurants+0.013
sets+0.013
 Son+0.013
kids+0.012
 veggies+0.012
Neg logit contributions
 can-0.015
anda-0.015
ig-0.014
 slips-0.014
ite-0.014
Token both
Token ID1111
Feature activation-4.40e-02

Pos logit contributions
sets+0.108
 restaurants+0.102
 Son+0.102
 Father+0.095
 vegetables+0.093
Neg logit contributions
anda-0.089
ig-0.089
br-0.088
 slips-0.086
 dod-0.086
Token knocked
Token ID13642
Feature activation-3.14e-02

Pos logit contributions
anda+0.070
ig+0.069
ite+0.069
 can+0.068
 slips+0.067
Neg logit contributions
 Son-0.071
 restaurants-0.069
kids-0.066
 veggies-0.065
sets-0.065
Token on
Token ID319
Feature activation+7.99e-02

Pos logit contributions
 restaurants+0.063
sets+0.063
 Son+0.062
kids+0.057
 veggies+0.057
Neg logit contributions
 can-0.039
anda-0.036
ig-0.034
 slips-0.033
 does-0.032
Token the
Token ID262
Feature activation+2.65e-03

Pos logit contributions
 can+0.109
 high+0.096
anda+0.090
ig+0.089
 does+0.089
Neg logit contributions
sets-0.153
 restaurants-0.145
 Son-0.144
kids-0.138
urg-0.135
Token door
Token ID3420
Feature activation+3.57e-02

Pos logit contributions
ig+0.005
 can+0.005
anda+0.005
ite+0.005
 slips+0.004
Neg logit contributions
 Son-0.004
 restaurants-0.004
sets-0.004
 vegetables-0.003
 veggies-0.003
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
anda+0.048
 can+0.047
ig+0.046
 dod+0.044
 slips+0.043
Neg logit contributions
sets-0.066
 restaurants-0.064
 Son-0.063
 Father-0.058
 Daughter-0.057
Token Jane
Token ID12091
Feature activation-2.61e-02

Pos logit contributions
 can+0.018
 high+0.015
 does+0.015
 slips+0.015
anda+0.014
Neg logit contributions
sets-0.019
 restaurants-0.018
 Son-0.017
kids-0.017
 avail-0.017
Token said
Token ID531
Feature activation-1.78e-02

Pos logit contributions
 Son+0.044
sets+0.043
 restaurants+0.043
kids+0.040
 Father+0.040
Neg logit contributions
 can-0.036
ig-0.036
anda-0.035
ite-0.034
 slips-0.034
Token "
Token ID366
Feature activation+1.41e-02

Pos logit contributions
 Son+0.037
sets+0.036
 restaurants+0.036
 Father+0.034
 veggies+0.034
Neg logit contributions
anda-0.019
ig-0.019
 can-0.019
ite-0.018
 slips-0.018
Tokencome
Token ID2958
Feature activation+1.16e-02

Pos logit contributions
 can+0.021
ite+0.018
 high+0.017
 else+0.017
 did+0.017
Neg logit contributions
 restaurants-0.026
sets-0.025
urg-0.024
 Son-0.024
 decisions-0.023
Token in
Token ID287
Feature activation+1.17e-01

Pos logit contributions
 slips+0.014
anda+0.014
 can+0.013
ig+0.013
 else+0.012
Neg logit contributions
 restaurants-0.022
sets-0.022
 Son-0.022
 Father-0.020
 Thanksgiving-0.020
Token!".
Token ID48220
Feature activation-1.52e-01

Pos logit contributions
 can+0.135
anda+0.119
ig+0.115
 slips+0.111
 does+0.107
Neg logit contributions
sets-0.248
 restaurants-0.247
 Son-0.243
kids-0.227
 veggies-0.225
Token and
Token ID290
Feature activation-6.24e-03

Pos logit contributions
anda+0.162
 dod+0.147
 can+0.142
br+0.140
 slips+0.136
Neg logit contributions
 restaurants-0.158
sets-0.158
 vegetables-0.142
 Son-0.142
 packing-0.135
Token saw
Token ID2497
Feature activation+1.48e-02

Pos logit contributions
 restaurants+0.011
sets+0.010
 Son+0.010
 vegetables+0.009
 veggies+0.009
Neg logit contributions
anda-0.010
 can-0.009
ig-0.009
ite-0.009
br-0.008
Token a
Token ID257
Feature activation+1.12e-02

Pos logit contributions
ig+0.026
 can+0.026
anda+0.025
ite+0.025
 slips+0.024
Neg logit contributions
 restaurants-0.021
sets-0.021
 Son-0.020
 vegetables-0.019
 veggies-0.018
Token blue
Token ID4171
Feature activation+1.85e-02

Pos logit contributions
ig+0.019
anda+0.018
ite+0.018
ows+0.017
ars+0.017
Neg logit contributions
 restaurants-0.016
 couple-0.015
 Son-0.015
sets-0.015
 Daughter-0.014
Token dress
Token ID6576
Feature activation+4.05e-02

Pos logit contributions
anda+0.026
 can+0.025
 slips+0.025
 else+0.025
ig+0.024
Neg logit contributions
sets-0.032
 restaurants-0.032
 Son-0.030
 foods-0.029
 veggies-0.028
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
 can+0.056
anda+0.055
 dod+0.054
 slips+0.053
ig+0.052
Neg logit contributions
sets-0.072
 restaurants-0.065
 Son-0.060
burn-0.059
reating-0.058
Token She
Token ID1375
Feature activation+2.74e-02

Pos logit contributions
ig+0.045
anda+0.044
ite+0.041
 can+0.040
ars+0.039
Neg logit contributions
 Son-0.025
 restaurants-0.024
sets-0.024
 Daughter-0.022
 Father-0.022
Token pulled
Token ID5954
Feature activation+1.43e-02

Pos logit contributions
ig+0.046
ite+0.045
anda+0.043
arl+0.043
 can+0.042
Neg logit contributions
 Son-0.038
 restaurants-0.036
 veggies-0.034
kids-0.034
 vegetables-0.034
Token it
Token ID340
Feature activation-3.39e-02

Pos logit contributions
anda+0.027
ider+0.025
ite+0.025
ig+0.025
uddle+0.024
Neg logit contributions
 Son-0.018
sets-0.017
 restaurants-0.017
 vegetables-0.016
 Daughter-0.016
Token and
Token ID290
Feature activation-5.63e-03

Pos logit contributions
sets+0.037
 restaurants+0.030
 one+0.030
 vegetables+0.030
burn+0.030
Neg logit contributions
anda-0.066
ebra-0.060
ite-0.059
ang-0.059
ig-0.059
Token found
Token ID1043
Feature activation-2.54e-02

Pos logit contributions
 restaurants+0.008
sets+0.008
 Son+0.008
 vegetables+0.008
 veggies+0.008
Neg logit contributions
anda-0.010
ig-0.009
ite-0.009
 can-0.008
itty-0.008
Token him
Token ID683
Feature activation-3.03e-02

Pos logit contributions
sets+0.080
 restaurants+0.079
 Son+0.078
kids+0.069
 veggies+0.068
Neg logit contributions
 can-0.114
anda-0.105
ig-0.103
 slips-0.102
ite-0.101
Token that
Token ID326
Feature activation-3.98e-02

Pos logit contributions
sets+0.061
 restaurants+0.060
 Son+0.059
kids+0.055
 veggies+0.055
Neg logit contributions
 can-0.036
anda-0.033
 slips-0.032
ite-0.032
ig-0.032
Token it
Token ID340
Feature activation-3.77e-02

Pos logit contributions
 Son+0.071
sets+0.070
 restaurants+0.070
kids+0.066
urg+0.064
Neg logit contributions
 can-0.061
 does-0.050
ig-0.050
anda-0.050
 slips-0.049
Token would
Token ID561
Feature activation+1.13e-01

Pos logit contributions
sets+0.067
 Son+0.063
 restaurants+0.061
 foods+0.058
 vegetables+0.057
Neg logit contributions
ite-0.050
ig-0.049
anda-0.049
 slips-0.047
 can-0.046
Token be
Token ID307
Feature activation+5.87e-03

Pos logit contributions
 can+0.164
anda+0.150
ig+0.146
 slips+0.143
ite+0.139
Neg logit contributions
sets-0.205
 restaurants-0.203
 Son-0.200
kids-0.184
 veggies-0.182
Token fun
Token ID1257
Feature activation+8.75e-02

Pos logit contributions
 can+0.009
anda+0.009
ite+0.008
ig+0.008
 slips+0.008
Neg logit contributions
 restaurants-0.010
sets-0.010
 Son-0.010
 Thanksgiving-0.009
 veggies-0.009
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
anda+0.121
 can+0.121
 slips+0.116
ig+0.116
ite+0.113
Neg logit contributions
 restaurants-0.154
sets-0.152
 Son-0.144
kids-0.138
 foods-0.137
Token As
Token ID1081
Feature activation-1.15e-01

Pos logit contributions
 can+0.024
anda+0.022
ig+0.021
 slips+0.021
 does+0.020
Neg logit contributions
 restaurants-0.023
sets-0.023
 Son-0.022
kids-0.021
 veggies-0.020
Token they
Token ID484
Feature activation-5.38e-02

Pos logit contributions
sets+0.189
 restaurants+0.187
 Son+0.183
kids+0.168
 veggies+0.166
Neg logit contributions
 can-0.190
anda-0.173
ig-0.169
 slips-0.166
 does-0.162
Token were
Token ID547
Feature activation-3.29e-02

Pos logit contributions
 restaurants+0.090
sets+0.089
 Son+0.088
kids+0.082
 Daughter+0.079
Neg logit contributions
anda-0.076
 can-0.076
 slips-0.075
ig-0.071
 does-0.069
Token getting
Token ID1972
Feature activation-1.30e-01

Pos logit contributions
 restaurants+0.052
sets+0.051
 Son+0.050
kids+0.046
 Thanksgiving+0.045
Neg logit contributions
 can-0.056
anda-0.051
ig-0.050
ite-0.050
 slips-0.049
Token,
Token ID11
Feature activation+1.19e-02

Pos logit contributions
sets+0.003
 restaurants+0.003
 Son+0.003
 Thanksgiving+0.003
 vegetables+0.003
Neg logit contributions
anda-0.003
 can-0.003
ite-0.002
 slips-0.002
ig-0.002
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
anda+0.020
 can+0.020
ig+0.019
ite+0.019
 slips+0.018
Neg logit contributions
sets-0.018
 restaurants-0.018
 Son-0.017
 veggies-0.016
 vegetables-0.015
Token they
Token ID484
Feature activation-5.83e-02

Pos logit contributions
sets+0.024
 restaurants+0.024
 Son+0.023
kids+0.021
 veggies+0.021
Neg logit contributions
 can-0.026
anda-0.024
ig-0.023
 slips-0.023
ite-0.022
Token all
Token ID477
Feature activation-9.68e-02

Pos logit contributions
sets+0.112
 restaurants+0.110
 Son+0.107
 veggies+0.100
 vegetables+0.097
Neg logit contributions
anda-0.074
 can-0.071
ig-0.069
 slips-0.067
ite-0.064
Token tried
Token ID3088
Feature activation+1.26e-02

Pos logit contributions
sets+0.163
 restaurants+0.158
 Son+0.150
kids+0.138
 veggies+0.138
Neg logit contributions
anda-0.142
ider-0.132
 can-0.130
 slips-0.130
ig-0.130
Token their
Token ID511
Feature activation-4.25e-02

Pos logit contributions
 can+0.016
anda+0.014
ig+0.014
 slips+0.013
 does+0.013
Neg logit contributions
sets-0.025
 restaurants-0.025
 Son-0.024
kids-0.023
 veggies-0.022
Token best
Token ID1266
Feature activation+1.04e-02

Pos logit contributions
sets+0.069
 restaurants+0.069
 Son+0.066
 veggies+0.062
 foods+0.061
Neg logit contributions
 can-0.068
anda-0.064
ite-0.061
 slips-0.061
ig-0.061
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
anda+0.014
 can+0.014
 slips+0.013
ite+0.013
ig+0.012
Neg logit contributions
sets-0.019
 restaurants-0.018
 Son-0.018
 foods-0.017
 veggies-0.017
Token Whenever
Token ID21326
Feature activation-1.03e-01

Pos logit contributions
 can+0.020
anda+0.018
ig+0.017
 slips+0.017
 does+0.017
Neg logit contributions
sets-0.015
 restaurants-0.015
 Son-0.015
kids-0.013
 veggies-0.013
Token Sally
Token ID25737
Feature activation-6.71e-02

Pos logit contributions
sets+0.168
 restaurants+0.166
 Son+0.163
kids+0.150
 veggies+0.147
Neg logit contributions
 can-0.169
anda-0.152
ig-0.149
 slips-0.147
 does-0.144
Token missed
Token ID6825
Feature activation-8.71e-03

Pos logit contributions
sets+0.115
 restaurants+0.114
 Son+0.106
 veggies+0.102
 vegetables+0.100
Neg logit contributions
anda-0.093
ig-0.090
 can-0.087
ider-0.086
ows-0.086
Token relieved
Token ID22598
Feature activation+2.38e-03

Pos logit contributions
sets+0.065
 restaurants+0.064
 Son+0.064
 Sonia+0.056
 Father+0.056
Neg logit contributions
 can-0.074
ig-0.072
anda-0.070
 slips-0.066
 does-0.064
Token that
Token ID326
Feature activation-4.20e-02

Pos logit contributions
anda+0.003
 can+0.003
ig+0.003
ite+0.003
 slips+0.003
Neg logit contributions
sets-0.004
 Son-0.004
 restaurants-0.004
 Father-0.004
 Kevin-0.004
Token she
Token ID673
Feature activation+2.71e-02

Pos logit contributions
sets+0.079
 restaurants+0.078
 Son+0.077
kids+0.074
urg+0.072
Neg logit contributions
 can-0.059
 high-0.050
 slips-0.049
 does-0.048
ig-0.048
Token gave
Token ID2921
Feature activation-2.60e-02

Pos logit contributions
anda+0.040
ig+0.039
ite+0.039
arl+0.039
 slips+0.036
Neg logit contributions
sets-0.041
 Son-0.040
 couple-0.038
 Kevin-0.036
lling-0.036
Token a
Token ID257
Feature activation+5.00e-03

Pos logit contributions
 restaurants+0.043
sets+0.043
 Son+0.041
kids+0.039
urg+0.038
Neg logit contributions
 can-0.042
ig-0.037
anda-0.036
 high-0.036
 slips-0.035
Token big
Token ID1263
Feature activation+3.56e-02

Pos logit contributions
 can+0.007
anda+0.007
ig+0.007
 does+0.006
 slips+0.006
Neg logit contributions
 Son-0.009
 restaurants-0.008
sets-0.008
 couple-0.008
 Father-0.008
Token smile
Token ID8212
Feature activation+4.11e-02

Pos logit contributions
 else+0.055
anda+0.054
 can+0.051
ows+0.051
as+0.051
Neg logit contributions
 Son-0.053
 couple-0.050
 meal-0.049
 Daughter-0.048
 Father-0.048
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
 can+0.048
anda+0.048
 slips+0.044
ig+0.043
 dod+0.042
Neg logit contributions
sets-0.084
 Son-0.079
 restaurants-0.078
 Thanksgiving-0.074
 couple-0.073
Token
Token ID198
Feature activation+6.75e-03

Pos logit contributions
 can+0.018
anda+0.016
ig+0.016
 slips+0.015
 does+0.015
Neg logit contributions
sets-0.021
 restaurants-0.021
 Son-0.020
kids-0.019
 veggies-0.019
Token
Token ID198
Feature activation+7.08e-03

Pos logit contributions
 can+0.010
anda+0.009
ig+0.009
 slips+0.008
 does+0.008
Neg logit contributions
sets-0.012
 restaurants-0.012
 Son-0.012
kids-0.011
 veggies-0.011
TokenThe
Token ID464
Feature activation-3.58e-02

Pos logit contributions
 can+0.011
anda+0.010
ig+0.010
 slips+0.010
 else+0.009
Neg logit contributions
sets-0.012
 restaurants-0.012
 Son-0.012
kids-0.011
 veggies-0.011
Tokenâ
Token ID22940
Feature activation+2.06e-03

Pos logit contributions
 can+0.011
anda+0.010
ig+0.010
 slips+0.010
ite+0.010
Neg logit contributions
sets-0.008
 restaurants-0.007
 Son-0.007
kids-0.006
 Daughter-0.006
Token€
Token ID26391
Feature activation+5.07e-02

Pos logit contributions
 can+0.004
anda+0.004
 else+0.003
ig+0.003
 slips+0.003
Neg logit contributions
 Son-0.003
sets-0.003
 restaurants-0.003
kids-0.003
 Daughter-0.002
Tokenâ„¢
Token ID8151
Feature activation-9.31e-02

Pos logit contributions
 can+0.086
anda+0.080
ig+0.079
 slips+0.076
 does+0.074
Neg logit contributions
sets-0.078
 restaurants-0.077
 Son-0.075
kids-0.069
 Father-0.066
Tokent
Token ID83
Feature activation-1.07e-01

Pos logit contributions
 Son+0.129
sets+0.129
 restaurants+0.124
 Daughter+0.118
kids+0.115
Neg logit contributions
 can-0.156
 wand-0.153
ig-0.147
 slips-0.141
 does-0.141
Token lie
Token ID6486
Feature activation+3.41e-02

Pos logit contributions
sets+0.188
 restaurants+0.187
 Son+0.184
kids+0.169
 veggies+0.166
Neg logit contributions
 can-0.158
anda-0.145
ig-0.141
 slips-0.138
ite-0.134
Token!
Token ID0
Feature activation-6.37e-02

Pos logit contributions
anda+0.041
 can+0.040
ig+0.039
 dod+0.039
br+0.038
Neg logit contributions
 Son-0.064
sets-0.064
 restaurants-0.063
 Father-0.056
 Sister-0.056
Tokenâ
Token ID22940
Feature activation+2.72e-03

Pos logit contributions
sets+0.105
 restaurants+0.104
 Son+0.104
kids+0.094
 Daughter+0.092
Neg logit contributions
 can-0.101
anda-0.093
ig-0.091
 slips-0.089
 does-0.086
Token€
Token ID26391
Feature activation+5.09e-02

Pos logit contributions
 can+0.005
anda+0.005
 else+0.005
 slips+0.005
ig+0.005
Neg logit contributions
 Son-0.004
sets-0.004
 restaurants-0.003
kids-0.003
 Daughter-0.003
Token she
Token ID673
Feature activation+2.14e-02

Pos logit contributions
 can+0.083
anda+0.078
ig+0.076
 slips+0.072
 else+0.071
Neg logit contributions
 restaurants-0.080
sets-0.080
 Son-0.078
kids-0.072
 Father-0.068
Token sc
Token ID629
Feature activation+1.15e-01

Pos logit contributions
ig+0.028
br+0.027
ite+0.027
anda+0.026
 high+0.025
Neg logit contributions
 Son-0.036
sets-0.034
kids-0.033
 restaurants-0.033
 Sister-0.032
Tokenolded
Token ID48959
Feature activation+1.77e-02

Pos logit contributions
 can+0.182
br+0.168
 high+0.161
 wand+0.156
 does+0.156
Neg logit contributions
 Son-0.168
kids-0.160
 Daughter-0.153
 decisions-0.150
 permission-0.147
Token time
Token ID640
Feature activation+5.05e-02

Pos logit contributions
 restaurants+0.003
 Son+0.003
sets+0.003
 Thanksgiving+0.003
kids+0.003
Neg logit contributions
 can-0.003
anda-0.003
ite-0.002
ig-0.002
 slips-0.002
Token,
Token ID11
Feature activation+7.71e-03

Pos logit contributions
 can+0.055
anda+0.053
 slips+0.052
ig+0.051
ite+0.050
Neg logit contributions
 restaurants-0.100
sets-0.097
 Son-0.096
 Frankie-0.090
 Thanksgiving-0.090
Token in
Token ID287
Feature activation+1.08e-01

Pos logit contributions
anda+0.012
 dod+0.011
ig+0.011
 slips+0.011
 can+0.011
Neg logit contributions
 Son-0.011
 Thanksgiving-0.011
 restaurants-0.011
 Frankie-0.011
sets-0.010
Token a
Token ID257
Feature activation-1.83e-03

Pos logit contributions
 can+0.158
anda+0.139
ig+0.137
 slips+0.134
 does+0.132
Neg logit contributions
sets-0.198
 restaurants-0.195
 Son-0.190
kids-0.177
 veggies-0.175
Token small
Token ID1402
Feature activation+2.13e-02

Pos logit contributions
 restaurants+0.003
 Son+0.003
 Thanksgiving+0.003
sets+0.003
 Father+0.003
Neg logit contributions
 can-0.003
anda-0.003
 slips-0.002
ig-0.002
ite-0.002
Token village
Token ID7404
Feature activation-1.07e-01

Pos logit contributions
 can+0.041
 slips+0.039
anda+0.038
ows+0.038
 does+0.037
Neg logit contributions
 restaurants-0.029
 Son-0.027
sets-0.026
 Father-0.023
 Daughter-0.023
Token,
Token ID11
Feature activation+7.29e-03

Pos logit contributions
 restaurants+0.218
 Son+0.204
sets+0.198
 Father+0.194
 Thanksgiving+0.188
Neg logit contributions
 can-0.128
 dod-0.120
ig-0.118
 slips-0.116
anda-0.115
Token there
Token ID612
Feature activation+3.54e-02

Pos logit contributions
anda+0.010
ig+0.010
 slips+0.010
 dod+0.010
 can+0.010
Neg logit contributions
 restaurants-0.012
 Son-0.012
sets-0.012
 Frankie-0.011
 Thanksgiving-0.011
Token lived
Token ID5615
Feature activation+1.25e-01

Pos logit contributions
 can+0.040
anda+0.037
ig+0.035
 slips+0.034
ite+0.033
Neg logit contributions
 restaurants-0.074
sets-0.073
 Son-0.072
kids-0.067
 veggies-0.067
Token a
Token ID257
Feature activation-1.38e-03

Pos logit contributions
 can+0.201
anda+0.187
ig+0.183
 slips+0.178
ite+0.173
Neg logit contributions
 restaurants-0.210
sets-0.207
 Son-0.204
 veggies-0.185
kids-0.184
Token thin
Token ID7888
Feature activation-1.01e-02

Pos logit contributions
 Son+0.002
 restaurants+0.002
sets+0.002
 Father+0.002
 Thanksgiving+0.002
Neg logit contributions
 can-0.003
ig-0.002
 slips-0.002
anda-0.002
ite-0.002
Token"
Token ID1
Feature activation-2.96e-02

Pos logit contributions
 can+0.014
anda+0.014
ig+0.013
 slips+0.012
 does+0.012
Neg logit contributions
 Son-0.013
 restaurants-0.013
sets-0.013
Father-0.013
 Father-0.012
TokenSorry
Token ID14385
Feature activation-8.47e-02

Pos logit contributions
sets+0.050
 restaurants+0.049
 Son+0.048
kids+0.044
 veggies+0.044
Neg logit contributions
 can-0.048
anda-0.043
ig-0.042
 slips-0.042
 does-0.041
Token,
Token ID11
Feature activation+2.07e-02

Pos logit contributions
 restaurants+0.181
sets+0.179
 Son+0.176
 foods+0.166
 decisions+0.165
Neg logit contributions
 can-0.083
ite-0.080
ig-0.079
anda-0.077
 slips-0.077
Token Ben
Token ID3932
Feature activation+6.51e-02

Pos logit contributions
 can+0.026
anda+0.026
ig+0.025
ite+0.025
 slips+0.023
Neg logit contributions
 Son-0.039
 restaurants-0.039
sets-0.038
 veggies-0.035
 Father-0.035
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
 can+0.071
anda+0.068
ig+0.067
 slips+0.065
ite+0.065
Neg logit contributions
sets-0.132
 Son-0.130
 restaurants-0.129
kids-0.119
 Father-0.119
Token No
Token ID1400
Feature activation-8.52e-02

Pos logit contributions
 can+0.030
anda+0.028
ig+0.028
 slips+0.026
 high+0.026
Neg logit contributions
sets-0.040
 Son-0.039
 restaurants-0.039
 Daughter-0.036
 veggies-0.036
Token more
Token ID517
Feature activation+2.06e-02

Pos logit contributions
 restaurants+0.152
sets+0.151
 Son+0.145
 veggies+0.135
 foods+0.135
Neg logit contributions
 can-0.123
anda-0.117
ig-0.112
ite-0.110
 slips-0.110
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
 can+0.035
ig+0.034
anda+0.034
ite+0.033
 slips+0.032
Neg logit contributions
 restaurants-0.031
sets-0.030
 Son-0.029
 veggies-0.028
 vegetables-0.027
Token Come
Token ID7911
Feature activation+4.52e-02

Pos logit contributions
 can+0.032
anda+0.030
ig+0.029
 slips+0.028
 high+0.028
Neg logit contributions
sets-0.036
 restaurants-0.035
 Son-0.035
 veggies-0.032
kids-0.032
Token on
Token ID319
Feature activation+7.87e-02

Pos logit contributions
anda+0.059
 slips+0.054
ig+0.053
osh+0.051
 does+0.051
Neg logit contributions
sets-0.077
 Son-0.077
 restaurants-0.076
burn-0.073
 Thanksgiving-0.073
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
 can+0.092
anda+0.091
ite+0.090
ig+0.088
 slips+0.083
Neg logit contributions
 restaurants-0.157
 Son-0.154
sets-0.153
 Father-0.146
 Thanksgiving-0.144
Token It
Token ID632
Feature activation+1.83e-02

Pos logit contributions
 can+0.016
anda+0.015
ig+0.014
 slips+0.014
 does+0.014
Neg logit contributions
sets-0.017
 restaurants-0.017
 Son-0.017
kids-0.015
 veggies-0.015
Token was
Token ID373
Feature activation+2.24e-02

Pos logit contributions
 can+0.021
anda+0.020
ig+0.019
 slips+0.018
ite+0.018
Neg logit contributions
sets-0.038
 restaurants-0.037
 Son-0.037
kids-0.034
 veggies-0.034
Token a
Token ID257
Feature activation+7.22e-03

Pos logit contributions
 can+0.039
anda+0.038
ig+0.038
ite+0.037
 slips+0.036
Neg logit contributions
 restaurants-0.034
sets-0.034
 Son-0.033
 veggies-0.029
 Father-0.029
Token magical
Token ID10883
Feature activation+3.59e-02

Pos logit contributions
ows+0.013
 slips+0.012
ig+0.012
anda+0.012
ars+0.012
Neg logit contributions
 Son-0.010
 restaurants-0.010
 couple-0.010
 Father-0.010
sets-0.010
Token rainbow
Token ID27223
Feature activation+6.46e-04

Pos logit contributions
 slips+0.063
anda+0.062
 can+0.061
ars+0.060
ows+0.060
Neg logit contributions
sets-0.049
 restaurants-0.047
 Son-0.047
 couple-0.045
 vegetables-0.044
Token!
Token ID0
Feature activation-6.25e-02

Pos logit contributions
 can+0.001
 slips+0.001
anda+0.001
 cannot+0.001
 does+0.001
Neg logit contributions
sets-0.001
 restaurants-0.001
 Son-0.001
umm-0.001
 couple-0.001
Token She
Token ID1375
Feature activation+2.06e-02

Pos logit contributions
sets+0.094
 restaurants+0.094
 Son+0.091
kids+0.083
 veggies+0.082
Neg logit contributions
 can-0.110
anda-0.101
ig-0.099
 slips-0.097
ite-0.095
Token couldn
Token ID3521
Feature activation-5.90e-03

Pos logit contributions
 can+0.030
anda+0.029
 slips+0.029
ite+0.029
ig+0.028
Neg logit contributions
 Son-0.033
sets-0.032
 restaurants-0.031
 Father-0.029
kids-0.029
Token't
Token ID470
Feature activation-7.31e-02

Pos logit contributions
sets+0.008
 restaurants+0.008
 Son+0.008
kids+0.007
 veggies+0.007
Neg logit contributions
 can-0.011
anda-0.010
ig-0.010
 slips-0.010
 does-0.010
Token believe
Token ID1975
Feature activation-4.82e-02

Pos logit contributions
 restaurants+0.124
sets+0.124
 Son+0.122
kids+0.112
 veggies+0.111
Neg logit contributions
 can-0.112
anda-0.103
ig-0.100
 slips-0.098
ite-0.096
Token it
Token ID340
Feature activation-3.67e-02

Pos logit contributions
sets+0.069
 restaurants+0.067
 Son+0.067
 veggies+0.058
 avail+0.058
Neg logit contributions
 can-0.087
anda-0.083
 slips-0.081
ig-0.080
ite-0.080
Token
Token ID198
Feature activation+5.63e-03

Pos logit contributions
 can+0.008
anda+0.007
ig+0.007
 slips+0.006
 does+0.006
Neg logit contributions
sets-0.011
 restaurants-0.011
 Son-0.010
 veggies-0.010
kids-0.010
TokenSuddenly
Token ID38582
Feature activation-6.26e-02

Pos logit contributions
 can+0.009
anda+0.008
ig+0.007
 slips+0.007
 does+0.007
Neg logit contributions
sets-0.010
 restaurants-0.010
 Son-0.010
kids-0.009
 veggies-0.009
Token,
Token ID11
Feature activation+1.30e-02

Pos logit contributions
sets+0.135
 restaurants+0.135
 Son+0.132
 veggies+0.124
 Frankie+0.124
Neg logit contributions
 can-0.062
anda-0.061
ig-0.055
 slips-0.053
ite-0.053
Token Tim
Token ID5045
Feature activation+1.56e-02

Pos logit contributions
 can+0.020
 high+0.016
anda+0.016
 slips+0.016
ig+0.016
Neg logit contributions
sets-0.023
 restaurants-0.023
 Son-0.023
kids-0.022
 veggies-0.021
Tokenmy
Token ID1820
Feature activation-3.22e-02

Pos logit contributions
 can+0.030
anda+0.028
ig+0.027
 slips+0.027
ite+0.027
Neg logit contributions
sets-0.020
 restaurants-0.019
 Son-0.019
kids-0.017
 veggies-0.016
Token's
Token ID338
Feature activation-1.59e-01

Pos logit contributions
sets+0.056
 restaurants+0.055
 Son+0.054
kids+0.050
 veggies+0.049
Neg logit contributions
 can-0.048
anda-0.044
ig-0.042
 slips-0.042
ite-0.041
Token mom
Token ID1995
Feature activation-4.29e-02

Pos logit contributions
 Son+0.136
 restaurants+0.133
 youngest+0.133
sets+0.132
 father+0.121
Neg logit contributions
 can-0.355
 slips-0.336
ig-0.336
 does-0.330
ars-0.330
Token came
Token ID1625
Feature activation+2.43e-02

Pos logit contributions
sets+0.081
 restaurants+0.080
 Son+0.079
kids+0.074
 Daughter+0.072
Neg logit contributions
 can-0.057
anda-0.052
ig-0.051
 slips-0.050
ite-0.049
Token into
Token ID656
Feature activation-3.70e-03

Pos logit contributions
 can+0.025
anda+0.021
ig+0.020
 slips+0.019
 does+0.019
Neg logit contributions
sets-0.055
 restaurants-0.054
 Son-0.054
kids-0.050
 veggies-0.050
Token the
Token ID262
Feature activation+3.39e-03

Pos logit contributions
sets+0.007
 restaurants+0.006
 Son+0.006
kids+0.006
 veggies+0.006
Neg logit contributions
 can-0.006
anda-0.005
ig-0.005
 slips-0.005
 high-0.005
Token room
Token ID2119
Feature activation+7.74e-02

Pos logit contributions
 can+0.006
ig+0.006
anda+0.006
 slips+0.005
ite+0.005
Neg logit contributions
 restaurants-0.005
sets-0.005
 Son-0.005
 Daughter-0.004
kids-0.004
Token full
Token ID1336
Feature activation-8.34e-03

Pos logit contributions
 can+0.033
anda+0.029
ig+0.029
 slips+0.028
 does+0.027
Neg logit contributions
sets-0.045
 restaurants-0.044
 Son-0.043
kids-0.040
 veggies-0.040
Token from
Token ID422
Feature activation-3.24e-02

Pos logit contributions
 Son+0.014
sets+0.014
 restaurants+0.013
 veggies+0.013
 vegetables+0.013
Neg logit contributions
anda-0.013
 slips-0.012
ig-0.012
 can-0.012
 dod-0.011
Token all
Token ID477
Feature activation-9.51e-02

Pos logit contributions
 restaurants+0.045
 Son+0.044
sets+0.041
 veggies+0.039
kids+0.039
Neg logit contributions
 can-0.060
anda-0.059
ig-0.058
ite-0.057
br-0.056
Token the
Token ID262
Feature activation+6.38e-03

Pos logit contributions
 restaurants+0.145
sets+0.141
 Son+0.137
kids+0.123
 Thanksgiving+0.122
Neg logit contributions
 can-0.169
anda-0.161
ig-0.156
 slips-0.155
ite-0.148
Token y
Token ID331
Feature activation+1.86e-03

Pos logit contributions
 can+0.012
anda+0.011
ig+0.011
 slips+0.011
ite+0.011
Neg logit contributions
 restaurants-0.009
sets-0.009
 Son-0.009
 veggies-0.008
kids-0.008
Tokenummy
Token ID13513
Feature activation-2.51e-01

Pos logit contributions
 can+0.004
ig+0.004
anda+0.004
ite+0.004
 slips+0.004
Neg logit contributions
sets-0.002
 Son-0.002
 restaurants-0.002
Father-0.002
kids-0.002
Token snacks
Token ID28487
Feature activation-4.64e-02

Pos logit contributions
 restaurants+0.296
sets+0.292
 Son+0.275
 veggies+0.257
kids+0.249
Neg logit contributions
 can-0.523
anda-0.501
ig-0.487
 slips-0.484
ite-0.475
Token!
Token ID0
Feature activation-6.15e-02

Pos logit contributions
sets+0.080
 Son+0.076
 restaurants+0.075
 vegetables+0.070
 veggies+0.070
Neg logit contributions
anda-0.070
ig-0.065
 dod-0.064
 slips-0.063
 can-0.063
Token<|endoftext|>
Token ID50256
Feature activation+7.67e-02

Pos logit contributions
sets+0.117
 restaurants+0.117
 Son+0.114
 veggies+0.106
kids+0.106
Neg logit contributions
 can-0.086
anda-0.074
 slips-0.072
ig-0.072
 does-0.070
TokenOne
Token ID3198
Feature activation-8.00e-02

Pos logit contributions
anda+0.144
ig+0.136
 can+0.133
 slips+0.131
 dod+0.128
Neg logit contributions
 restaurants-0.106
sets-0.105
 Son-0.096
 foods-0.084
 Daughter-0.083
Token day
Token ID1110
Feature activation+3.85e-02

Pos logit contributions
 restaurants+0.109
 Son+0.102
sets+0.099
 Thanksgiving+0.094
kids+0.092
Neg logit contributions
 can-0.149
anda-0.146
ig-0.140
 slips-0.140
 dod-0.139
TokenThe
Token ID464
Feature activation-3.75e-02

Pos logit contributions
 can+0.015
anda+0.014
ig+0.013
 slips+0.013
 does+0.013
Neg logit contributions
sets-0.012
 restaurants-0.012
 Son-0.012
kids-0.011
 veggies-0.011
Token families
Token ID4172
Feature activation-1.18e-01

Pos logit contributions
sets+0.039
 restaurants+0.038
 Son+0.036
urg+0.032
 avail+0.031
Neg logit contributions
 can-0.085
anda-0.078
ig-0.075
 high-0.075
 does-0.074
Token decided
Token ID3066
Feature activation-7.65e-02

Pos logit contributions
sets+0.195
 Son+0.190
 restaurants+0.186
kids+0.171
 Father+0.169
Neg logit contributions
 can-0.170
 slips-0.170
anda-0.169
ig-0.164
br-0.159
Token to
Token ID284
Feature activation-1.45e-02

Pos logit contributions
sets+0.150
 restaurants+0.148
 Son+0.146
kids+0.135
 veggies+0.134
Neg logit contributions
 can-0.099
anda-0.090
ig-0.088
 slips-0.085
ite-0.082
Token put
Token ID1234
Feature activation-2.94e-02

Pos logit contributions
sets+0.023
 restaurants+0.023
 Son+0.022
 veggies+0.020
kids+0.020
Neg logit contributions
 can-0.024
anda-0.023
ig-0.022
 slips-0.022
ite-0.021
Token some
Token ID617
Feature activation-1.92e-01

Pos logit contributions
sets+0.044
 Son+0.043
 restaurants+0.043
 foods+0.039
 veggies+0.039
Neg logit contributions
anda-0.053
 can-0.049
 slips-0.047
ider-0.047
ig-0.046
Token of
Token ID286
Feature activation+1.48e-03

Pos logit contributions
 vegetables+0.266
 veggies+0.253
 foods+0.245
 potatoes+0.226
 pancakes+0.223
Neg logit contributions
anda-0.376
 slips-0.339
uffy-0.334
ocus-0.333
ider-0.332
Token the
Token ID262
Feature activation+6.27e-03

Pos logit contributions
 can+0.002
anda+0.002
ig+0.002
 slips+0.002
ite+0.002
Neg logit contributions
 restaurants-0.002
sets-0.002
 Son-0.002
 veggies-0.002
kids-0.002
Token texture
Token ID11743
Feature activation-1.01e-01

Pos logit contributions
anda+0.012
ig+0.011
 can+0.011
 slips+0.011
br+0.010
Neg logit contributions
 vegetables-0.008
 veggies-0.008
 Son-0.008
 restaurants-0.008
sets-0.008
Token in
Token ID287
Feature activation+1.13e-01

Pos logit contributions
 vegetables+0.153
 Son+0.153
sets+0.151
 grains+0.145
 veggies+0.142
Neg logit contributions
anda-0.149
 dod-0.138
ig-0.132
 slams-0.131
 slips-0.128
Token an
Token ID281
Feature activation-7.34e-02

Pos logit contributions
 can+0.193
anda+0.190
ig+0.180
 slips+0.177
ite+0.172
Neg logit contributions
 restaurants-0.172
 Son-0.172
sets-0.171
 Daughter-0.152
 Father-0.150
Token One
Token ID1881
Feature activation-1.13e-01

Pos logit contributions
 can+0.012
 does+0.010
 high+0.009
anda+0.009
 slips+0.009
Neg logit contributions
sets-0.014
 restaurants-0.014
 Son-0.014
 avail-0.013
urg-0.013
Token day
Token ID1110
Feature activation+3.91e-02

Pos logit contributions
sets+0.243
 restaurants+0.226
 Son+0.223
 avail+0.217
urg+0.215
Neg logit contributions
 can-0.146
 high-0.127
ig-0.117
anda-0.115
 else-0.115
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
 can+0.031
anda+0.025
ig+0.024
 slips+0.023
 does+0.022
Neg logit contributions
sets-0.097
 restaurants-0.096
 Son-0.095
kids-0.090
 veggies-0.089
Token Tim
Token ID5045
Feature activation+1.67e-02

Pos logit contributions
 can+0.020
 high+0.017
 slips+0.016
 does+0.016
 else+0.016
Neg logit contributions
sets-0.025
 restaurants-0.025
 Son-0.024
kids-0.023
 decisions-0.022
Token spilled
Token ID31896
Feature activation-1.03e-01

Pos logit contributions
 can+0.024
anda+0.021
ig+0.020
 slips+0.020
 does+0.020
Neg logit contributions
sets-0.032
 restaurants-0.031
 Son-0.031
 avail-0.028
kids-0.028
Token some
Token ID617
Feature activation-1.92e-01

Pos logit contributions
sets+0.161
 restaurants+0.158
 Son+0.153
kids+0.145
 avail+0.139
Neg logit contributions
 can-0.182
anda-0.158
 slips-0.157
 high-0.156
ig-0.156
Token juice
Token ID13135
Feature activation-2.24e-02

Pos logit contributions
 restaurants+0.299
sets+0.297
 Son+0.292
 veggies+0.268
kids+0.264
Neg logit contributions
 can-0.322
anda-0.300
ig-0.294
 slips-0.287
ite-0.283
Token on
Token ID319
Feature activation+7.94e-02

Pos logit contributions
sets+0.043
 Son+0.042
 restaurants+0.042
 vegetables+0.041
 veggies+0.041
Neg logit contributions
anda-0.028
 can-0.024
ig-0.024
ite-0.022
 dod-0.022
Token his
Token ID465
Feature activation-5.71e-03

Pos logit contributions
 can+0.111
 high+0.097
 does+0.090
ig+0.088
 slips+0.087
Neg logit contributions
sets-0.155
 restaurants-0.148
kids-0.143
 Son-0.142
 decisions-0.138
Token shirt
Token ID10147
Feature activation-2.40e-02

Pos logit contributions
 restaurants+0.008
sets+0.007
 Son+0.007
 veggies+0.007
kids+0.007
Neg logit contributions
 can-0.011
anda-0.010
ig-0.010
 slips-0.010
ite-0.010
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
sets+0.046
 restaurants+0.045
 Son+0.044
kids+0.041
 veggies+0.041
Neg logit contributions
 can-0.031
anda-0.030
ig-0.028
 slips-0.027
ite-0.026
Token is
Token ID318
Feature activation+7.98e-02

Pos logit contributions
ig+0.015
anda+0.014
 can+0.014
ite+0.013
 slips+0.013
Neg logit contributions
sets-0.041
 restaurants-0.040
 Son-0.039
 veggies-0.037
 vegetables-0.037
Token so
Token ID523
Feature activation-4.14e-02

Pos logit contributions
ig+0.111
anda+0.111
 can+0.109
ite+0.108
 slips+0.103
Neg logit contributions
 restaurants-0.143
 Son-0.141
sets-0.138
 Thanksgiving-0.132
 veggies-0.131
Token pretty
Token ID2495
Feature activation+4.51e-02

Pos logit contributions
 restaurants+0.071
 Son+0.069
sets+0.065
 veggies+0.064
 Thanksgiving+0.064
Neg logit contributions
ig-0.062
 can-0.059
anda-0.057
 slips-0.057
ite-0.056
Token and
Token ID290
Feature activation-8.65e-03

Pos logit contributions
 can+0.058
anda+0.054
ig+0.052
 slips+0.051
ite+0.049
Neg logit contributions
sets-0.089
 restaurants-0.087
 Son-0.085
 veggies-0.079
kids-0.078
Token y
Token ID331
Feature activation+2.37e-04

Pos logit contributions
 restaurants+0.015
sets+0.015
 Son+0.015
 veggies+0.014
 vegetables+0.014
Neg logit contributions
ig-0.012
anda-0.012
 can-0.012
 slips-0.011
ite-0.011
Tokenummy
Token ID13513
Feature activation-2.51e-01

Pos logit contributions
 can+0.001
anda+0.000
ig+0.000
ite+0.000
 high+0.000
Neg logit contributions
sets-0.000
 restaurants-0.000
 Son-0.000
kids-0.000
Father-0.000
Token."
Token ID526
Feature activation-5.73e-02

Pos logit contributions
sets+0.477
 restaurants+0.475
 Son+0.465
kids+0.431
 veggies+0.428
Neg logit contributions
 can-0.342
anda-0.307
ig-0.299
 slips-0.292
 does-0.282
Token
Token ID198
Feature activation+9.98e-03

Pos logit contributions
sets+0.101
 restaurants+0.100
 Son+0.098
kids+0.091
 veggies+0.089
Neg logit contributions
 can-0.088
anda-0.079
ig-0.077
 slips-0.076
 does-0.074
Token
Token ID198
Feature activation+9.64e-03

Pos logit contributions
 can+0.015
anda+0.014
ig+0.014
 slips+0.014
 does+0.013
Neg logit contributions
 restaurants-0.017
sets-0.017
 Son-0.017
kids-0.015
 Daughter-0.015
TokenMom
Token ID29252
Feature activation-4.03e-02

Pos logit contributions
 can+0.021
anda+0.020
ig+0.019
 slips+0.019
 does+0.019
Neg logit contributions
 Son-0.011
 restaurants-0.010
sets-0.010
Father-0.009
 Daughter-0.009
Token said
Token ID531
Feature activation-1.38e-02

Pos logit contributions
 Son+0.071
sets+0.070
 restaurants+0.068
kids+0.066
 Daughter+0.064
Neg logit contributions
 can-0.054
ig-0.053
anda-0.052
 slips-0.051
ite-0.050
Token's
Token ID338
Feature activation-1.63e-01

Pos logit contributions
 restaurants+0.017
sets+0.016
 Son+0.016
 Father+0.015
kids+0.014
Neg logit contributions
ig-0.011
 can-0.011
anda-0.011
 slips-0.010
ite-0.010
Token office
Token ID2607
Feature activation-1.74e-03

Pos logit contributions
 restaurants+0.235
sets+0.232
 Son+0.226
kids+0.202
 Father+0.199
Neg logit contributions
 can-0.298
anda-0.275
ig-0.274
 slips-0.269
ite-0.264
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
 restaurants+0.003
sets+0.003
 Son+0.003
kids+0.003
 Father+0.003
Neg logit contributions
 can-0.002
ig-0.002
anda-0.002
ite-0.002
 slips-0.002
Token Dr
Token ID1583
Feature activation-3.38e-02

Pos logit contributions
 can+0.020
 high+0.017
 does+0.017
 else+0.016
anda+0.016
Neg logit contributions
sets-0.021
 restaurants-0.021
 Son-0.021
kids-0.020
urg-0.020
Token.
Token ID13
Feature activation+9.19e-03

Pos logit contributions
sets+0.067
 restaurants+0.066
 Son+0.066
kids+0.062
 Thanksgiving+0.062
Neg logit contributions
 can-0.042
ite-0.039
ig-0.039
 slips-0.038
anda-0.036
Token Smith
Token ID4176
Feature activation-2.18e-01

Pos logit contributions
 can+0.017
anda+0.015
ig+0.015
 slips+0.015
 high+0.015
Neg logit contributions
 restaurants-0.013
 Son-0.012
sets-0.012
kids-0.012
 veggies-0.011
Token told
Token ID1297
Feature activation-9.35e-02

Pos logit contributions
sets+0.346
 restaurants+0.342
 Son+0.340
 Father+0.317
kids+0.316
Neg logit contributions
ig-0.308
 can-0.302
ite-0.294
anda-0.293
 slips-0.291
Token Lucy
Token ID22162
Feature activation+4.00e-02

Pos logit contributions
sets+0.162
 restaurants+0.151
 Son+0.149
 avail+0.137
 Daughter+0.136
Neg logit contributions
 can-0.151
anda-0.129
 does-0.127
 high-0.127
 else-0.124
Token that
Token ID326
Feature activation-4.24e-02

Pos logit contributions
 can+0.052
anda+0.044
 does+0.042
 high+0.042
 else+0.041
Neg logit contributions
sets-0.080
 restaurants-0.079
 Son-0.078
 avail-0.075
umm-0.073
Token Hop
Token ID9996
Feature activation+7.66e-02

Pos logit contributions
sets+0.072
 restaurants+0.071
 Son+0.071
kids+0.067
urg+0.065
Neg logit contributions
 can-0.067
anda-0.057
 does-0.056
 slips-0.056
ig-0.055
Token was
Token ID373
Feature activation+2.15e-02

Pos logit contributions
ig+0.096
 can+0.091
anda+0.090
ite+0.087
 slips+0.087
Neg logit contributions
 restaurants-0.142
sets-0.141
 Son-0.136
 foods-0.131
 veggies-0.130