TOP ACTIVATIONS
MAX = 11.143

a big bite of the hamb urger . " M mm
saw a picture of a hamb urger and fries , and
We have hot dogs and hamb urg ers ," said the
I want a y ummy hamb urger with French fries !"
is a picture of a hamb urger . It 's a
noticed it was an old hamb urger wrapper . The car
time she should order her hamb urger without pick les .
's mom gave him a hamb urger . Tim my was
help cook hot dogs and hamb urg ers on the grill
my loved meat , especially hamb urg ers . One day
girl decided to try a hamb urger and some fries .
cooked some hot dogs and hamb urg ers on the grill
that , Lily enjoyed her hamb urger without pick les and
. It was the best hamb urger he had ever had
" I want a big burger and fries and k etchup
him and he ordered a burger . While he
Tom did not like his burger . It was too big
happy . He ate his burger and fries and k etchup
" Did you like your burger , sweet ie ?" Lily
they saw . Tim 's burger was burnt and dry .

MIN ACTIVATIONS
MIN = -12.606

disappointment . M arl on 's mum hugged him
's okay , my d arl ings . I forgive you
to grow l and sn arl to try and scare her
you too , my d arl ings ," she said .
let out a big sn arl and the mouse knew he
you go , my d arl ings ," Mom said .
you okay , my d arl ings ?" she asked .
have fun , my d arl ings ?" she asks .
Spot grow ls and sn arl s . He tries to
you okay , my d arl ings ?" Mom asked .
have fun , my d arl ings ?" Their mother asked
. Rex sn arl s . He does not
's wonderful , my d arl ings !" Mom my says
" Hello , my d arl ings . I 'm so
'm sorry , my d arl ings . That was a
'm sorry , my d arl ings . That was a
" Hello , my d arl ings . I 'm so
What happened , my d arl ings ? Why do you
forgive you , my d arl ings . And of course
'm sorry , my d arl ings . The tornado took

INTERVAL 5.206 - 11.143
CONTAINS 0.715%

on the floor under the table . The table was strong
put the bunny in the middle of the case , next
with his mom . He noticed a big billboard . On
. The family had to choose one of the roads ,
mom made him spaghetti for dinner . He was so excited

INTERVAL -0.731 - 5.206
CONTAINS 73.629%

is not a map . It is just a leaf .
friends again ?" S ara and Tom smile at
. The bird will be sad . And the baby birds
. The net got stuck on the branch . L ila
Max wanted the cord too , but he did not want

INTERVAL -6.669 - -0.731
CONTAINS 25.492%

as he could use it as a blanket or cushion whenever
went to his desk and looked around for an idea .
the stove . They are careful not to touch the hot
again . But then , her mom my came outside and
Once upon a time , there was a girl named Lily

INTERVAL -12.606 - -6.669
CONTAINS 0.164%

happy to escape . It fl apped its wings and buzz
to each other and see how far it could go .
grow up well . He thinks they are smart . He
Sam ." Sam thinks for a moment . He
." S ara thinks for a moment . She
Token a
Token ID257
Feature activation-9.26e-02

Pos logit contributions
arl+0.066
imi+0.063
icked+0.059
ig+0.057
ixie+0.056
Neg logit contributions
 distance-0.091
 restaurants-0.086
 hamb-0.086
 burger-0.082
handed-0.078
Token big
Token ID1263
Feature activation-8.52e-01

Pos logit contributions
arl+0.005
imi+0.005
anda+0.004
ig+0.004
 eight+0.004
Neg logit contributions
 distance-0.007
 restaurants-0.007
handed-0.007
Ice-0.007
asts-0.007
Token bite
Token ID13197
Feature activation-1.77e+00

Pos logit contributions
arl+0.058
imi+0.054
ig+0.050
ixie+0.049
icked+0.049
Neg logit contributions
 distance-0.063
 restaurants-0.060
 hamb-0.059
 burger-0.055
handed-0.054
Token of
Token ID286
Feature activation+1.88e+00

Pos logit contributions
arl+0.084
imi+0.082
 how+0.075
icked+0.075
ig+0.072
Neg logit contributions
 distance-0.160
 restaurants-0.157
 hamb-0.149
handed-0.145
 burger-0.142
Token the
Token ID262
Feature activation-8.71e-03

Pos logit contributions
 distance+0.175
 restaurants+0.157
handed+0.157
Ice+0.153
glass+0.149
Neg logit contributions
 how-0.086
imi-0.084
arl-0.079
 my-0.075
 eyes-0.072
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
 eyes+0.001
arl+0.001
 is+0.001
 knows+0.001
imi+0.001
Neg logit contributions
 distance-0.000
bench-0.000
roads-0.000
chard-0.000
handed-0.000
Tokenurger
Token ID32650
Feature activation+4.76e+00

Pos logit contributions
 distance+0.542
 restaurants+0.492
 hamb+0.477
 burger+0.436
handed+0.405
Neg logit contributions
arl-1.050
imi-1.009
ig-0.940
icked-0.939
anda-0.933
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
 distance+0.412
 restaurants+0.394
 hamb+0.373
handed+0.359
 burger+0.354
Neg logit contributions
arl-0.251
imi-0.237
icked-0.223
 how-0.222
ig-0.216
Token "
Token ID366
Feature activation-7.80e-01

Pos logit contributions
 distance+0.018
 restaurants+0.017
 hamb+0.017
 burger+0.016
handed+0.016
Neg logit contributions
arl-0.013
imi-0.012
ixie-0.011
icked-0.011
uttered-0.010
TokenM
Token ID44
Feature activation-5.44e-01

Pos logit contributions
arl+0.041
ra+0.038
imi+0.038
 how+0.035
 tweet+0.035
Neg logit contributions
 restaurants-0.065
 distance-0.064
 hamb-0.057
 lobby-0.057
 burger-0.056
Tokenmm
Token ID3020
Feature activation+4.09e+00

Pos logit contributions
arl+0.034
imi+0.032
ixie+0.029
ig+0.029
uttered+0.029
Neg logit contributions
 distance-0.042
 hamb-0.041
 restaurants-0.041
 burger-0.039
aster-0.037
Token saw
Token ID2497
Feature activation+3.78e+00

Pos logit contributions
imi+0.068
 how+0.063
anda+0.063
 knows+0.063
arl+0.063
Neg logit contributions
 distance-0.081
 restaurants-0.073
 hamb-0.071
 burger-0.068
aster-0.068
Token a
Token ID257
Feature activation-1.12e-01

Pos logit contributions
 distance+0.336
 hamb+0.302
 restaurants+0.298
 burger+0.287
asts+0.286
Neg logit contributions
imi-0.193
arl-0.186
 how-0.186
icked-0.169
anda-0.160
Token picture
Token ID4286
Feature activation+1.79e+00

Pos logit contributions
arl+0.004
 wand+0.004
imi+0.004
anda+0.004
 eight+0.004
Neg logit contributions
 distance-0.010
handed-0.010
 restaurants-0.010
aster-0.010
des-0.010
Token of
Token ID286
Feature activation+1.90e+00

Pos logit contributions
 distance+0.145
 restaurants+0.139
 hamb+0.138
 burger+0.130
handed+0.127
Neg logit contributions
arl-0.107
imi-0.103
icked-0.092
ig-0.091
 how-0.090
Token a
Token ID257
Feature activation-1.07e-01

Pos logit contributions
 distance+0.175
 hamb+0.152
handed+0.151
 restaurants+0.151
ding+0.148
Neg logit contributions
 how-0.090
imi-0.089
arl-0.082
 tweet-0.077
 eyes-0.077
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
 wand+0.005
arl+0.005
 can+0.005
imi+0.005
 eight+0.005
Neg logit contributions
 distance-0.008
handed-0.008
 restaurants-0.008
des-0.008
aster-0.008
Tokenurger
Token ID32650
Feature activation+4.75e+00

Pos logit contributions
 distance+0.557
 restaurants+0.500
 hamb+0.482
 burger+0.440
handed+0.416
Neg logit contributions
arl-1.037
imi-0.998
ig-0.927
icked-0.926
anda-0.922
Token and
Token ID290
Feature activation+3.76e-01

Pos logit contributions
 distance+0.437
 restaurants+0.406
handed+0.387
 hamb+0.385
aster+0.373
Neg logit contributions
arl-0.221
imi-0.216
 how-0.205
icked-0.198
ig-0.193
Token fries
Token ID31757
Feature activation+1.01e+01

Pos logit contributions
 distance+0.032
handed+0.028
 restaurants+0.026
line+0.026
ding+0.026
Neg logit contributions
 how-0.022
 eight-0.021
imi-0.020
 eyes-0.020
icked-0.020
Token,
Token ID11
Feature activation-4.34e-01

Pos logit contributions
 distance+0.919
 restaurants+0.841
 hamb+0.799
handed+0.793
aster+0.787
Neg logit contributions
arl-0.506
icked-0.494
 how-0.478
imi-0.477
icks-0.439
Token and
Token ID290
Feature activation+3.99e-01

Pos logit contributions
 how+0.027
 eyes+0.024
 eight+0.022
 does+0.021
 knows+0.021
Neg logit contributions
 distance-0.036
handed-0.035
 restaurants-0.031
atmeal-0.031
aster-0.031
TokenWe
Token ID1135
Feature activation+3.91e+00

Pos logit contributions
arl+0.050
imi+0.044
ra+0.044
 how+0.042
 tweet+0.041
Neg logit contributions
 distance-0.059
 restaurants-0.059
 hamb-0.056
 burger-0.054
 lobby-0.050
Token have
Token ID423
Feature activation+1.49e+00

Pos logit contributions
 distance+0.327
 hamb+0.321
 restaurants+0.315
 burger+0.301
aster+0.284
Neg logit contributions
arl-0.233
imi-0.219
ixie-0.193
ig-0.189
uttered-0.186
Token hot
Token ID3024
Feature activation+3.96e+00

Pos logit contributions
 distance+0.115
 restaurants+0.109
 hamb+0.107
 burger+0.101
handed+0.098
Neg logit contributions
arl-0.094
imi-0.089
icked-0.082
 how-0.082
ig-0.080
Token dogs
Token ID6844
Feature activation+4.53e+00

Pos logit contributions
 distance+0.250
 restaurants+0.216
 hamb+0.210
handed+0.205
 burger+0.196
Neg logit contributions
arl-0.315
imi-0.301
 how-0.282
ig-0.277
icked-0.275
Token and
Token ID290
Feature activation+3.26e-01

Pos logit contributions
 distance+0.381
 restaurants+0.365
 hamb+0.363
 burger+0.346
aster+0.328
Neg logit contributions
arl-0.261
imi-0.244
ig-0.217
icked-0.215
ixie-0.215
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
 distance+0.022
 restaurants+0.021
 hamb+0.020
 burger+0.019
handed+0.018
Neg logit contributions
arl-0.024
imi-0.023
ig-0.021
ixie-0.021
icked-0.021
Tokenurg
Token ID3686
Feature activation-1.80e+00

Pos logit contributions
 hamb+0.813
 restaurants+0.770
 burger+0.766
asts+0.732
 fries+0.713
Neg logit contributions
arl-0.744
imi-0.704
ixie-0.669
ig-0.658
 tweet-0.655
Tokeners
Token ID364
Feature activation+3.01e+00

Pos logit contributions
arl+0.151
imi+0.140
icked+0.133
ig+0.132
igg+0.130
Neg logit contributions
 distance-0.107
 restaurants-0.099
 hamb-0.097
 burger-0.091
Ice-0.084
Token,"
Token ID553
Feature activation+2.95e+00

Pos logit contributions
 distance+0.255
 restaurants+0.243
 hamb+0.240
 burger+0.226
Ice+0.220
Neg logit contributions
arl-0.169
imi-0.152
ig-0.142
anda-0.140
 how-0.138
Token said
Token ID531
Feature activation+3.65e-01

Pos logit contributions
 distance+0.234
 restaurants+0.220
 hamb+0.215
 burger+0.202
asts+0.200
Neg logit contributions
imi-0.170
arl-0.167
 how-0.162
 tweet-0.153
icked-0.152
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
 distance+0.034
 restaurants+0.032
 hamb+0.031
 burger+0.030
handed+0.029
Neg logit contributions
arl-0.016
 how-0.016
imi-0.016
icked-0.016
uttered-0.015
TokenI
Token ID40
Feature activation+1.93e+00

Pos logit contributions
arl+0.046
imi+0.042
ra+0.039
 tweet+0.039
ixie+0.038
Neg logit contributions
 restaurants-0.058
 distance-0.056
 hamb-0.055
 burger-0.053
 lobby-0.049
Token want
Token ID765
Feature activation-5.45e-01

Pos logit contributions
 distance+0.163
 restaurants+0.157
 hamb+0.149
 burger+0.146
handed+0.143
Neg logit contributions
arl-0.110
imi-0.101
igg-0.093
icked-0.092
 knows-0.091
Token a
Token ID257
Feature activation-4.33e-02

Pos logit contributions
arl+0.029
imi+0.029
 how+0.027
icked+0.025
ig+0.025
Neg logit contributions
 distance-0.047
 restaurants-0.043
 hamb-0.042
handed-0.041
 burger-0.040
Token y
Token ID331
Feature activation+2.25e+00

Pos logit contributions
arl+0.003
imi+0.003
 tweet+0.003
anda+0.003
ig+0.003
Neg logit contributions
 distance-0.003
 restaurants-0.003
handed-0.002
 hamb-0.002
asts-0.002
Tokenummy
Token ID13513
Feature activation+7.15e-01

Pos logit contributions
 distance+0.209
 hamb+0.203
 restaurants+0.202
 burger+0.195
aster+0.184
Neg logit contributions
arl-0.109
imi-0.101
icked-0.089
ixie-0.088
 tweet-0.088
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
 distance+0.048
handed+0.042
line+0.040
 restaurants+0.040
David+0.039
Neg logit contributions
imi-0.049
arl-0.048
 how-0.046
 tweet-0.046
ig-0.045
Tokenurger
Token ID32650
Feature activation+4.76e+00

Pos logit contributions
 distance+0.578
 restaurants+0.512
 hamb+0.479
 burger+0.440
handed+0.431
Neg logit contributions
arl-1.030
imi-0.988
ig-0.920
icked-0.919
anda-0.912
Token with
Token ID351
Feature activation+1.17e+00

Pos logit contributions
 distance+0.402
 restaurants+0.381
 hamb+0.377
 burger+0.358
handed+0.345
Neg logit contributions
arl-0.275
imi-0.257
ig-0.229
icked-0.229
anda-0.225
Token French
Token ID4141
Feature activation+1.97e+00

Pos logit contributions
 distance+0.094
 restaurants+0.089
 hamb+0.087
 burger+0.082
handed+0.080
Neg logit contributions
arl-0.072
imi-0.068
icked-0.061
 how-0.060
ig-0.060
Token fries
Token ID31757
Feature activation+1.01e+01

Pos logit contributions
 distance+0.104
 restaurants+0.087
handed+0.083
aster+0.076
 hamb+0.074
Neg logit contributions
arl-0.178
imi-0.167
 how-0.163
anda-0.160
icked-0.159
Token!"
Token ID2474
Feature activation+8.10e-01

Pos logit contributions
 distance+0.794
 restaurants+0.753
 hamb+0.747
 burger+0.708
handed+0.675
Neg logit contributions
arl-0.636
imi-0.596
icked-0.537
ig-0.536
ixie-0.531
Token is
Token ID318
Feature activation-3.27e+00

Pos logit contributions
arl+0.070
 how+0.069
 knows+0.068
 is+0.066
imi+0.065
Neg logit contributions
 distance-0.403
Ice-0.364
handed-0.364
 restaurants-0.363
 hamb-0.363
Token a
Token ID257
Feature activation-1.62e-01

Pos logit contributions
arl+0.199
imi+0.187
ig+0.167
ixie+0.166
icked+0.166
Neg logit contributions
 distance-0.267
 restaurants-0.255
 hamb-0.253
 burger-0.241
handed-0.229
Token picture
Token ID4286
Feature activation+1.73e+00

Pos logit contributions
arl+0.009
imi+0.008
 wand+0.008
 tweet+0.008
 knows+0.008
Neg logit contributions
 distance-0.013
 restaurants-0.012
handed-0.012
aster-0.011
asts-0.011
Token of
Token ID286
Feature activation+1.80e+00

Pos logit contributions
 distance+0.128
 restaurants+0.122
 hamb+0.122
 burger+0.115
aster+0.109
Neg logit contributions
arl-0.117
imi-0.110
ixie-0.100
ig-0.100
icked-0.099
Token a
Token ID257
Feature activation-1.44e-01

Pos logit contributions
 distance+0.144
 restaurants+0.137
 hamb+0.136
 burger+0.130
handed+0.123
Neg logit contributions
arl-0.112
imi-0.105
ig-0.094
icked-0.093
ixie-0.093
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
arl+0.009
imi+0.008
 wand+0.008
 can+0.008
 tweet+0.008
Neg logit contributions
 distance-0.010
 restaurants-0.010
handed-0.010
aster-0.009
atmeal-0.009
Tokenurger
Token ID32650
Feature activation+4.73e+00

Pos logit contributions
 distance+0.540
 restaurants+0.502
 hamb+0.500
 burger+0.459
aster+0.412
Neg logit contributions
arl-1.035
imi-0.993
ig-0.927
ixie-0.924
icked-0.922
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
 distance+0.405
 restaurants+0.377
 hamb+0.372
 burger+0.350
handed+0.348
Neg logit contributions
arl-0.270
imi-0.252
 how-0.226
icked-0.225
ig-0.224
Token It
Token ID632
Feature activation+1.60e-01

Pos logit contributions
 distance+0.018
 restaurants+0.017
 hamb+0.016
handed+0.016
 burger+0.015
Neg logit contributions
arl-0.011
imi-0.010
 how-0.009
icked-0.009
 tweet-0.009
Token's
Token ID338
Feature activation+3.46e+00

Pos logit contributions
 hamb+0.011
 restaurants+0.011
 distance+0.011
 burger+0.011
 weighed+0.010
Neg logit contributions
arl-0.012
imi-0.010
uttered-0.010
ig-0.010
ixie-0.010
Token a
Token ID257
Feature activation-1.49e-01

Pos logit contributions
 distance+0.286
 restaurants+0.264
handed+0.255
 hamb+0.254
 burger+0.243
Neg logit contributions
arl-0.202
imi-0.199
 how-0.183
icked-0.174
icks-0.168
Token noticed
Token ID6810
Feature activation+5.84e+00

Pos logit contributions
arl+0.047
imi+0.045
ig+0.042
icked+0.041
ixie+0.041
Neg logit contributions
 distance-0.052
 restaurants-0.050
 hamb-0.049
 burger-0.046
handed-0.044
Token it
Token ID340
Feature activation+1.23e+00

Pos logit contributions
 distance+0.455
 hamb+0.438
 restaurants+0.435
 burger+0.413
handed+0.391
Neg logit contributions
arl-0.391
imi-0.356
ixie-0.322
anda-0.320
ig-0.317
Token was
Token ID373
Feature activation+1.08e-01

Pos logit contributions
 distance+0.094
 hamb+0.088
 restaurants+0.088
 burger+0.083
aster+0.079
Neg logit contributions
arl-0.083
imi-0.077
ixie-0.070
ig-0.069
icked-0.069
Token an
Token ID281
Feature activation+1.91e+00

Pos logit contributions
 distance+0.009
 restaurants+0.009
 hamb+0.008
 burger+0.008
handed+0.008
Neg logit contributions
arl-0.006
imi-0.006
 how-0.006
icked-0.005
 bud-0.005
Token old
Token ID1468
Feature activation-8.43e-01

Pos logit contributions
 distance+0.153
 restaurants+0.146
 hamb+0.144
 burger+0.138
handed+0.130
Neg logit contributions
arl-0.117
imi-0.111
ig-0.101
igg-0.100
anda-0.099
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
arl+0.048
imi+0.047
ig+0.046
 wand+0.045
 bud+0.045
Neg logit contributions
 restaurants-0.064
 distance-0.064
 hamb-0.062
asts-0.059
dogs-0.058
Tokenurger
Token ID32650
Feature activation+4.73e+00

Pos logit contributions
 hamb+0.658
 distance+0.629
 restaurants+0.614
 burger+0.614
asts+0.581
Neg logit contributions
arl-0.882
imi-0.839
ixie-0.795
 tweet-0.788
uttered-0.786
Token wrapper
Token ID29908
Feature activation+1.83e+00

Pos logit contributions
 distance+0.406
 restaurants+0.395
 hamb+0.375
handed+0.358
 burger+0.355
Neg logit contributions
arl-0.251
imi-0.235
ig-0.224
 how-0.222
icked-0.222
Token.
Token ID13
Feature activation+9.99e-02

Pos logit contributions
 distance+0.149
 restaurants+0.144
 hamb+0.139
 burger+0.132
handed+0.128
Neg logit contributions
arl-0.112
imi-0.104
icked-0.097
ig-0.097
anda-0.094
Token The
Token ID383
Feature activation+1.50e+00

Pos logit contributions
 distance+0.008
 restaurants+0.008
 hamb+0.008
 burger+0.007
aster+0.007
Neg logit contributions
arl-0.006
imi-0.006
icked-0.005
ixie-0.005
ig-0.005
Token car
Token ID1097
Feature activation-6.98e-01

Pos logit contributions
 distance+0.125
 restaurants+0.121
 hamb+0.114
handed+0.111
 french+0.110
Neg logit contributions
arl-0.080
imi-0.071
anda-0.070
 eight-0.069
 eyes-0.068
Token time
Token ID640
Feature activation+5.83e-01

Pos logit contributions
arl+0.380
imi+0.362
icked+0.341
ixie+0.336
anda+0.333
Neg logit contributions
 distance-0.271
 hamb-0.253
 restaurants-0.251
 burger-0.235
aster-0.221
Token she
Token ID673
Feature activation-8.06e-02

Pos logit contributions
 distance+0.048
 restaurants+0.046
 hamb+0.046
 burger+0.044
aster+0.042
Neg logit contributions
arl-0.035
imi-0.032
ig-0.029
ixie-0.029
icked-0.029
Token should
Token ID815
Feature activation-7.42e-01

Pos logit contributions
arl+0.006
imi+0.005
ig+0.005
ixie+0.005
uttered+0.004
Neg logit contributions
 distance-0.006
 hamb-0.006
 restaurants-0.006
 burger-0.006
aster-0.005
Token order
Token ID1502
Feature activation+1.07e+00

Pos logit contributions
arl+0.049
imi+0.046
icked+0.043
ixie+0.043
uttered+0.043
Neg logit contributions
 distance-0.056
 restaurants-0.054
 hamb-0.053
 burger-0.050
aster-0.050
Token her
Token ID607
Feature activation-1.73e+00

Pos logit contributions
 distance+0.087
 hamb+0.082
 restaurants+0.082
 burger+0.078
handed+0.074
Neg logit contributions
arl-0.064
imi-0.060
 how-0.055
icked-0.053
ixie-0.053
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
arl+0.114
imi+0.109
 how+0.102
ig+0.099
anda+0.099
Neg logit contributions
 distance-0.129
 hamb-0.119
 restaurants-0.119
 burger-0.112
asts-0.108
Tokenurger
Token ID32650
Feature activation+4.53e+00

Pos logit contributions
 distance+0.641
 restaurants+0.556
 hamb+0.519
 burger+0.483
handed+0.473
Neg logit contributions
arl-0.976
imi-0.936
icked-0.878
anda-0.871
ig-0.870
Token without
Token ID1231
Feature activation+3.83e+00

Pos logit contributions
 distance+0.374
 restaurants+0.352
 hamb+0.348
 burger+0.328
aster+0.316
Neg logit contributions
arl-0.266
imi-0.249
 how-0.228
icked-0.227
ig-0.225
Token pick
Token ID2298
Feature activation+5.84e-01

Pos logit contributions
 distance+0.296
 restaurants+0.268
 hamb+0.259
 burger+0.252
Ice+0.248
Neg logit contributions
arl-0.245
imi-0.234
 how-0.224
icked-0.217
ig-0.208
Tokenles
Token ID829
Feature activation+7.88e-02

Pos logit contributions
 distance+0.050
 restaurants+0.047
 hamb+0.046
 burger+0.044
asts+0.043
Neg logit contributions
arl-0.032
imi-0.030
icked-0.028
 how-0.028
ig-0.028
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
 distance+0.006
 restaurants+0.006
 hamb+0.006
 burger+0.006
Ice+0.005
Neg logit contributions
arl-0.005
imi-0.004
ig-0.004
 how-0.004
icked-0.004
Token's
Token ID338
Feature activation+3.46e+00

Pos logit contributions
 distance+0.021
 restaurants+0.021
 hamb+0.019
aster+0.019
dogs+0.019
Neg logit contributions
imi-0.012
 how-0.012
 knows-0.011
anda-0.011
ixie-0.011
Token mom
Token ID1995
Feature activation-2.21e-01

Pos logit contributions
 distance+0.267
 restaurants+0.253
 hamb+0.236
aster+0.234
handed+0.228
Neg logit contributions
arl-0.222
imi-0.218
anda-0.193
ig-0.186
ixie-0.186
Token gave
Token ID2921
Feature activation+1.09e+00

Pos logit contributions
imi+0.012
 how+0.012
arl+0.011
anda+0.011
icked+0.011
Neg logit contributions
 distance-0.018
 restaurants-0.018
 hamb-0.017
aster-0.016
 burger-0.016
Token him
Token ID683
Feature activation+1.79e+00

Pos logit contributions
 distance+0.086
 restaurants+0.082
 hamb+0.080
 burger+0.077
asts+0.074
Neg logit contributions
arl-0.068
imi-0.064
 how-0.061
icked-0.060
uttered-0.058
Token a
Token ID257
Feature activation-1.86e-01

Pos logit contributions
 distance+0.125
 restaurants+0.118
 hamb+0.118
 burger+0.111
aster+0.104
Neg logit contributions
arl-0.129
imi-0.123
ig-0.112
ixie-0.111
icked-0.111
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
arl+0.012
imi+0.011
ixie+0.010
 tweet+0.010
anda+0.009
Neg logit contributions
 distance-0.015
 restaurants-0.014
handed-0.013
asts-0.013
 hamb-0.013
Tokenurger
Token ID32650
Feature activation+4.74e+00

Pos logit contributions
 distance+0.472
 hamb+0.444
 restaurants+0.438
 burger+0.401
aster+0.348
Neg logit contributions
arl-1.090
imi-1.047
ig-0.984
ixie-0.982
icked-0.977
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
 distance+0.398
 restaurants+0.383
 hamb+0.356
handed+0.339
aster+0.336
Neg logit contributions
arl-0.278
imi-0.255
icked-0.245
 how-0.236
ig-0.231
Token Tim
Token ID5045
Feature activation+2.43e-01

Pos logit contributions
 distance+0.016
 restaurants+0.015
 hamb+0.015
 burger+0.014
handed+0.014
Neg logit contributions
arl-0.012
imi-0.012
uttered-0.011
 how-0.011
icked-0.010
Tokenmy
Token ID1820
Feature activation+2.31e-01

Pos logit contributions
 distance+0.015
 restaurants+0.014
 hamb+0.014
 burger+0.013
handed+0.012
Neg logit contributions
arl-0.020
imi-0.018
uttered-0.018
icked-0.017
 tweet-0.017
Token was
Token ID373
Feature activation-1.54e-02

Pos logit contributions
 distance+0.019
 restaurants+0.018
 hamb+0.017
 burger+0.016
handed+0.016
Neg logit contributions
imi-0.014
arl-0.013
ig-0.012
icked-0.012
anda-0.012
Token help
Token ID1037
Feature activation-1.21e+00

Pos logit contributions
arl+0.013
 how+0.013
imi+0.013
 tweet+0.012
icks+0.011
Neg logit contributions
 distance-0.030
 restaurants-0.027
handed-0.026
 hamb-0.026
glass-0.026
Token cook
Token ID4255
Feature activation+5.00e+00

Pos logit contributions
 how+0.050
arl+0.043
 bud+0.043
imi+0.041
 me+0.039
Neg logit contributions
 distance-0.118
 restaurants-0.113
 hamb-0.108
 lobby-0.106
 burger-0.106
Token hot
Token ID3024
Feature activation+3.97e+00

Pos logit contributions
 distance+0.489
 restaurants+0.427
Ice+0.416
 hamb+0.411
handed+0.410
Neg logit contributions
 how-0.232
arl-0.193
imi-0.180
icks-0.174
icked-0.172
Token dogs
Token ID6844
Feature activation+4.51e+00

Pos logit contributions
 distance+0.284
handed+0.227
 restaurants+0.221
wreck+0.216
glass+0.214
Neg logit contributions
 how-0.275
arl-0.270
imi-0.265
 eyes-0.243
icked-0.241
Token and
Token ID290
Feature activation+2.98e-01

Pos logit contributions
 distance+0.396
 restaurants+0.375
 hamb+0.366
 burger+0.349
aster+0.342
Neg logit contributions
arl-0.218
 how-0.214
imi-0.202
ig-0.190
icked-0.190
Token hamb
Token ID41478
Feature activation+1.11e+01

Pos logit contributions
 distance+0.021
 restaurants+0.019
handed+0.019
 hamb+0.018
Ice+0.018
Neg logit contributions
 how-0.019
imi-0.018
arl-0.018
 knows-0.017
icked-0.017
Tokenurg
Token ID3686
Feature activation-1.80e+00

Pos logit contributions
 hamb+0.891
 restaurants+0.862
 distance+0.848
 burger+0.840
asts+0.796
Neg logit contributions
arl-0.660
imi-0.612
ixie-0.558
 tweet-0.558
ig-0.557
Tokeners
Token ID364
Feature activation+3.07e+00

Pos logit contributions
arl+0.121
icked+0.113
igg+0.110
imi+0.107
 how+0.105
Neg logit contributions
 distance-0.135
 restaurants-0.128
 hamb-0.120
 burger-0.116
Ice-0.111
Token on
Token ID319
Feature activation+2.00e+00

Pos logit contributions
 distance+0.258
 restaurants+0.250
 hamb+0.236
 burger+0.226
Ice+0.220
Neg logit contributions
arl-0.166
imi-0.150
 how-0.149
icked-0.149
ig-0.144
Token the
Token ID262
Feature activation-8.11e-02

Pos logit contributions
 distance+0.185
 restaurants+0.173
 hamb+0.170
atmeal+0.166
 burger+0.165
Neg logit contributions
 how-0.082
arl-0.081
imi-0.080
 eight-0.072
 eyes-0.066
Token grill
Token ID29901
Feature activation+6.26e+00

Pos logit contributions
arl+0.006
imi+0.006
 me+0.006
 eight+0.005
 heads+0.005
Neg logit contributions
 restaurants-0.005
 distance-0.004
dream-0.004
bridge-0.004
handed-0.004
Tokenmy
Token ID1820
Feature activation+2.44e-01

Pos logit contributions
 distance+0.021
 hamb+0.020
 restaurants+0.020
 burger+0.019
asts+0.018
Neg logit contributions
imi-0.016
arl-0.016
ig-0.014
 how-0.014
anda-0.014
Token loved
Token ID6151
Feature activation+8.50e-01

Pos logit contributions
 distance+0.022
 restaurants+0.022
 hamb+0.021
 burger+0.020
handed+0.020
Neg logit contributions
imi-0.012
 how-0.010
anda-0.010
ig-0.010
 knows-0.010
Token meat
Token ID6174
Feature activation+7.14e+00

Pos logit contributions
 distance+0.075
 restaurants+0.069
 hamb+0.068
handed+0.066
 burger+0.065
Neg logit contributions
arl-0.045
imi-0.044
 how-0.039
ig-0.037
icked-0.037
Token,
Token ID11
Feature activation-4.54e-01

Pos logit contributions
 distance+0.658
 restaurants+0.609
 hamb+0.589
aster+0.565
handed+0.564
Neg logit contributions
imi-0.357
arl-0.351
 how-0.313
icked-0.310
ig-0.307
Token especially
Token ID2592
Feature activation+9.33e+00

Pos logit contributions
 how+0.036
imi+0.031
 can+0.030
 my+0.030
 me+0.029
Neg logit contributions
handed-0.035
 distance-0.030
keeping-0.028
roads-0.028
 lobby-0.027
Token hamb
Token ID41478
Feature activation+1.10e+01

Pos logit contributions
 distance+0.662
 restaurants+0.616
 hamb+0.584
handed+0.555
 burger+0.549
Neg logit contributions
arl-0.672
imi-0.644
 how-0.618
icked-0.596
ig-0.575
Tokenurg
Token ID3686
Feature activation-1.83e+00

Pos logit contributions
 hamb+0.708
 distance+0.684
 restaurants+0.681
 burger+0.662
asts+0.611
Neg logit contributions
arl-0.840
imi-0.782
ixie-0.738
 tweet-0.734
ig-0.732
Tokeners
Token ID364
Feature activation+3.04e+00

Pos logit contributions
arl+0.110
igg+0.101
imi+0.099
icked+0.095
ig+0.095
Neg logit contributions
 distance-0.151
 restaurants-0.138
 hamb-0.128
Ice-0.124
 lobby-0.123
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
 distance+0.268
 restaurants+0.259
 hamb+0.251
aster+0.238
 burger+0.234
Neg logit contributions
arl-0.149
imi-0.139
 how-0.139
ig-0.134
icked-0.130
Token One
Token ID1881
Feature activation-5.83e-01

Pos logit contributions
 distance+0.021
 restaurants+0.019
 hamb+0.019
handed+0.019
 burger+0.018
Neg logit contributions
arl-0.015
imi-0.015
 how-0.013
 can-0.012
icks-0.012
Token day
Token ID1110
Feature activation-9.05e-01

Pos logit contributions
imi+0.032
ia+0.026
 how+0.026
arl+0.026
ig+0.024
Neg logit contributions
 restaurants-0.048
handed-0.048
 distance-0.047
 vendors-0.045
line-0.045
Token girl
Token ID2576
Feature activation+5.78e-01

Pos logit contributions
arl+0.209
imi+0.196
icked+0.181
ig+0.177
ixie+0.177
Neg logit contributions
 distance-0.195
 restaurants-0.183
 hamb-0.182
 burger-0.172
aster-0.160
Token decided
Token ID3066
Feature activation+2.04e+00

Pos logit contributions
 distance+0.039
 hamb+0.038
 restaurants+0.037
 burger+0.036
aster+0.033
Neg logit contributions
arl-0.045
imi-0.040
ig-0.037
ixie-0.036
icked-0.036
Token to
Token ID284
Feature activation-3.15e-01

Pos logit contributions
 distance+0.174
 restaurants+0.167
 hamb+0.166
 burger+0.159
aster+0.150
Neg logit contributions
arl-0.116
imi-0.109
ig-0.096
ixie-0.096
icked-0.096
Token try
Token ID1949
Feature activation-2.38e+00

Pos logit contributions
arl+0.022
imi+0.021
ig+0.019
ixie+0.019
icked+0.019
Neg logit contributions
 distance-0.023
 restaurants-0.022
 hamb-0.022
 burger-0.020
aster-0.019
Token a
Token ID257
Feature activation-2.17e-01

Pos logit contributions
 how+0.127
imi+0.125
arl+0.121
ig+0.115
ixie+0.107
Neg logit contributions
 distance-0.200
 hamb-0.182
 restaurants-0.181
 burger-0.174
Ice-0.173
Token hamb
Token ID41478
Feature activation+1.10e+01

Pos logit contributions
arl+0.013
imi+0.012
ig+0.012
 tweet+0.012
anda+0.011
Neg logit contributions
 distance-0.017
 restaurants-0.016
handed-0.015
 hamb-0.015
dogs-0.015
Tokenurger
Token ID32650
Feature activation+4.67e+00

Pos logit contributions
 hamb+0.673
 distance+0.667
 restaurants+0.652
 burger+0.626
asts+0.581
Neg logit contributions
arl-0.869
imi-0.821
ixie-0.762
ig-0.758
uttered-0.752
Token and
Token ID290
Feature activation+1.73e-01

Pos logit contributions
 distance+0.410
 restaurants+0.389
 hamb+0.377
 burger+0.358
handed+0.356
Neg logit contributions
arl-0.239
imi-0.223
 how-0.220
icked-0.210
ig-0.205
Token some
Token ID617
Feature activation+2.13e+00

Pos logit contributions
 distance+0.012
 restaurants+0.011
handed+0.010
Ice+0.010
 hamb+0.010
Neg logit contributions
 how-0.012
imi-0.011
 eight-0.011
arl-0.011
 knows-0.010
Token fries
Token ID31757
Feature activation+9.94e+00

Pos logit contributions
 distance+0.156
 restaurants+0.143
handed+0.142
line+0.133
Ice+0.129
Neg logit contributions
 how-0.130
arl-0.124
 knows-0.123
 eyes-0.123
 can-0.121
Token.
Token ID13
Feature activation+7.50e-02

Pos logit contributions
 distance+0.867
 restaurants+0.814
 hamb+0.783
 burger+0.744
handed+0.743
Neg logit contributions
arl-0.534
imi-0.496
icked-0.488
 how-0.481
ig-0.448
Token cooked
Token ID15847
Feature activation+3.70e+00

Pos logit contributions
imi+0.017
 how+0.016
arl+0.016
icked+0.016
 knows+0.016
Neg logit contributions
 distance-0.024
 restaurants-0.022
handed-0.020
 hamb-0.020
aster-0.020
Token some
Token ID617
Feature activation+2.05e+00

Pos logit contributions
 distance+0.342
 restaurants+0.296
handed+0.295
line+0.277
 lobby+0.276
Neg logit contributions
 how-0.201
imi-0.171
arl-0.170
icked-0.162
 my-0.160
Token hot
Token ID3024
Feature activation+3.88e+00

Pos logit contributions
 distance+0.145
handed+0.129
line+0.121
 restaurants+0.120
glass+0.111
Neg logit contributions
imi-0.145
arl-0.137
 how-0.137
 wand-0.124
 eyes-0.124
Token dogs
Token ID6844
Feature activation+4.38e+00

Pos logit contributions
 distance+0.253
handed+0.227
 restaurants+0.209
wreck+0.202
chard+0.202
Neg logit contributions
imi-0.294
arl-0.285
 how-0.278
 eyes-0.256
igg-0.255
Token and
Token ID290
Feature activation+3.26e-01

Pos logit contributions
 distance+0.391
 restaurants+0.368
 hamb+0.354
handed+0.343
 burger+0.339
Neg logit contributions
arl-0.212
imi-0.201
 how-0.200
icked-0.186
ig-0.180
Token hamb
Token ID41478
Feature activation+1.10e+01

Pos logit contributions
 distance+0.025
handed+0.022
 restaurants+0.021
asts+0.020
line+0.020
Neg logit contributions
 how-0.021
imi-0.019
icked-0.018
 knows-0.018
 eight-0.017
Tokenurg
Token ID3686
Feature activation-1.88e+00

Pos logit contributions
 hamb+0.891
 restaurants+0.860
 distance+0.856
 burger+0.842
asts+0.797
Neg logit contributions
arl-0.648
imi-0.599
ixie-0.549
ig-0.547
 tweet-0.542
Tokeners
Token ID364
Feature activation+2.99e+00

Pos logit contributions
arl+0.138
imi+0.127
icked+0.126
igg+0.124
ig+0.120
Neg logit contributions
 distance-0.129
 restaurants-0.121
 hamb-0.114
 burger-0.110
Ice-0.104
Token on
Token ID319
Feature activation+1.96e+00

Pos logit contributions
 distance+0.257
 restaurants+0.244
 hamb+0.229
 burger+0.218
Ice+0.218
Neg logit contributions
arl-0.159
imi-0.144
 how-0.143
icked-0.141
icks-0.135
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
 distance+0.178
 restaurants+0.168
 hamb+0.163
handed+0.161
 burger+0.159
Neg logit contributions
arl-0.087
imi-0.085
 how-0.082
 eight-0.075
 tweet-0.067
Token grill
Token ID29901
Feature activation+6.24e+00

Pos logit contributions
arl+0.012
imi+0.011
 me+0.011
 eight+0.011
 heads+0.010
Neg logit contributions
 restaurants-0.009
 distance-0.009
dream-0.008
handed-0.008
David-0.008
Token that
Token ID326
Feature activation-5.83e+00

Pos logit contributions
arl+0.033
imi+0.031
ig+0.028
icked+0.028
ixie+0.028
Neg logit contributions
 distance-0.035
 restaurants-0.033
 hamb-0.033
 burger-0.031
handed-0.029
Token,
Token ID11
Feature activation-6.63e-01

Pos logit contributions
arl+0.375
imi+0.344
icked+0.323
ixie+0.316
anda+0.313
Neg logit contributions
 distance-0.469
 restaurants-0.446
 hamb-0.441
 burger-0.420
aster-0.398
Token Lily
Token ID20037
Feature activation-1.22e+00

Pos logit contributions
imi+0.029
arl+0.029
 how+0.026
ig+0.025
ixie+0.025
Neg logit contributions
 distance-0.062
 hamb-0.059
 restaurants-0.059
 burger-0.057
asts-0.056
Token enjoyed
Token ID8359
Feature activation+3.44e+00

Pos logit contributions
arl+0.081
imi+0.079
ig+0.073
ixie+0.072
icked+0.072
Neg logit contributions
 distance-0.088
 restaurants-0.084
 hamb-0.083
 burger-0.078
aster-0.074
Token her
Token ID607
Feature activation-1.71e+00

Pos logit contributions
 distance+0.289
 restaurants+0.265
 hamb+0.263
 burger+0.254
handed+0.254
Neg logit contributions
arl-0.191
 how-0.181
imi-0.180
ig-0.160
 tweet-0.158
Token hamb
Token ID41478
Feature activation+1.10e+01

Pos logit contributions
arl+0.132
imi+0.128
anda+0.118
ig+0.118
 how+0.116
Neg logit contributions
 distance-0.107
 restaurants-0.093
Ice-0.089
handed-0.087
line-0.086
Tokenurger
Token ID32650
Feature activation+4.50e+00

Pos logit contributions
 distance+0.673
 restaurants+0.630
 hamb+0.623
 burger+0.582
aster+0.541
Neg logit contributions
arl-0.889
imi-0.848
ig-0.783
icked-0.778
ixie-0.776
Token without
Token ID1231
Feature activation+3.82e+00

Pos logit contributions
 distance+0.394
 restaurants+0.382
 hamb+0.374
 burger+0.355
aster+0.341
Neg logit contributions
arl-0.225
imi-0.218
ig-0.207
 how-0.206
icked-0.198
Token pick
Token ID2298
Feature activation+6.24e-01

Pos logit contributions
 distance+0.285
 restaurants+0.265
 hamb+0.251
 burger+0.245
handed+0.245
Neg logit contributions
arl-0.245
imi-0.234
 how-0.229
icked-0.218
ra-0.212
Tokenles
Token ID829
Feature activation+4.87e-02

Pos logit contributions
 distance+0.054
 restaurants+0.051
 hamb+0.050
 burger+0.048
asts+0.046
Neg logit contributions
arl-0.033
imi-0.032
 how-0.029
icked-0.029
ig-0.029
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
 distance+0.004
 restaurants+0.004
 hamb+0.004
 burger+0.004
handed+0.003
Neg logit contributions
arl-0.003
imi-0.003
ig-0.002
 how-0.002
icked-0.002
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 restaurants+0.030
 distance+0.030
 hamb+0.028
 burger+0.027
handed+0.026
Neg logit contributions
arl-0.015
icked-0.014
 how-0.014
imi-0.014
uttered-0.014
Token It
Token ID632
Feature activation+1.17e-01

Pos logit contributions
 distance+0.009
 restaurants+0.009
 hamb+0.009
 burger+0.008
aster+0.008
Neg logit contributions
arl-0.009
imi-0.008
ixie-0.007
ig-0.007
icked-0.007
Token was
Token ID373
Feature activation+6.02e-02

Pos logit contributions
 distance+0.009
 restaurants+0.009
 hamb+0.009
 burger+0.008
aster+0.008
Neg logit contributions
arl-0.007
imi-0.007
ig-0.006
icked-0.006
ixie-0.006
Token the
Token ID262
Feature activation-1.32e-01

Pos logit contributions
 distance+0.004
 restaurants+0.004
 hamb+0.004
 burger+0.004
handed+0.004
Neg logit contributions
arl-0.004
imi-0.004
 how-0.004
icked-0.003
ixie-0.003
Token best
Token ID1266
Feature activation+1.20e+00

Pos logit contributions
arl+0.007
imi+0.006
 tweet+0.006
 eight+0.006
 wand+0.006
Neg logit contributions
 restaurants-0.011
 distance-0.010
aster-0.010
handed-0.010
 hamb-0.010
Token hamb
Token ID41478
Feature activation+1.09e+01

Pos logit contributions
 distance+0.070
chard+0.068
 restaurants+0.065
handed+0.065
line+0.064
Neg logit contributions
imi-0.088
 how-0.085
arl-0.084
ia-0.080
 is-0.079
Tokenurger
Token ID32650
Feature activation+4.57e+00

Pos logit contributions
 distance+0.731
 restaurants+0.691
 hamb+0.687
 burger+0.646
aster+0.603
Neg logit contributions
arl-0.825
imi-0.784
ig-0.718
ixie-0.714
icked-0.714
Token he
Token ID339
Feature activation+2.23e+00

Pos logit contributions
 distance+0.395
 restaurants+0.392
 hamb+0.363
 burger+0.344
handed+0.343
Neg logit contributions
arl-0.227
imi-0.221
icked-0.212
 how-0.210
ig-0.210
Token had
Token ID550
Feature activation+7.42e-01

Pos logit contributions
 distance+0.187
 restaurants+0.185
 hamb+0.169
handed+0.163
 burger+0.162
Neg logit contributions
arl-0.115
imi-0.112
 how-0.107
 knows-0.106
ra-0.106
Token ever
Token ID1683
Feature activation-7.96e-01

Pos logit contributions
 distance+0.070
 restaurants+0.067
 hamb+0.065
 burger+0.063
dogs+0.062
Neg logit contributions
arl-0.029
icked-0.029
ra-0.029
 how-0.029
imi-0.027
Token had
Token ID550
Feature activation+7.37e-01

Pos logit contributions
 how+0.063
 knows+0.062
icked+0.061
 eight+0.059
 tweet+0.058
Neg logit contributions
 distance-0.040
 restaurants-0.040
 lobby-0.037
 stret-0.036
Ice-0.035
Token"
Token ID1
Feature activation-3.09e+00

Pos logit contributions
arl+0.002
imi+0.001
ixie+0.001
 how+0.001
anda+0.001
Neg logit contributions
 distance-0.007
 hamb-0.007
 restaurants-0.007
 burger-0.007
asts-0.007
TokenI
Token ID40
Feature activation+1.90e+00

Pos logit contributions
arl+0.179
 tweet+0.164
imi+0.161
 eight+0.156
 how+0.153
Neg logit contributions
 restaurants-0.221
 hamb-0.220
 distance-0.218
asts-0.212
 burger-0.211
Token want
Token ID765
Feature activation-5.02e-01

Pos logit contributions
 distance+0.151
 restaurants+0.144
 hamb+0.139
 burger+0.136
handed+0.131
Neg logit contributions
arl-0.114
imi-0.107
 knows-0.099
anda-0.099
 tweet-0.098
Token a
Token ID257
Feature activation-2.77e-02

Pos logit contributions
imi+0.028
arl+0.028
 how+0.027
icked+0.025
 tweet+0.024
Neg logit contributions
 distance-0.042
 restaurants-0.037
 hamb-0.037
handed-0.036
 burger-0.035
Token big
Token ID1263
Feature activation-7.81e-01

Pos logit contributions
arl+0.002
imi+0.002
 tweet+0.002
anda+0.002
ixie+0.002
Neg logit contributions
 distance-0.002
 restaurants-0.002
handed-0.002
 hamb-0.002
asts-0.002
Token burger
Token ID26593
Feature activation+1.09e+01

Pos logit contributions
arl+0.054
imi+0.052
 tweet+0.051
ig+0.049
anda+0.048
Neg logit contributions
 distance-0.052
 restaurants-0.049
handed-0.048
 hamb-0.047
Ice-0.046
Token and
Token ID290
Feature activation+4.13e-01

Pos logit contributions
 distance+1.048
 restaurants+0.985
 hamb+0.948
handed+0.926
aster+0.900
Neg logit contributions
arl-0.481
imi-0.454
 how-0.417
icked-0.403
anda-0.399
Token fries
Token ID31757
Feature activation+1.01e+01

Pos logit contributions
 distance+0.025
handed+0.021
 restaurants+0.021
line+0.018
 hamb+0.018
Neg logit contributions
 how-0.032
arl-0.032
imi-0.032
 eight-0.031
 tweet-0.031
Token and
Token ID290
Feature activation+4.29e-01

Pos logit contributions
 distance+0.885
 restaurants+0.818
 hamb+0.795
handed+0.757
aster+0.742
Neg logit contributions
arl-0.551
imi-0.510
icked-0.474
 how-0.473
anda-0.456
Token k
Token ID479
Feature activation-6.25e+00

Pos logit contributions
 distance+0.027
handed+0.023
 restaurants+0.022
ding+0.020
line+0.020
Neg logit contributions
 how-0.033
arl-0.033
imi-0.032
 eight-0.031
 tweet-0.031
Tokenetchup
Token ID47132
Feature activation+1.88e+00

Pos logit contributions
arl+0.434
imi+0.397
anda+0.380
icked+0.379
ig+0.373
Neg logit contributions
 distance-0.468
 restaurants-0.429
 hamb-0.414
 burger-0.378
handed-0.375
Token him
Token ID683
Feature activation+1.84e+00

Pos logit contributions
 distance+0.193
Ice+0.159
 restaurants+0.159
handed+0.159
 hamb+0.156
Neg logit contributions
 how-0.121
imi-0.114
arl-0.111
 me-0.108
 my-0.102
Token and
Token ID290
Feature activation+3.50e-01

Pos logit contributions
 distance+0.144
 restaurants+0.133
 hamb+0.131
 burger+0.124
asts+0.119
Neg logit contributions
arl-0.113
imi-0.109
 how-0.103
icked-0.100
ig-0.098
Token he
Token ID339
Feature activation+2.34e+00

Pos logit contributions
 distance+0.027
 restaurants+0.024
handed+0.023
asts+0.023
 hamb+0.023
Neg logit contributions
 how-0.021
imi-0.020
arl-0.020
icked-0.019
 knows-0.019
Token ordered
Token ID6149
Feature activation+5.21e+00

Pos logit contributions
 distance+0.185
 restaurants+0.172
 hamb+0.160
handed+0.157
 burger+0.154
Neg logit contributions
imi-0.135
 how-0.134
icked-0.132
arl-0.130
 knows-0.126
Token a
Token ID257
Feature activation-1.50e-01

Pos logit contributions
 distance+0.425
 restaurants+0.371
handed+0.349
 hamb+0.348
Ice+0.342
Neg logit contributions
 how-0.310
arl-0.300
imi-0.282
icked-0.272
 eyes-0.262
Token burger
Token ID26593
Feature activation+1.09e+01

Pos logit contributions
arl+0.008
 wand+0.007
 can+0.007
 eight+0.007
 how+0.006
Neg logit contributions
 distance-0.012
 restaurants-0.012
handed-0.011
line-0.011
rob-0.011
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
 distance+1.065
 restaurants+1.013
handed+0.947
 hamb+0.929
aster+0.908
Neg logit contributions
arl-0.454
 how-0.416
icked-0.403
imi-0.402
icks-0.370
Token
Token ID198
Feature activation-4.28e-02

Pos logit contributions
 distance+0.012
 restaurants+0.011
 hamb+0.011
 burger+0.010
handed+0.010
Neg logit contributions
arl-0.007
 how-0.006
imi-0.006
 eight-0.006
icked-0.006
Token
Token ID198
Feature activation-5.03e-02

Pos logit contributions
arl+0.002
imi+0.002
ixie+0.002
ia+0.002
 how+0.002
Neg logit contributions
 hamb-0.004
 distance-0.004
 restaurants-0.004
 burger-0.004
dogs-0.004
TokenWhile
Token ID3633
Feature activation+3.25e+00

Pos logit contributions
arl+0.003
imi+0.003
 eight+0.003
 how+0.003
roud+0.003
Neg logit contributions
 hamb-0.004
 distance-0.004
 restaurants-0.004
 burger-0.004
dogs-0.003
Token he
Token ID339
Feature activation+2.32e+00

Pos logit contributions
 distance+0.257
handed+0.230
 hamb+0.219
 restaurants+0.217
des+0.216
Neg logit contributions
 how-0.181
 eight-0.181
imi-0.180
arl-0.178
icked-0.168
Token Tom
Token ID4186
Feature activation+1.80e+00

Pos logit contributions
 distance+0.182
 restaurants+0.173
 hamb+0.171
 burger+0.163
handed+0.161
Neg logit contributions
arl-0.126
imi-0.123
 how-0.115
ig-0.111
icked-0.111
Token did
Token ID750
Feature activation-2.88e+00

Pos logit contributions
 distance+0.148
 restaurants+0.142
 hamb+0.138
 burger+0.131
handed+0.127
Neg logit contributions
arl-0.102
imi-0.102
ig-0.092
icked-0.092
anda-0.091
Token not
Token ID407
Feature activation-1.62e+00

Pos logit contributions
arl+0.134
imi+0.124
ig+0.106
icked+0.106
ixie+0.104
Neg logit contributions
 distance-0.277
 restaurants-0.265
 hamb-0.264
 burger-0.253
handed-0.242
Token like
Token ID588
Feature activation+2.53e+00

Pos logit contributions
arl+0.091
 bud+0.082
 how+0.078
imi+0.074
icks+0.074
Neg logit contributions
 distance-0.135
 hamb-0.131
 restaurants-0.129
 burger-0.125
dogs-0.124
Token his
Token ID465
Feature activation+1.13e+00

Pos logit contributions
 distance+0.198
 restaurants+0.184
 hamb+0.183
 burger+0.173
handed+0.169
Neg logit contributions
arl-0.159
imi-0.152
 how-0.138
icked-0.135
ig-0.134
Token burger
Token ID26593
Feature activation+1.09e+01

Pos logit contributions
 distance+0.071
 restaurants+0.061
handed+0.060
 hamb+0.059
Ice+0.058
Neg logit contributions
imi-0.084
arl-0.084
 how-0.080
 eyes-0.079
 fears-0.078
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
 distance+0.938
 restaurants+0.899
 hamb+0.884
 burger+0.833
handed+0.827
Neg logit contributions
arl-0.543
imi-0.531
 how-0.508
icked-0.483
ig-0.471
Token It
Token ID632
Feature activation+1.47e-01

Pos logit contributions
 distance+0.015
 restaurants+0.014
 hamb+0.014
 burger+0.013
asts+0.013
Neg logit contributions
arl-0.012
imi-0.010
 how-0.009
ixie-0.009
icked-0.009
Token was
Token ID373
Feature activation-2.30e-02

Pos logit contributions
 distance+0.012
 restaurants+0.011
 hamb+0.011
 burger+0.010
handed+0.010
Neg logit contributions
arl-0.009
imi-0.009
icked-0.008
ig-0.008
ixie-0.008
Token too
Token ID1165
Feature activation-2.83e+00

Pos logit contributions
arl+0.001
imi+0.001
ig+0.001
icked+0.001
ixie+0.001
Neg logit contributions
 distance-0.002
 restaurants-0.002
 hamb-0.002
 burger-0.002
handed-0.002
Token big
Token ID1263
Feature activation-8.80e-01

Pos logit contributions
arl+0.192
imi+0.178
icked+0.161
igg+0.160
anda+0.158
Neg logit contributions
 distance-0.213
 restaurants-0.204
 hamb-0.197
 burger-0.185
handed-0.183
Token happy
Token ID3772
Feature activation+9.95e-01

Pos logit contributions
 distance+0.004
 restaurants+0.004
 hamb+0.004
 burger+0.004
asts+0.004
Neg logit contributions
 how-0.002
arl-0.002
imi-0.002
icked-0.002
icks-0.002
Token.
Token ID13
Feature activation+2.10e-01

Pos logit contributions
 distance+0.085
 restaurants+0.082
 hamb+0.081
 burger+0.077
aster+0.074
Neg logit contributions
arl-0.054
imi-0.052
 how-0.048
ig-0.046
icked-0.045
Token He
Token ID679
Feature activation+5.13e-01

Pos logit contributions
 distance+0.016
 hamb+0.015
 restaurants+0.015
 burger+0.014
asts+0.014
Neg logit contributions
arl-0.015
imi-0.013
ixie-0.012
 how-0.012
icked-0.012
Token ate
Token ID15063
Feature activation+1.69e+00

Pos logit contributions
 distance+0.034
 restaurants+0.032
 hamb+0.032
 burger+0.030
aster+0.028
Neg logit contributions
arl-0.039
imi-0.037
ig-0.034
ixie-0.034
icked-0.034
Token his
Token ID465
Feature activation+1.22e+00

Pos logit contributions
 distance+0.149
 restaurants+0.141
handed+0.136
asts+0.133
Ice+0.132
Neg logit contributions
 how-0.085
arl-0.083
imi-0.074
icked-0.072
icks-0.069
Token burger
Token ID26593
Feature activation+1.09e+01

Pos logit contributions
 distance+0.073
 restaurants+0.064
handed+0.061
line+0.061
Ice+0.058
Neg logit contributions
arl-0.092
imi-0.090
 how-0.086
 eyes-0.086
igg-0.085
Token and
Token ID290
Feature activation+3.47e-01

Pos logit contributions
 restaurants+1.027
 distance+1.024
 hamb+0.964
handed+0.934
Ice+0.928
Neg logit contributions
 how-0.425
arl-0.413
imi-0.401
icked-0.382
ig-0.362
Token fries
Token ID31757
Feature activation+1.00e+01

Pos logit contributions
 distance+0.025
handed+0.024
 restaurants+0.023
Ice+0.022
asts+0.022
Neg logit contributions
 how-0.022
imi-0.021
icked-0.020
 knows-0.020
 eyes-0.019
Token and
Token ID290
Feature activation+3.51e-01

Pos logit contributions
 distance+0.975
 restaurants+0.938
handed+0.867
 hamb+0.863
Ice+0.853
Neg logit contributions
 how-0.413
icked-0.411
arl-0.383
icks-0.351
imi-0.341
Token k
Token ID479
Feature activation-6.27e+00

Pos logit contributions
 distance+0.026
handed+0.024
 restaurants+0.023
Ice+0.022
line+0.022
Neg logit contributions
 how-0.023
icked-0.021
imi-0.021
 knows-0.020
 eyes-0.020
Tokenetchup
Token ID47132
Feature activation+1.85e+00

Pos logit contributions
arl+0.438
imi+0.411
icked+0.386
ig+0.385
anda+0.383
Neg logit contributions
 distance-0.458
 restaurants-0.435
 hamb-0.418
 burger-0.389
aster-0.377
Token "
Token ID366
Feature activation-8.97e-01

Pos logit contributions
arl+0.031
imi+0.028
icked+0.024
 how+0.024
ig+0.024
Neg logit contributions
 distance-0.054
 restaurants-0.052
 hamb-0.051
 burger-0.049
handed-0.047
TokenDid
Token ID11633
Feature activation+4.97e+00

Pos logit contributions
arl+0.058
imi+0.058
 how+0.055
 tweet+0.052
ra+0.050
Neg logit contributions
 restaurants-0.065
 distance-0.062
 hamb-0.061
 burger-0.059
asts-0.057
Token you
Token ID345
Feature activation-1.26e+00

Pos logit contributions
 restaurants+0.380
 hamb+0.361
 burger+0.349
 distance+0.341
aster+0.337
Neg logit contributions
arl-0.351
uttered-0.294
igg-0.286
imi-0.267
ixie-0.266
Token like
Token ID588
Feature activation+2.38e+00

Pos logit contributions
arl+0.070
 how+0.067
imi+0.064
 tweet+0.063
icks+0.062
Neg logit contributions
 distance-0.102
 restaurants-0.095
 hamb-0.091
dogs-0.090
handed-0.089
Token your
Token ID534
Feature activation-4.11e-01

Pos logit contributions
 distance+0.206
 restaurants+0.190
 hamb+0.187
handed+0.179
 burger+0.176
Neg logit contributions
arl-0.128
imi-0.125
 how-0.117
 tweet-0.109
icked-0.107
Token burger
Token ID26593
Feature activation+1.09e+01

Pos logit contributions
arl+0.031
imi+0.030
 how+0.029
anda+0.028
 eyes+0.028
Neg logit contributions
 distance-0.025
 restaurants-0.022
aster-0.021
handed-0.021
Ice-0.021
Token,
Token ID11
Feature activation-5.72e-01

Pos logit contributions
 distance+0.958
 restaurants+0.896
 hamb+0.894
 burger+0.840
handed+0.835
Neg logit contributions
arl-0.569
imi-0.552
 how-0.493
ig-0.478
icked-0.477
Token sweet
Token ID6029
Feature activation-2.08e+00

Pos logit contributions
arl+0.035
imi+0.032
ig+0.030
icked+0.029
uttered+0.029
Neg logit contributions
 distance-0.046
 hamb-0.044
 restaurants-0.044
 burger-0.042
aster-0.040
Tokenie
Token ID494
Feature activation-2.24e+00

Pos logit contributions
arl+0.094
imi+0.089
ig+0.074
anda+0.073
ixie+0.073
Neg logit contributions
 distance-0.203
 restaurants-0.192
 hamb-0.191
 burger-0.183
handed-0.178
Token?"
Token ID1701
Feature activation+4.06e+00

Pos logit contributions
arl+0.134
imi+0.130
ig+0.116
anda+0.115
 how+0.113
Neg logit contributions
 distance-0.183
 restaurants-0.173
 hamb-0.172
 burger-0.162
handed-0.156
Token Lily
Token ID20037
Feature activation-1.12e+00

Pos logit contributions
 distance+0.357
 restaurants+0.342
 hamb+0.338
 burger+0.323
handed+0.307
Neg logit contributions
arl-0.221
imi-0.210
ixie-0.188
 how-0.186
icked-0.186
Token they
Token ID484
Feature activation+6.21e-01

Pos logit contributions
imi+0.219
arl+0.213
 how+0.206
icked+0.198
anda+0.191
Neg logit contributions
 distance-0.332
 restaurants-0.301
 hamb-0.295
handed-0.291
 burger-0.278
Token saw
Token ID2497
Feature activation+3.73e+00

Pos logit contributions
 distance+0.056
 restaurants+0.053
 hamb+0.052
 burger+0.051
handed+0.049
Neg logit contributions
arl-0.030
imi-0.029
icked-0.026
 knows-0.026
 how-0.025
Token.
Token ID13
Feature activation+2.88e-01

Pos logit contributions
 distance+0.314
 hamb+0.305
 restaurants+0.299
 burger+0.287
aster+0.274
Neg logit contributions
arl-0.204
imi-0.187
 how-0.185
icked-0.183
ig-0.171
Token Tim
Token ID5045
Feature activation+3.54e-01

Pos logit contributions
 distance+0.028
 restaurants+0.026
 hamb+0.026
 burger+0.025
handed+0.024
Neg logit contributions
arl-0.014
imi-0.012
icked-0.011
uttered-0.011
 how-0.011
Token's
Token ID338
Feature activation+3.49e+00

Pos logit contributions
 distance+0.027
 restaurants+0.026
 hamb+0.026
 burger+0.024
aster+0.023
Neg logit contributions
arl-0.023
imi-0.022
ig-0.020
ixie-0.020
icked-0.020
Token burger
Token ID26593
Feature activation+1.09e+01

Pos logit contributions
 distance+0.294
 restaurants+0.276
 hamb+0.270
handed+0.262
Ice+0.258
Neg logit contributions
arl-0.192
imi-0.181
 how-0.165
 eyes-0.165
 wand-0.162
Token was
Token ID373
Feature activation+3.42e-02

Pos logit contributions
 distance+0.905
 restaurants+0.869
 hamb+0.854
 burger+0.807
handed+0.789
Neg logit contributions
arl-0.618
imi-0.590
icked-0.536
ig-0.523
anda-0.516
Token burnt
Token ID22999
Feature activation+4.33e+00

Pos logit contributions
 distance+0.003
 restaurants+0.003
handed+0.002
 hamb+0.002
Ice+0.002
Neg logit contributions
arl-0.002
imi-0.002
icked-0.002
 how-0.002
icks-0.002
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
 distance+0.375
 restaurants+0.357
 hamb+0.352
 burger+0.334
handed+0.331
Neg logit contributions
arl-0.241
imi-0.224
 how-0.207
icked-0.201
ig-0.197
Token dry
Token ID5894
Feature activation+1.54e+00

Pos logit contributions
 distance+0.029
 hamb+0.028
 restaurants+0.028
 burger+0.027
aster+0.025
Neg logit contributions
arl-0.033
imi-0.031
ig-0.029
ixie-0.029
anda-0.028
Token.
Token ID13
Feature activation+2.98e-01

Pos logit contributions
 distance+0.132
 restaurants+0.128
 hamb+0.126
aster+0.119
 burger+0.118
Neg logit contributions
arl-0.083
imi-0.074
 how-0.072
icked-0.071
ig-0.066
Token disappointment
Token ID18641
Feature activation-2.02e+00

Pos logit contributions
arl+0.046
imi+0.043
ig+0.039
ixie+0.038
icked+0.038
Neg logit contributions
 distance-0.054
 restaurants-0.052
 hamb-0.052
 burger-0.049
handed-0.046
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
arl+0.134
imi+0.126
ig+0.115
icked+0.114
ixie+0.114
Neg logit contributions
 distance-0.152
 restaurants-0.146
 hamb-0.145
 burger-0.137
aster-0.129
Token
Token ID198
Feature activation+2.34e-02

Pos logit contributions
 distance+0.017
 restaurants+0.016
 hamb+0.016
 burger+0.015
aster+0.014
Neg logit contributions
arl-0.019
imi-0.018
ig-0.017
ixie-0.017
icked-0.017
Token
Token ID198
Feature activation+2.00e-02

Pos logit contributions
 distance+0.002
 restaurants+0.001
 hamb+0.001
 burger+0.001
aster+0.001
Neg logit contributions
arl-0.002
imi-0.002
ig-0.002
icked-0.002
anda-0.002
TokenM
Token ID44
Feature activation-5.30e-01

Pos logit contributions
 distance+0.001
 restaurants+0.001
 hamb+0.001
 burger+0.001
aster+0.001
Neg logit contributions
arl-0.002
imi-0.001
icked-0.001
ig-0.001
uttered-0.001
Tokenarl
Token ID7063
Feature activation-1.26e+01

Pos logit contributions
arl+0.031
 tweet+0.028
imi+0.027
 how+0.026
icked+0.026
Neg logit contributions
 hamb-0.041
 distance-0.041
 restaurants-0.039
asts-0.039
aster-0.039
Tokenon
Token ID261
Feature activation+2.29e+00

Pos logit contributions
arl+0.764
imi+0.750
 tweet+0.669
anda+0.660
 how+0.646
Neg logit contributions
 distance-1.006
 restaurants-0.932
 hamb-0.931
 burger-0.912
handed-0.869
Token's
Token ID338
Feature activation+3.51e+00

Pos logit contributions
 distance+0.150
 hamb+0.145
 burger+0.134
aster+0.129
 restaurants+0.128
Neg logit contributions
arl-0.188
imi-0.155
icks-0.143
igg-0.140
ixie-0.140
Token mum
Token ID25682
Feature activation+4.02e-01

Pos logit contributions
 distance+0.274
 hamb+0.269
 burger+0.261
 restaurants+0.244
 french+0.243
Neg logit contributions
arl-0.237
imi-0.189
 tweet-0.183
icked-0.175
icks-0.175
Token hugged
Token ID41130
Feature activation-1.72e+00

Pos logit contributions
 distance+0.027
 hamb+0.026
 restaurants+0.025
 burger+0.024
aster+0.023
Neg logit contributions
arl-0.031
imi-0.028
ig-0.025
igg-0.025
uttered-0.025
Token him
Token ID683
Feature activation+1.72e+00

Pos logit contributions
arl+0.122
imi+0.115
anda+0.107
ig+0.107
ixie+0.107
Neg logit contributions
 distance-0.124
 hamb-0.116
 restaurants-0.113
 burger-0.109
aster-0.103
Token's
Token ID338
Feature activation+3.57e+00

Pos logit contributions
arl+0.173
ir+0.142
uttered+0.140
igg+0.135
ubby+0.134
Neg logit contributions
 restaurants-0.066
 weighed-0.063
 hamb-0.060
 burger-0.050
 distance-0.048
Token okay
Token ID8788
Feature activation+3.33e+00

Pos logit contributions
 distance+0.256
 restaurants+0.244
 hamb+0.243
 burger+0.229
aster+0.216
Neg logit contributions
arl-0.251
imi-0.236
uttered-0.223
ixie-0.221
ig-0.221
Token,
Token ID11
Feature activation-3.87e-01

Pos logit contributions
 restaurants+0.220
 hamb+0.219
 distance+0.219
 burger+0.208
aster+0.192
Neg logit contributions
arl-0.245
uttered-0.222
igg-0.217
ixie-0.216
icked-0.211
Token my
Token ID616
Feature activation-3.48e+00

Pos logit contributions
arl+0.026
uttered+0.025
imi+0.024
icked+0.024
ixie+0.023
Neg logit contributions
 distance-0.028
 restaurants-0.028
 hamb-0.028
 burger-0.027
aster-0.024
Token d
Token ID288
Feature activation-3.32e+00

Pos logit contributions
arl+0.195
imi+0.184
icked+0.178
uttered+0.171
ixie+0.166
Neg logit contributions
 distance-0.286
 hamb-0.285
 restaurants-0.278
 burger-0.273
aster-0.253
Tokenarl
Token ID7063
Feature activation-1.25e+01

Pos logit contributions
 how+0.038
 tweet+0.035
 knows+0.033
imi+0.032
ixie+0.021
Neg logit contributions
 distance-0.382
aster-0.381
asts-0.380
 hamb-0.371
 restaurants-0.368
Tokenings
Token ID654
Feature activation-4.11e-01

Pos logit contributions
imi+0.787
 tweet+0.738
 how+0.736
ur+0.716
ixie+0.713
Neg logit contributions
 distance-0.930
 restaurants-0.856
 burger-0.850
 hamb-0.814
asts-0.780
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
arl+0.029
imi+0.028
ig+0.026
ixie+0.026
uttered+0.026
Neg logit contributions
 distance-0.029
 restaurants-0.027
 hamb-0.027
 burger-0.025
aster-0.024
Token I
Token ID314
Feature activation-2.70e-01

Pos logit contributions
 distance+0.027
 restaurants+0.025
 hamb+0.025
 burger+0.024
handed+0.023
Neg logit contributions
arl-0.018
imi-0.016
icked-0.014
ig-0.014
ixie-0.014
Token forgive
Token ID20927
Feature activation-3.85e-01

Pos logit contributions
arl+0.014
imi+0.013
ig+0.011
icked+0.011
ixie+0.011
Neg logit contributions
 distance-0.024
 hamb-0.024
 restaurants-0.023
 burger-0.022
handed-0.021
Token you
Token ID345
Feature activation-1.29e+00

Pos logit contributions
arl+0.018
imi+0.016
uttered+0.015
ixie+0.015
ig+0.015
Neg logit contributions
 distance-0.036
 restaurants-0.036
 hamb-0.035
 burger-0.034
aster-0.032
Token to
Token ID284
Feature activation-4.16e-01

Pos logit contributions
 distance+0.092
 hamb+0.088
 restaurants+0.088
 burger+0.083
aster+0.079
Neg logit contributions
arl-0.069
imi-0.065
ig-0.058
ixie-0.058
icked-0.057
Token grow
Token ID1663
Feature activation-6.27e+00

Pos logit contributions
arl+0.029
imi+0.027
ig+0.025
 tweet+0.025
ixie+0.025
Neg logit contributions
 distance-0.030
 restaurants-0.028
 hamb-0.028
 burger-0.027
handed-0.025
Tokenl
Token ID75
Feature activation-2.48e+00

Pos logit contributions
arl+0.393
imi+0.368
ig+0.334
icked+0.333
ixie+0.331
Neg logit contributions
 distance-0.493
 restaurants-0.474
 hamb-0.472
 burger-0.449
aster-0.429
Token and
Token ID290
Feature activation+1.77e-01

Pos logit contributions
arl+0.164
imi+0.154
ig+0.140
icked+0.139
ixie+0.138
Neg logit contributions
 distance-0.189
 restaurants-0.180
 hamb-0.179
 burger-0.170
aster-0.160
Token sn
Token ID3013
Feature activation-5.33e-02

Pos logit contributions
 distance+0.013
 restaurants+0.012
 hamb+0.012
 burger+0.011
aster+0.011
Neg logit contributions
arl-0.013
imi-0.012
ig-0.011
ixie-0.011
icked-0.011
Tokenarl
Token ID7063
Feature activation-1.25e+01

Pos logit contributions
imi+0.003
arl+0.003
 eight+0.003
 tweet+0.003
 bud+0.003
Neg logit contributions
 distance-0.004
 hamb-0.004
handed-0.004
 burger-0.004
 restaurants-0.004
Token to
Token ID284
Feature activation-3.87e-01

Pos logit contributions
arl+0.705
imi+0.701
 tweet+0.633
ig+0.619
anda+0.611
Neg logit contributions
 distance-1.022
 restaurants-0.964
 hamb-0.958
 burger-0.927
handed-0.878
Token try
Token ID1949
Feature activation-2.40e+00

Pos logit contributions
arl+0.025
imi+0.023
ig+0.021
ixie+0.021
icked+0.021
Neg logit contributions
 distance-0.031
 restaurants-0.029
 hamb-0.029
 burger-0.027
aster-0.026
Token and
Token ID290
Feature activation+1.99e-01

Pos logit contributions
arl+0.136
imi+0.128
 how+0.114
ig+0.114
 tweet+0.113
Neg logit contributions
 distance-0.202
 restaurants-0.193
 hamb-0.192
 burger-0.183
handed-0.176
Token scare
Token ID19437
Feature activation-1.40e+00

Pos logit contributions
 distance+0.015
 restaurants+0.014
 hamb+0.014
 burger+0.014
dogs+0.013
Neg logit contributions
arl-0.013
imi-0.012
 tweet-0.011
icked-0.011
ig-0.011
Token her
Token ID607
Feature activation-1.58e+00

Pos logit contributions
arl+0.092
imi+0.087
ig+0.078
ixie+0.078
anda+0.077
Neg logit contributions
 distance-0.111
 restaurants-0.103
 hamb-0.102
 burger-0.096
aster-0.093
Token you
Token ID345
Feature activation-1.29e+00

Pos logit contributions
arl+0.072
imi+0.061
ig+0.059
uttered+0.059
icked+0.059
Neg logit contributions
 distance-0.129
 restaurants-0.128
 hamb-0.128
 burger-0.123
dogs-0.116
Token too
Token ID1165
Feature activation-2.88e+00

Pos logit contributions
arl+0.068
imi+0.063
ig+0.056
ixie+0.056
uttered+0.055
Neg logit contributions
 distance-0.115
 hamb-0.110
 restaurants-0.110
 burger-0.106
aster-0.100
Token,
Token ID11
Feature activation-5.17e-01

Pos logit contributions
arl+0.157
imi+0.147
ig+0.133
ixie+0.130
uttered+0.130
Neg logit contributions
 distance-0.251
 hamb-0.241
 restaurants-0.241
 burger-0.229
handed-0.215
Token my
Token ID616
Feature activation-3.38e+00

Pos logit contributions
arl+0.035
uttered+0.035
icked+0.033
ig+0.032
igg+0.032
Neg logit contributions
 hamb-0.035
 distance-0.035
 restaurants-0.035
 burger-0.034
 burgers-0.032
Token d
Token ID288
Feature activation-3.32e+00

Pos logit contributions
arl+0.206
imi+0.193
ig+0.174
icked+0.173
ixie+0.172
Neg logit contributions
 distance-0.274
 hamb-0.263
 restaurants-0.262
 burger-0.251
handed-0.235
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.042
 knows+0.041
imi+0.040
 tweet+0.037
ig+0.028
Neg logit contributions
 distance-0.376
 hamb-0.369
asts-0.369
aster-0.367
handed-0.365
Tokenings
Token ID654
Feature activation-4.07e-01

Pos logit contributions
imi+0.765
 tweet+0.709
 how+0.707
ixie+0.692
ur+0.690
Neg logit contributions
 distance-0.946
 restaurants-0.880
 burger-0.869
 hamb-0.837
asts-0.779
Token,"
Token ID553
Feature activation+2.98e+00

Pos logit contributions
arl+0.028
imi+0.026
ig+0.024
ixie+0.024
icked+0.024
Neg logit contributions
 distance-0.030
 restaurants-0.028
 hamb-0.028
 burger-0.027
aster-0.025
Token she
Token ID673
Feature activation-4.19e-02

Pos logit contributions
 distance+0.233
 restaurants+0.223
 hamb+0.223
 burger+0.212
aster+0.199
Neg logit contributions
arl-0.192
imi-0.179
ig-0.163
ixie-0.161
icked-0.161
Token said
Token ID531
Feature activation+3.99e-01

Pos logit contributions
arl+0.003
imi+0.003
ig+0.003
uttered+0.003
anda+0.003
Neg logit contributions
 distance-0.003
 hamb-0.003
 restaurants-0.003
 burger-0.003
aster-0.002
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
 distance+0.031
 restaurants+0.030
 hamb+0.029
 burger+0.027
handed+0.026
Neg logit contributions
arl-0.026
imi-0.025
icked-0.023
ig-0.022
ixie-0.022
Token let
Token ID1309
Feature activation-1.12e+00

Pos logit contributions
arl+0.132
imi+0.113
icks+0.100
ig+0.100
igg+0.098
Neg logit contributions
 distance-0.092
 hamb-0.091
 weighed-0.082
 burger-0.081
 restaurants-0.080
Token out
Token ID503
Feature activation+2.68e+00

Pos logit contributions
arl+0.076
imi+0.072
ig+0.065
ixie+0.065
icked+0.065
Neg logit contributions
 distance-0.084
 restaurants-0.080
 hamb-0.080
 burger-0.076
aster-0.071
Token a
Token ID257
Feature activation-1.95e-01

Pos logit contributions
 hamb+0.164
 restaurants+0.159
 distance+0.158
 burger+0.149
aster+0.141
Neg logit contributions
arl-0.224
imi-0.209
ixie-0.194
anda-0.192
ig-0.191
Token big
Token ID1263
Feature activation-9.52e-01

Pos logit contributions
arl+0.012
imi+0.011
icked+0.010
ig+0.010
ixie+0.010
Neg logit contributions
 distance-0.016
 hamb-0.015
 restaurants-0.015
 burger-0.015
aster-0.014
Token sn
Token ID3013
Feature activation+1.28e-02

Pos logit contributions
arl+0.068
imi+0.063
icked+0.060
uttered+0.060
anda+0.058
Neg logit contributions
 distance-0.070
 hamb-0.069
 burger-0.065
 restaurants-0.063
aster-0.060
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 distance+0.001
aster+0.001
handed+0.001
 hamb+0.001
 restaurants+0.001
Neg logit contributions
imi-0.001
arl-0.001
 how-0.001
 tweet-0.001
 eight-0.001
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
imi+0.732
arl+0.713
anda+0.665
 how+0.662
 tweet+0.655
Neg logit contributions
 distance-1.018
 hamb-0.906
 restaurants-0.899
 burger-0.885
asts-0.849
Token the
Token ID262
Feature activation-1.10e-01

Pos logit contributions
 distance+0.017
 restaurants+0.017
 hamb+0.016
 burger+0.016
handed+0.015
Neg logit contributions
arl-0.014
imi-0.013
 how-0.012
 tweet-0.012
icked-0.012
Token mouse
Token ID10211
Feature activation-1.55e+00

Pos logit contributions
arl+0.007
imi+0.006
ig+0.005
icked+0.005
ixie+0.005
Neg logit contributions
 distance-0.009
 hamb-0.009
 restaurants-0.009
 burger-0.008
handed-0.008
Token knew
Token ID2993
Feature activation-2.72e+00

Pos logit contributions
arl+0.128
imi+0.117
ixie+0.100
ig+0.100
igg+0.098
Neg logit contributions
 distance-0.108
 hamb-0.102
 restaurants-0.093
 burger-0.092
 weighed-0.089
Token he
Token ID339
Feature activation+2.28e+00

Pos logit contributions
arl+0.154
imi+0.139
ixie+0.122
ig+0.121
uttered+0.120
Neg logit contributions
 distance-0.237
 hamb-0.231
 restaurants-0.230
 burger-0.219
handed-0.208
Token you
Token ID345
Feature activation-1.28e+00

Pos logit contributions
 restaurants+0.331
 hamb+0.326
 distance+0.324
 burger+0.317
aster+0.296
Neg logit contributions
arl-0.282
uttered-0.231
imi-0.220
igg-0.217
ixie-0.212
Token go
Token ID467
Feature activation-6.06e-01

Pos logit contributions
arl+0.104
imi+0.101
uttered+0.099
ixie+0.098
uff+0.094
Neg logit contributions
 hamb-0.069
 restaurants-0.069
 distance-0.065
aster-0.062
 burger-0.061
Token,
Token ID11
Feature activation-4.54e-01

Pos logit contributions
arl+0.035
imi+0.034
uttered+0.031
icked+0.030
ixie+0.030
Neg logit contributions
 distance-0.052
 restaurants-0.048
 hamb-0.047
 burger-0.045
handed-0.044
Token my
Token ID616
Feature activation-3.41e+00

Pos logit contributions
uttered+0.030
arl+0.029
icked+0.028
ig+0.027
ixie+0.027
Neg logit contributions
 hamb-0.033
 distance-0.032
 restaurants-0.032
 burger-0.032
aster-0.030
Token d
Token ID288
Feature activation-3.30e+00

Pos logit contributions
arl+0.207
imi+0.197
icked+0.191
ig+0.184
uttered+0.183
Neg logit contributions
 distance-0.253
 hamb-0.253
 burger-0.244
 restaurants-0.238
 barbecue-0.224
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.047
 knows+0.045
 tweet+0.040
imi+0.039
 thinks+0.035
Neg logit contributions
aster-0.373
 distance-0.364
asts-0.354
 hamb-0.352
handed-0.351
Tokenings
Token ID654
Feature activation-3.57e-01

Pos logit contributions
imi+0.780
 tweet+0.749
 how+0.745
ur+0.731
anda+0.720
Neg logit contributions
 distance-0.911
 restaurants-0.824
 burger-0.817
 hamb-0.778
thank-0.737
Token,"
Token ID553
Feature activation+2.99e+00

Pos logit contributions
arl+0.023
imi+0.022
ig+0.020
uttered+0.020
ixie+0.020
Neg logit contributions
 distance-0.027
 hamb-0.026
 restaurants-0.026
 burger-0.025
aster-0.024
Token Mom
Token ID11254
Feature activation+3.32e-01

Pos logit contributions
 distance+0.258
 restaurants+0.245
 hamb+0.243
 burger+0.230
handed+0.227
Neg logit contributions
arl-0.168
imi-0.162
icked-0.140
 how-0.139
ixie-0.139
Token said
Token ID531
Feature activation+3.41e-01

Pos logit contributions
 distance+0.028
 restaurants+0.027
 hamb+0.026
 burger+0.024
asts+0.024
Neg logit contributions
arl-0.019
imi-0.019
icked-0.017
ixie-0.017
 how-0.017
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
 distance+0.031
 restaurants+0.030
 hamb+0.029
 burger+0.027
Ice+0.027
Neg logit contributions
arl-0.019
icked-0.017
 how-0.016
imi-0.016
icks-0.015
Token you
Token ID345
Feature activation-1.28e+00

Pos logit contributions
 restaurants+0.221
 hamb+0.197
aster+0.196
 distance+0.194
 burger+0.191
Neg logit contributions
arl-0.323
 wand-0.291
uttered-0.290
igg-0.290
ixie-0.288
Token okay
Token ID8788
Feature activation+3.25e+00

Pos logit contributions
arl+0.066
imi+0.062
icked+0.055
 tweet+0.053
ig+0.052
Neg logit contributions
 distance-0.118
 restaurants-0.112
 hamb-0.107
aster-0.107
asts-0.105
Token,
Token ID11
Feature activation-4.68e-01

Pos logit contributions
 restaurants+0.246
 hamb+0.239
 distance+0.234
 burger+0.229
aster+0.212
Neg logit contributions
arl-0.203
uttered-0.185
igg-0.181
icked-0.180
ixie-0.179
Token my
Token ID616
Feature activation-3.32e+00

Pos logit contributions
arl+0.029
uttered+0.027
imi+0.027
ig+0.027
icked+0.026
Neg logit contributions
 distance-0.035
 hamb-0.035
 restaurants-0.034
 burger-0.034
 burgers-0.031
Token d
Token ID288
Feature activation-3.33e+00

Pos logit contributions
arl+0.162
imi+0.144
icked+0.144
ig+0.140
icks+0.134
Neg logit contributions
 distance-0.295
 hamb-0.295
 burger-0.289
 restaurants-0.279
 burgers-0.263
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.062
 knows+0.053
 tweet+0.048
 teamwork+0.043
 thinks+0.043
Neg logit contributions
 distance-0.365
aster-0.355
handed-0.350
 hamb-0.350
asts-0.349
Tokenings
Token ID654
Feature activation-3.96e-01

Pos logit contributions
imi+0.748
 tweet+0.716
 how+0.701
anda+0.691
ur+0.690
Neg logit contributions
 distance-0.945
 restaurants-0.877
 burger-0.864
 hamb-0.842
thank-0.775
Token?"
Token ID1701
Feature activation+3.98e+00

Pos logit contributions
arl+0.025
imi+0.024
ig+0.022
ixie+0.022
icked+0.022
Neg logit contributions
 distance-0.031
 restaurants-0.029
 hamb-0.029
 burger-0.028
aster-0.026
Token she
Token ID673
Feature activation+8.85e-02

Pos logit contributions
 hamb+0.376
 distance+0.373
 restaurants+0.371
atmeal+0.354
 burger+0.353
Neg logit contributions
arl-0.178
imi-0.176
 how-0.158
ia-0.141
 knows-0.139
Token asked
Token ID1965
Feature activation+2.28e+00

Pos logit contributions
 distance+0.006
 restaurants+0.006
 hamb+0.006
 burger+0.005
handed+0.005
Neg logit contributions
imi-0.006
arl-0.006
icked-0.006
 how-0.006
ixie-0.006
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
 distance+0.195
 restaurants+0.189
 hamb+0.188
 burger+0.178
 french+0.169
Neg logit contributions
arl-0.133
imi-0.123
icked-0.116
 how-0.113
uttered-0.111
Token have
Token ID423
Feature activation+1.62e+00

Pos logit contributions
arl+0.080
imi+0.075
 tweet+0.070
icked+0.068
ig+0.067
Neg logit contributions
 distance-0.090
 restaurants-0.087
 hamb-0.086
 burger-0.082
dogs-0.079
Token fun
Token ID1257
Feature activation-2.28e-01

Pos logit contributions
 distance+0.120
 hamb+0.112
 restaurants+0.111
 burger+0.106
handed+0.102
Neg logit contributions
arl-0.108
imi-0.103
icked-0.094
 how-0.093
 tweet-0.093
Token,
Token ID11
Feature activation-2.96e-01

Pos logit contributions
arl+0.014
imi+0.012
ixie+0.011
icked+0.011
uttered+0.011
Neg logit contributions
 distance-0.019
 restaurants-0.018
 hamb-0.018
 burger-0.017
aster-0.016
Token my
Token ID616
Feature activation-3.40e+00

Pos logit contributions
arl+0.020
imi+0.019
ig+0.017
icked+0.017
uttered+0.017
Neg logit contributions
 distance-0.022
 restaurants-0.021
 hamb-0.021
 burger-0.020
aster-0.019
Token d
Token ID288
Feature activation-3.15e+00

Pos logit contributions
arl+0.236
imi+0.221
icked+0.202
ig+0.201
ixie+0.201
Neg logit contributions
 distance-0.250
 hamb-0.240
 restaurants-0.239
 burger-0.230
handed-0.211
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 tweet+0.067
imi+0.066
 how+0.063
ixie+0.058
ig+0.054
Neg logit contributions
asts-0.334
 restaurants-0.322
 distance-0.321
handed-0.321
aster-0.319
Tokenings
Token ID654
Feature activation-2.63e-01

Pos logit contributions
imi+0.753
ixie+0.700
 tweet+0.693
anda+0.685
ur+0.683
Neg logit contributions
 distance-0.950
 restaurants-0.914
 burger-0.878
 hamb-0.851
asts-0.822
Token?"
Token ID1701
Feature activation+4.22e+00

Pos logit contributions
arl+0.016
imi+0.015
icked+0.013
ixie+0.013
anda+0.013
Neg logit contributions
 distance-0.022
 restaurants-0.021
 hamb-0.021
 burger-0.020
handed-0.019
Token she
Token ID673
Feature activation+1.31e-01

Pos logit contributions
 distance+0.337
 hamb+0.323
 restaurants+0.321
 burger+0.308
handed+0.292
Neg logit contributions
arl-0.261
imi-0.251
ixie-0.220
icked-0.219
anda-0.219
Token asks
Token ID7893
Feature activation-7.47e-01

Pos logit contributions
 distance+0.010
 restaurants+0.010
 hamb+0.009
 burger+0.009
aster+0.008
Neg logit contributions
arl-0.009
imi-0.008
ig-0.007
uttered-0.007
icked-0.007
Token.
Token ID13
Feature activation+3.71e-01

Pos logit contributions
arl+0.046
imi+0.043
icked+0.039
ig+0.039
uttered+0.039
Neg logit contributions
 distance-0.060
 restaurants-0.058
 hamb-0.058
 burger-0.055
handed-0.052
TokenSpot
Token ID32565
Feature activation+1.35e+00

Pos logit contributions
arl+0.004
imi+0.004
ig+0.003
ixie+0.003
anda+0.003
Neg logit contributions
 distance-0.005
 restaurants-0.005
 hamb-0.005
 burger-0.004
asts-0.004
Token grow
Token ID1663
Feature activation-6.18e+00

Pos logit contributions
 hamb+0.113
 distance+0.112
 burger+0.111
 restaurants+0.111
 cafe+0.102
Neg logit contributions
arl-0.086
imi-0.074
uttered-0.065
ixie-0.062
ig-0.061
Tokenls
Token ID7278
Feature activation-2.12e+00

Pos logit contributions
arl+0.363
imi+0.336
ig+0.307
 tweet+0.305
ra+0.305
Neg logit contributions
 distance-0.503
 restaurants-0.489
 hamb-0.463
 burgers-0.451
 burger-0.451
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
arl+0.151
imi+0.143
ixie+0.129
anda+0.127
ig+0.127
Neg logit contributions
 distance-0.156
 hamb-0.144
 restaurants-0.143
 burger-0.137
handed-0.127
Token sn
Token ID3013
Feature activation-1.17e-02

Pos logit contributions
 distance+0.033
 hamb+0.033
 restaurants+0.032
 burger+0.032
 burgers+0.029
Neg logit contributions
arl-0.028
imi-0.026
uttered-0.024
ixie-0.023
ig-0.023
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
imi+0.001
 tweet+0.001
arl+0.001
 eight+0.001
 bud+0.001
Neg logit contributions
 distance-0.001
 restaurants-0.001
 burger-0.001
 hamb-0.001
handed-0.001
Tokens
Token ID82
Feature activation-2.97e-01

Pos logit contributions
imi+0.766
 tweet+0.737
ixie+0.697
arl+0.697
anda+0.691
Neg logit contributions
 distance-0.918
 restaurants-0.860
 burger-0.846
 hamb-0.827
asts-0.781
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
imi+0.021
arl+0.021
ixie+0.018
ig+0.018
igg+0.018
Neg logit contributions
 distance-0.023
 hamb-0.020
 restaurants-0.019
 burger-0.019
handed-0.018
Token He
Token ID679
Feature activation+5.89e-01

Pos logit contributions
 distance+0.015
 restaurants+0.014
 hamb+0.014
 burger+0.013
handed+0.012
Neg logit contributions
arl-0.018
imi-0.017
icked-0.015
ixie-0.015
uttered-0.015
Token tries
Token ID8404
Feature activation-5.73e+00

Pos logit contributions
 hamb+0.047
 burger+0.046
 distance+0.045
 restaurants+0.045
ns+0.043
Neg logit contributions
arl-0.040
imi-0.034
uttered-0.031
chie-0.030
ixie-0.029
Token to
Token ID284
Feature activation-2.72e-01

Pos logit contributions
arl+0.378
imi+0.357
ig+0.323
ixie+0.318
icked+0.317
Neg logit contributions
 distance-0.453
 hamb-0.424
 restaurants-0.423
 burger-0.405
aster-0.377
Token you
Token ID345
Feature activation-1.34e+00

Pos logit contributions
 restaurants+0.221
 hamb+0.205
aster+0.200
 burger+0.195
 burgers+0.191
Neg logit contributions
arl-0.325
uttered-0.298
ixie-0.289
igg-0.281
 wand-0.280
Token okay
Token ID8788
Feature activation+3.21e+00

Pos logit contributions
arl+0.081
imi+0.075
icked+0.068
ig+0.067
 tweet+0.066
Neg logit contributions
 distance-0.112
 restaurants-0.106
 hamb-0.104
 burger-0.100
aster-0.097
Token,
Token ID11
Feature activation-5.41e-01

Pos logit contributions
 restaurants+0.245
 hamb+0.241
 distance+0.235
 burger+0.230
 barbecue+0.213
Neg logit contributions
arl-0.199
uttered-0.179
ixie-0.175
icked-0.175
igg-0.174
Token my
Token ID616
Feature activation-3.55e+00

Pos logit contributions
uttered+0.036
arl+0.035
ig+0.033
icked+0.033
ixie+0.033
Neg logit contributions
 hamb-0.039
 burger-0.038
 restaurants-0.037
 distance-0.036
 burgers-0.036
Token d
Token ID288
Feature activation-3.35e+00

Pos logit contributions
arl+0.211
imi+0.186
icked+0.179
ixie+0.174
uttered+0.171
Neg logit contributions
 distance-0.286
 hamb-0.286
 burger-0.275
 restaurants-0.266
aster-0.249
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.077
 knows+0.072
 thinks+0.067
 does+0.055
 teamwork+0.053
Neg logit contributions
aster-0.349
asts-0.344
 distance-0.338
Ice-0.326
 hamb-0.325
Tokenings
Token ID654
Feature activation-4.22e-01

Pos logit contributions
imi+0.765
 how+0.724
 tweet+0.715
ur+0.703
anda+0.699
Neg logit contributions
 distance-0.931
 restaurants-0.850
 burger-0.849
 hamb-0.829
asts-0.776
Token?"
Token ID1701
Feature activation+3.92e+00

Pos logit contributions
arl+0.027
imi+0.026
ig+0.023
ixie+0.023
icked+0.023
Neg logit contributions
 distance-0.033
 restaurants-0.031
 hamb-0.031
 burger-0.029
aster-0.028
Token Mom
Token ID11254
Feature activation+2.90e-01

Pos logit contributions
 distance+0.357
 restaurants+0.347
 hamb+0.344
 burger+0.328
handed+0.326
Neg logit contributions
arl-0.200
imi-0.195
 how-0.162
ixie-0.158
icked-0.158
Token asked
Token ID1965
Feature activation+2.25e+00

Pos logit contributions
 distance+0.020
 restaurants+0.020
 hamb+0.019
 burger+0.018
handed+0.018
Neg logit contributions
arl-0.020
imi-0.020
icked-0.018
ixie-0.018
anda-0.018
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
 distance+0.180
 restaurants+0.172
 hamb+0.171
 burger+0.162
handed+0.152
Neg logit contributions
arl-0.143
imi-0.133
icked-0.122
ig-0.121
ixie-0.119
Token have
Token ID423
Feature activation+1.46e+00

Pos logit contributions
arl+0.083
imi+0.076
 tweet+0.070
ig+0.070
icked+0.069
Neg logit contributions
 distance-0.098
 restaurants-0.096
 hamb-0.095
 burger-0.091
handed-0.086
Token fun
Token ID1257
Feature activation-3.35e-01

Pos logit contributions
 distance+0.105
 restaurants+0.099
 hamb+0.099
 burger+0.093
handed+0.088
Neg logit contributions
arl-0.103
imi-0.098
ig-0.089
icked-0.089
ixie-0.089
Token,
Token ID11
Feature activation-4.79e-01

Pos logit contributions
arl+0.021
imi+0.020
ixie+0.018
uttered+0.018
icked+0.018
Neg logit contributions
 distance-0.026
 restaurants-0.026
 hamb-0.025
 burger-0.024
aster-0.023
Token my
Token ID616
Feature activation-3.47e+00

Pos logit contributions
arl+0.030
imi+0.028
uttered+0.026
icked+0.026
ig+0.026
Neg logit contributions
 distance-0.037
 hamb-0.036
 restaurants-0.036
 burger-0.035
aster-0.032
Token d
Token ID288
Feature activation-3.27e+00

Pos logit contributions
arl+0.233
icked+0.212
imi+0.212
icks+0.198
uttered+0.195
Neg logit contributions
 distance-0.260
 hamb-0.256
 burger-0.241
 restaurants-0.240
 burgers-0.220
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 knows+0.072
 how+0.070
 thinks+0.064
 does+0.061
 teamwork+0.057
Neg logit contributions
asts-0.344
aster-0.334
 distance-0.333
line-0.331
Ice-0.327
Tokenings
Token ID654
Feature activation-2.64e-01

Pos logit contributions
imi+0.765
 tweet+0.705
 how+0.700
ixie+0.695
arl+0.694
Neg logit contributions
 distance-0.948
 restaurants-0.875
 burger-0.852
 hamb-0.834
asts-0.796
Token?"
Token ID1701
Feature activation+4.00e+00

Pos logit contributions
arl+0.016
imi+0.015
icked+0.013
 how+0.013
anda+0.012
Neg logit contributions
 distance-0.022
 restaurants-0.021
 hamb-0.021
 burger-0.020
handed-0.019
Token Their
Token ID5334
Feature activation+6.93e-01

Pos logit contributions
 distance+0.403
 restaurants+0.390
 hamb+0.389
handed+0.382
atmeal+0.377
Neg logit contributions
imi-0.154
arl-0.150
 how-0.130
 eight-0.121
 tweet-0.115
Token mother
Token ID2802
Feature activation-1.59e+00

Pos logit contributions
 distance+0.044
 hamb+0.042
 restaurants+0.041
 burger+0.039
 couple+0.037
Neg logit contributions
arl-0.053
icked-0.052
icks-0.049
igg-0.049
ig-0.048
Token asked
Token ID1965
Feature activation+2.18e+00

Pos logit contributions
arl+0.111
imi+0.106
ig+0.096
ixie+0.096
icked+0.096
Neg logit contributions
 distance-0.115
 restaurants-0.110
 hamb-0.109
 burger-0.103
handed-0.097
Token.
Token ID13
Feature activation+3.85e-01

Pos logit contributions
arl+0.145
imi+0.137
ig+0.125
ixie+0.124
icked+0.123
Neg logit contributions
 distance-0.152
 restaurants-0.145
 hamb-0.144
 burger-0.137
aster-0.128
Token
Token ID198
Feature activation-3.44e-02

Pos logit contributions
 distance+0.026
 restaurants+0.025
 hamb+0.025
 burger+0.023
handed+0.022
Neg logit contributions
arl-0.029
imi-0.027
icked-0.025
ixie-0.025
ig-0.025
Token
Token ID198
Feature activation-2.49e-02

Pos logit contributions
arl+0.003
imi+0.003
ig+0.002
icked+0.002
ixie+0.002
Neg logit contributions
 distance-0.002
 restaurants-0.002
 hamb-0.002
 burger-0.002
aster-0.002
TokenRex
Token ID47389
Feature activation-6.93e-01

Pos logit contributions
arl+0.002
imi+0.002
uttered+0.002
icked+0.002
ig+0.002
Neg logit contributions
 distance-0.002
 restaurants-0.002
 hamb-0.002
 burger-0.002
aster-0.002
Token sn
Token ID3013
Feature activation-3.35e-03

Pos logit contributions
arl+0.043
imi+0.038
ixie+0.033
uttered+0.033
chie+0.032
Neg logit contributions
 hamb-0.058
 burger-0.057
 distance-0.056
 restaurants-0.056
 cafe-0.053
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
imi+0.000
 tweet+0.000
 wand+0.000
arl+0.000
 eight+0.000
Neg logit contributions
 distance-0.000
handed-0.000
 burger-0.000
 restaurants-0.000
 hamb-0.000
Tokens
Token ID82
Feature activation-3.14e-01

Pos logit contributions
imi+0.790
 tweet+0.770
 how+0.746
ur+0.735
ixie+0.727
Neg logit contributions
 distance-0.860
 restaurants-0.800
 burger-0.787
 hamb-0.745
asts-0.720
Token.
Token ID13
Feature activation+3.38e-01

Pos logit contributions
imi+0.022
arl+0.021
ixie+0.020
ig+0.019
igg+0.019
Neg logit contributions
 distance-0.025
 hamb-0.021
 restaurants-0.021
 burger-0.020
handed-0.019
Token He
Token ID679
Feature activation+6.62e-01

Pos logit contributions
 distance+0.021
 restaurants+0.020
 hamb+0.020
 burger+0.019
handed+0.018
Neg logit contributions
arl-0.027
imi-0.026
icked-0.023
ixie-0.023
 tweet-0.023
Token does
Token ID857
Feature activation-3.96e+00

Pos logit contributions
 hamb+0.050
 burger+0.049
 distance+0.048
 restaurants+0.048
 weighed+0.045
Neg logit contributions
arl-0.047
imi-0.042
uttered-0.039
chie-0.038
ig-0.038
Token not
Token ID407
Feature activation-1.55e+00

Pos logit contributions
arl+0.219
imi+0.212
ig+0.191
ixie+0.188
anda+0.183
Neg logit contributions
 distance-0.344
 hamb-0.328
 restaurants-0.328
 burger-0.317
aster-0.307
Token's
Token ID338
Feature activation+4.19e+00

Pos logit contributions
arl+0.184
imi+0.178
ig+0.156
ixie+0.152
anda+0.152
Neg logit contributions
 distance-0.332
 restaurants-0.318
 hamb-0.313
 burger-0.298
handed-0.290
Token wonderful
Token ID7932
Feature activation-1.18e+00

Pos logit contributions
 hamb+0.314
 distance+0.303
 restaurants+0.295
 burger+0.292
 burgers+0.275
Neg logit contributions
arl-0.270
uttered-0.251
imi-0.251
ig-0.247
ixie-0.246
Token,
Token ID11
Feature activation-4.36e-01

Pos logit contributions
arl+0.078
imi+0.070
uttered+0.067
icked+0.066
ig+0.065
Neg logit contributions
 hamb-0.090
 distance-0.088
 restaurants-0.088
 burger-0.086
 french-0.080
Token my
Token ID616
Feature activation-3.37e+00

Pos logit contributions
arl+0.028
imi+0.026
ig+0.024
icked+0.023
ixie+0.023
Neg logit contributions
 distance-0.034
 hamb-0.033
 restaurants-0.033
 burger-0.031
aster-0.029
Token d
Token ID288
Feature activation-3.31e+00

Pos logit contributions
arl+0.215
imi+0.198
icked+0.185
uttered+0.180
ixie+0.175
Neg logit contributions
 hamb-0.258
 distance-0.257
 burger-0.250
 restaurants-0.248
 french-0.231
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 tweet+0.118
imi+0.116
 how+0.113
 knows+0.106
 fears+0.098
Neg logit contributions
asts-0.295
 hamb-0.291
aster-0.284
dogs-0.283
 burger-0.282
Tokenings
Token ID654
Feature activation-2.94e-01

Pos logit contributions
imi+0.730
arl+0.691
 tweet+0.650
ixie+0.639
anda+0.628
Neg logit contributions
 distance-0.986
 restaurants-0.931
 burger-0.917
 hamb-0.906
handed-0.859
Token!"
Token ID2474
Feature activation+7.41e-01

Pos logit contributions
arl+0.018
imi+0.018
ig+0.016
uttered+0.016
ixie+0.016
Neg logit contributions
 distance-0.023
 hamb-0.022
 restaurants-0.022
 burger-0.021
handed-0.020
Token Mom
Token ID11254
Feature activation+1.80e-01

Pos logit contributions
 distance+0.060
 restaurants+0.058
 hamb+0.056
 burger+0.053
asts+0.052
Neg logit contributions
arl-0.043
imi-0.042
icked-0.038
 how-0.038
ixie-0.037
Tokenmy
Token ID1820
Feature activation+2.08e-01

Pos logit contributions
 distance+0.014
 hamb+0.014
 restaurants+0.014
 burger+0.013
aster+0.012
Neg logit contributions
arl-0.011
imi-0.010
ig-0.009
uttered-0.009
ixie-0.009
Token says
Token ID1139
Feature activation-2.17e+00

Pos logit contributions
 distance+0.016
 hamb+0.015
 restaurants+0.015
 burger+0.014
aster+0.013
Neg logit contributions
arl-0.014
imi-0.013
ig-0.012
ixie-0.012
icked-0.012
Token "
Token ID366
Feature activation-7.93e-01

Pos logit contributions
arl+0.026
imi+0.025
ig+0.023
ixie+0.023
anda+0.023
Neg logit contributions
 distance-0.032
 hamb-0.031
 restaurants-0.030
 burger-0.030
aster-0.027
TokenHello
Token ID15496
Feature activation+1.88e+00

Pos logit contributions
arl+0.056
imi+0.051
 tweet+0.047
ixie+0.045
icked+0.044
Neg logit contributions
 hamb-0.057
 restaurants-0.056
 distance-0.056
 burger-0.054
asts-0.050
Token,
Token ID11
Feature activation-4.34e-01

Pos logit contributions
 restaurants+0.157
 distance+0.152
 hamb+0.152
 burger+0.147
dogs+0.135
Neg logit contributions
arl-0.110
imi-0.095
uttered-0.093
icked-0.092
igg-0.091
Token my
Token ID616
Feature activation-3.48e+00

Pos logit contributions
uttered+0.029
arl+0.028
icked+0.028
imi+0.028
igg+0.027
Neg logit contributions
 restaurants-0.031
 hamb-0.030
 distance-0.029
 burger-0.029
 vendors-0.026
Token d
Token ID288
Feature activation-3.22e+00

Pos logit contributions
arl+0.158
imi+0.142
icked+0.127
ig+0.123
uttered+0.121
Neg logit contributions
 distance-0.332
 hamb-0.330
 restaurants-0.325
 burger-0.316
dogs-0.298
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
imi+0.044
 how+0.039
 tweet+0.038
 fears+0.029
 knows+0.029
Neg logit contributions
 distance-0.362
asts-0.362
aster-0.360
 hamb-0.360
 restaurants-0.357
Tokenings
Token ID654
Feature activation-2.84e-01

Pos logit contributions
imi+0.738
ur+0.699
 tweet+0.693
anda+0.687
ixie+0.683
Neg logit contributions
 distance-0.937
 restaurants-0.907
 burger-0.876
 hamb-0.862
asts-0.817
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
arl+0.020
imi+0.020
ig+0.018
uttered+0.018
ixie+0.018
Neg logit contributions
 distance-0.020
 restaurants-0.018
 hamb-0.018
 burger-0.017
handed-0.016
Token I
Token ID314
Feature activation-2.41e-01

Pos logit contributions
 distance+0.019
 restaurants+0.018
 hamb+0.018
 burger+0.017
handed+0.016
Neg logit contributions
arl-0.013
imi-0.013
ig-0.011
icked-0.011
ixie-0.011
Token'm
Token ID1101
Feature activation+1.10e+00

Pos logit contributions
arl+0.015
imi+0.014
icked+0.013
ig+0.013
ixie+0.013
Neg logit contributions
 distance-0.019
 restaurants-0.018
 hamb-0.018
 burger-0.017
aster-0.016
Token so
Token ID523
Feature activation+1.16e+00

Pos logit contributions
 distance+0.103
 restaurants+0.098
 hamb+0.098
 burger+0.094
handed+0.090
Neg logit contributions
arl-0.054
imi-0.050
icked-0.043
ixie-0.043
anda-0.042
Token'm
Token ID1101
Feature activation+1.10e+00

Pos logit contributions
 distance+0.152
 restaurants+0.146
 hamb+0.144
 burger+0.138
handed+0.131
Neg logit contributions
arl-0.112
imi-0.105
ixie-0.094
ig-0.094
anda-0.093
Token sorry
Token ID7926
Feature activation-1.16e+00

Pos logit contributions
 distance+0.095
 hamb+0.092
 restaurants+0.091
 burger+0.088
aster+0.084
Neg logit contributions
arl-0.059
imi-0.055
ig-0.050
uttered-0.049
ixie-0.049
Token,
Token ID11
Feature activation-4.97e-01

Pos logit contributions
arl+0.090
uttered+0.090
icked+0.082
igg+0.082
rave+0.080
Neg logit contributions
 hamb-0.064
 restaurants-0.063
 burger-0.061
 burgers-0.059
 distance-0.058
Token my
Token ID616
Feature activation-3.49e+00

Pos logit contributions
uttered+0.033
arl+0.032
icked+0.030
ixie+0.029
igg+0.029
Neg logit contributions
 hamb-0.036
 distance-0.036
 burger-0.035
 restaurants-0.034
 burgers-0.033
Token d
Token ID288
Feature activation-3.31e+00

Pos logit contributions
arl+0.210
imi+0.188
icked+0.176
ixie+0.172
uttered+0.170
Neg logit contributions
 distance-0.283
 hamb-0.282
 burger-0.270
 restaurants-0.263
aster-0.246
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.080
 knows+0.071
 thinks+0.069
 does+0.056
imi+0.055
Neg logit contributions
aster-0.339
asts-0.330
 distance-0.324
Ice-0.315
line-0.313
Tokenings
Token ID654
Feature activation-3.83e-01

Pos logit contributions
imi+0.778
 how+0.746
 tweet+0.717
ur+0.715
anda+0.708
Neg logit contributions
 distance-0.917
 burger-0.826
 restaurants-0.818
 hamb-0.805
thank-0.761
Token.
Token ID13
Feature activation+1.52e-01

Pos logit contributions
arl+0.027
imi+0.026
ig+0.024
uttered+0.024
ixie+0.024
Neg logit contributions
 distance-0.027
 hamb-0.025
 restaurants-0.025
 burger-0.023
handed-0.022
Token That
Token ID1320
Feature activation-3.96e+00

Pos logit contributions
 distance+0.011
 hamb+0.011
 restaurants+0.010
 burger+0.010
aster+0.010
Neg logit contributions
arl-0.010
imi-0.010
ig-0.009
ixie-0.009
anda-0.009
Token was
Token ID373
Feature activation+4.10e-02

Pos logit contributions
imi+0.211
arl+0.197
 tweet+0.184
ig+0.181
anda+0.181
Neg logit contributions
 distance-0.337
 restaurants-0.328
 hamb-0.319
 burger-0.308
asts-0.302
Token a
Token ID257
Feature activation-1.65e-02

Pos logit contributions
 distance+0.003
 hamb+0.003
 burger+0.003
 restaurants+0.003
 burgers+0.003
Neg logit contributions
arl-0.003
imi-0.003
ig-0.003
uttered-0.003
ixie-0.003
Token'm
Token ID1101
Feature activation+1.10e+00

Pos logit contributions
 distance+0.144
 restaurants+0.140
 hamb+0.134
 burger+0.131
asts+0.126
Neg logit contributions
arl-0.104
imi-0.098
 knows-0.089
ixie-0.088
 tweet-0.087
Token sorry
Token ID7926
Feature activation-1.17e+00

Pos logit contributions
 distance+0.102
 restaurants+0.098
 hamb+0.098
 burger+0.094
aster+0.090
Neg logit contributions
arl-0.054
imi-0.050
ig-0.044
ixie-0.044
icked-0.043
Token,
Token ID11
Feature activation-4.52e-01

Pos logit contributions
arl+0.090
uttered+0.089
igg+0.085
icked+0.084
ixie+0.079
Neg logit contributions
 restaurants-0.064
 hamb-0.064
 burgers-0.058
 distance-0.057
 burger-0.057
Token my
Token ID616
Feature activation-3.54e+00

Pos logit contributions
arl+0.034
uttered+0.033
ixie+0.032
imi+0.032
icked+0.032
Neg logit contributions
 hamb-0.028
 distance-0.028
 restaurants-0.026
 burger-0.025
 burgers-0.024
Token d
Token ID288
Feature activation-3.31e+00

Pos logit contributions
arl+0.177
imi+0.157
icked+0.149
uttered+0.144
ig+0.142
Neg logit contributions
 hamb-0.322
 distance-0.319
 restaurants-0.308
 burger-0.305
dogs-0.285
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.048
 knows+0.040
 tweet+0.039
 thinks+0.032
imi+0.030
Neg logit contributions
 distance-0.366
 hamb-0.365
aster-0.364
asts-0.362
dogs-0.357
Tokenings
Token ID654
Feature activation-3.35e-01

Pos logit contributions
imi+0.733
 tweet+0.692
 how+0.688
arl+0.682
ixie+0.671
Neg logit contributions
 distance-0.952
 restaurants-0.881
 hamb-0.864
 burger-0.861
asts-0.805
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
arl+0.025
imi+0.024
uttered+0.024
ig+0.023
ixie+0.023
Neg logit contributions
 distance-0.022
 hamb-0.019
 restaurants-0.019
 burger-0.018
handed-0.017
Token That
Token ID1320
Feature activation-4.10e+00

Pos logit contributions
 distance+0.015
 restaurants+0.014
 hamb+0.014
 burger+0.014
aster+0.013
Neg logit contributions
arl-0.013
imi-0.013
ig-0.012
ixie-0.012
anda-0.012
Token was
Token ID373
Feature activation-6.04e-02

Pos logit contributions
imi+0.235
arl+0.218
 how+0.201
anda+0.200
ig+0.199
Neg logit contributions
 distance-0.336
 restaurants-0.320
 hamb-0.316
 burger-0.304
asts-0.300
Token a
Token ID257
Feature activation-6.07e-02

Pos logit contributions
arl+0.004
imi+0.004
ig+0.004
uttered+0.004
ixie+0.004
Neg logit contributions
 distance-0.004
 hamb-0.004
 restaurants-0.004
 burger-0.004
 burgers-0.004
Token"
Token ID1
Feature activation-3.22e+00

Pos logit contributions
arl+0.011
imi+0.011
ixie+0.009
ig+0.009
icked+0.009
Neg logit contributions
 distance-0.016
 restaurants-0.015
 hamb-0.015
 burger-0.014
handed-0.013
TokenHello
Token ID15496
Feature activation+1.83e+00

Pos logit contributions
arl+0.221
imi+0.194
 tweet+0.186
 how+0.180
 bud+0.175
Neg logit contributions
 hamb-0.232
 restaurants-0.226
 distance-0.225
 burger-0.219
asts-0.206
Token,
Token ID11
Feature activation-4.87e-01

Pos logit contributions
 restaurants+0.154
 distance+0.150
 hamb+0.148
 burger+0.142
 vendors+0.134
Neg logit contributions
arl-0.107
imi-0.095
icked-0.089
igg-0.089
uttered-0.088
Token my
Token ID616
Feature activation-3.51e+00

Pos logit contributions
arl+0.034
imi+0.033
icked+0.032
ig+0.032
igg+0.032
Neg logit contributions
 distance-0.033
 restaurants-0.033
 hamb-0.032
 burger-0.030
 vendors-0.028
Token d
Token ID288
Feature activation-3.25e+00

Pos logit contributions
arl+0.203
imi+0.187
icked+0.175
ig+0.167
icks+0.166
Neg logit contributions
 distance-0.289
 hamb-0.286
 restaurants-0.283
 burger-0.270
dogs-0.254
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
imi+0.060
 how+0.054
 tweet+0.051
 knows+0.048
 fears+0.047
Neg logit contributions
 distance-0.354
asts-0.349
 restaurants-0.347
line-0.345
ding-0.342
Tokenings
Token ID654
Feature activation-2.46e-01

Pos logit contributions
imi+0.764
ur+0.716
 tweet+0.713
arl+0.708
ixie+0.701
Neg logit contributions
 distance-0.928
 restaurants-0.895
 burger-0.834
 hamb-0.821
asts-0.794
Token.
Token ID13
Feature activation+1.76e-01

Pos logit contributions
arl+0.016
imi+0.015
ig+0.013
ixie+0.013
icked+0.013
Neg logit contributions
 distance-0.019
 restaurants-0.018
 hamb-0.018
 burger-0.017
aster-0.016
Token I
Token ID314
Feature activation-2.92e-01

Pos logit contributions
 distance+0.014
 restaurants+0.014
 hamb+0.014
 burger+0.013
aster+0.012
Neg logit contributions
arl-0.011
imi-0.010
icked-0.009
ig-0.009
ixie-0.009
Token'm
Token ID1101
Feature activation+1.10e+00

Pos logit contributions
arl+0.017
imi+0.016
 tweet+0.015
icked+0.015
ig+0.015
Neg logit contributions
 distance-0.023
 restaurants-0.022
 hamb-0.022
 burger-0.022
dogs-0.020
Token so
Token ID523
Feature activation+1.08e+00

Pos logit contributions
 distance+0.098
 restaurants+0.093
 hamb+0.092
 burger+0.089
handed+0.085
Neg logit contributions
arl-0.059
imi-0.056
anda-0.048
ixie-0.048
 how-0.048
TokenWhat
Token ID2061
Feature activation+9.25e-01

Pos logit contributions
arl+0.049
imi+0.046
ig+0.041
ixie+0.041
 tweet+0.041
Neg logit contributions
 distance-0.051
 hamb-0.050
 restaurants-0.050
 burger-0.047
asts-0.045
Token happened
Token ID3022
Feature activation+1.92e+00

Pos logit contributions
 distance+0.089
 hamb+0.081
 restaurants+0.081
 burger+0.078
handed+0.077
Neg logit contributions
imi-0.043
arl-0.043
icked-0.039
 how-0.038
icks-0.038
Token,
Token ID11
Feature activation-3.39e-01

Pos logit contributions
 distance+0.171
 restaurants+0.165
 hamb+0.164
 burger+0.157
aster+0.149
Neg logit contributions
arl-0.102
imi-0.095
ixie-0.083
ig-0.083
icked-0.083
Token my
Token ID616
Feature activation-3.42e+00

Pos logit contributions
uttered+0.022
arl+0.022
icked+0.021
ig+0.020
anda+0.020
Neg logit contributions
 hamb-0.025
 restaurants-0.024
 burger-0.024
 distance-0.024
 burgers-0.022
Token d
Token ID288
Feature activation-3.24e+00

Pos logit contributions
arl+0.189
imi+0.170
icked+0.159
ig+0.156
uttered+0.154
Neg logit contributions
 distance-0.288
 hamb-0.288
 burger-0.281
 restaurants-0.274
handed-0.256
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.052
 tweet+0.045
 knows+0.044
imi+0.042
 thinks+0.038
Neg logit contributions
aster-0.359
 distance-0.352
asts-0.350
 hamb-0.342
Ice-0.340
Tokenings
Token ID654
Feature activation-2.75e-01

Pos logit contributions
imi+0.747
 tweet+0.712
 how+0.709
anda+0.696
ur+0.692
Neg logit contributions
 distance-0.931
 restaurants-0.854
 burger-0.852
 hamb-0.825
asts-0.776
Token?
Token ID30
Feature activation+9.86e-01

Pos logit contributions
arl+0.017
imi+0.016
ig+0.015
ixie+0.015
anda+0.015
Neg logit contributions
 distance-0.021
 restaurants-0.020
 hamb-0.020
 burger-0.019
aster-0.018
Token Why
Token ID4162
Feature activation-2.33e+00

Pos logit contributions
 distance+0.073
 hamb+0.071
 restaurants+0.071
 burger+0.067
aster+0.063
Neg logit contributions
arl-0.065
imi-0.061
ig-0.056
ixie-0.055
anda-0.055
Token do
Token ID466
Feature activation+1.04e+00

Pos logit contributions
arl+0.130
imi+0.112
uttered+0.104
ixie+0.104
ig+0.104
Neg logit contributions
 restaurants-0.202
 hamb-0.199
 distance-0.198
 burger-0.190
aster-0.178
Token you
Token ID345
Feature activation-1.26e+00

Pos logit contributions
 restaurants+0.089
 hamb+0.085
 burger+0.084
 distance+0.083
aster+0.082
Neg logit contributions
arl-0.060
uttered-0.051
imi-0.050
igg-0.049
icked-0.048
Token forgive
Token ID20927
Feature activation-4.56e-01

Pos logit contributions
arl+0.009
imi+0.008
ig+0.007
uttered+0.007
ixie+0.007
Neg logit contributions
 distance-0.021
 hamb-0.020
 restaurants-0.020
 burger-0.020
aster-0.019
Token you
Token ID345
Feature activation-1.24e+00

Pos logit contributions
arl+0.020
imi+0.018
ig+0.015
ixie+0.015
icked+0.015
Neg logit contributions
 distance-0.045
 restaurants-0.044
 hamb-0.044
 burger-0.042
aster-0.040
Token,
Token ID11
Feature activation-5.60e-01

Pos logit contributions
arl+0.071
imi+0.067
ig+0.059
icked+0.059
anda+0.059
Neg logit contributions
 distance-0.107
 restaurants-0.102
 hamb-0.100
 burger-0.096
handed-0.092
Token my
Token ID616
Feature activation-3.46e+00

Pos logit contributions
arl+0.037
uttered+0.035
icked+0.033
imi+0.033
igg+0.032
Neg logit contributions
 hamb-0.042
 restaurants-0.041
 distance-0.040
 burger-0.040
 french-0.036
Token d
Token ID288
Feature activation-3.27e+00

Pos logit contributions
arl+0.203
imi+0.178
icked+0.173
uttered+0.170
ig+0.166
Neg logit contributions
 hamb-0.290
 distance-0.280
 burger-0.278
 restaurants-0.275
 french-0.256
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.057
 knows+0.055
imi+0.047
 thinks+0.046
 does+0.045
Neg logit contributions
asts-0.354
aster-0.353
 hamb-0.340
 distance-0.338
Ice-0.338
Tokenings
Token ID654
Feature activation-3.24e-01

Pos logit contributions
imi+0.779
 how+0.728
 tweet+0.723
ur+0.719
arl+0.710
Neg logit contributions
 distance-0.895
 restaurants-0.839
 burger-0.835
 hamb-0.799
thank-0.758
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
arl+0.022
imi+0.020
ig+0.018
ixie+0.018
icked+0.018
Neg logit contributions
 distance-0.024
 restaurants-0.023
 hamb-0.023
 burger-0.022
aster-0.021
Token And
Token ID843
Feature activation+8.82e-01

Pos logit contributions
 distance+0.008
 hamb+0.008
 restaurants+0.008
 burger+0.007
Ice+0.007
Neg logit contributions
arl-0.008
imi-0.008
ig-0.008
anda-0.008
igg-0.008
Token of
Token ID286
Feature activation+1.73e+00

Pos logit contributions
 restaurants+0.059
 hamb+0.058
 burger+0.056
 distance+0.056
 customers+0.051
Neg logit contributions
arl-0.065
imi-0.062
igg-0.058
uttered-0.056
anda-0.054
Token course
Token ID1781
Feature activation-4.34e+00

Pos logit contributions
 restaurants+0.145
 distance+0.145
 hamb+0.141
aster+0.137
 burger+0.135
Neg logit contributions
arl-0.100
uttered-0.092
ixie-0.091
imi-0.091
anda-0.090
Token'm
Token ID1101
Feature activation+1.18e+00

Pos logit contributions
 distance+0.160
 restaurants+0.153
 hamb+0.149
 burger+0.145
dogs+0.140
Neg logit contributions
arl-0.106
imi-0.099
 knows-0.091
anda-0.090
ixie-0.090
Token sorry
Token ID7926
Feature activation-1.19e+00

Pos logit contributions
 distance+0.112
 restaurants+0.106
 hamb+0.105
 burger+0.101
handed+0.098
Neg logit contributions
arl-0.057
imi-0.052
icked-0.045
ixie-0.045
anda-0.044
Token,
Token ID11
Feature activation-3.74e-01

Pos logit contributions
arl+0.080
uttered+0.075
igg+0.072
icked+0.071
ig+0.068
Neg logit contributions
 hamb-0.086
 restaurants-0.086
 burger-0.082
 distance-0.081
 burgers-0.078
Token my
Token ID616
Feature activation-3.41e+00

Pos logit contributions
arl+0.024
imi+0.023
uttered+0.022
ig+0.022
icked+0.022
Neg logit contributions
 hamb-0.028
 distance-0.028
 restaurants-0.027
 burger-0.027
 burgers-0.025
Token d
Token ID288
Feature activation-3.17e+00

Pos logit contributions
arl+0.161
imi+0.147
icked+0.130
ig+0.127
uttered+0.126
Neg logit contributions
 distance-0.320
 hamb-0.314
 restaurants-0.310
 burger-0.303
handed-0.285
Tokenarl
Token ID7063
Feature activation-1.24e+01

Pos logit contributions
 how+0.049
 tweet+0.045
 knows+0.044
imi+0.040
 thinks+0.034
Neg logit contributions
asts-0.343
aster-0.338
 hamb-0.337
 distance-0.337
ding-0.331
Tokenings
Token ID654
Feature activation-2.50e-01

Pos logit contributions
imi+0.751
arl+0.701
 tweet+0.700
 how+0.696
ixie+0.680
Neg logit contributions
 distance-0.945
 restaurants-0.879
 burger-0.868
 hamb-0.855
asts-0.799
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
arl+0.018
imi+0.017
ig+0.016
uttered+0.016
ixie+0.016
Neg logit contributions
 distance-0.017
 hamb-0.016
 restaurants-0.016
 burger-0.015
handed-0.014
Token The
Token ID383
Feature activation+1.68e+00

Pos logit contributions
 distance+0.027
 restaurants+0.025
 hamb+0.025
 burger+0.024
aster+0.023
Neg logit contributions
arl-0.021
imi-0.020
ig-0.018
ixie-0.017
icked-0.017
Token tornado
Token ID33718
Feature activation-8.41e-01

Pos logit contributions
 distance+0.119
 restaurants+0.111
 hamb+0.109
 burger+0.104
handed+0.097
Neg logit contributions
arl-0.120
imi-0.116
icked-0.108
uttered-0.107
ixie-0.106
Token took
Token ID1718
Feature activation-1.00e+00

Pos logit contributions
arl+0.045
imi+0.039
ixie+0.033
icked+0.032
uttered+0.032
Neg logit contributions
 distance-0.079
 hamb-0.074
 restaurants-0.073
 burger-0.071
 weighed-0.069
Token on
Token ID319
Feature activation+2.01e+00

Pos logit contributions
 distance+0.040
 restaurants+0.038
 hamb+0.038
 burger+0.036
aster+0.034
Neg logit contributions
arl-0.033
imi-0.031
ig-0.028
ixie-0.028
icked-0.028
Token the
Token ID262
Feature activation-5.08e-02

Pos logit contributions
 distance+0.167
 restaurants+0.159
 hamb+0.159
 burger+0.151
aster+0.143
Neg logit contributions
arl-0.119
imi-0.112
ig-0.100
ixie-0.099
icked-0.099
Token floor
Token ID4314
Feature activation+3.26e+00

Pos logit contributions
arl+0.004
imi+0.004
 bud+0.003
ig+0.003
 how+0.003
Neg logit contributions
 hamb-0.003
 restaurants-0.003
 distance-0.003
 burger-0.003
handed-0.003
Token under
Token ID739
Feature activation+3.03e+00

Pos logit contributions
 distance+0.234
 restaurants+0.221
 hamb+0.219
 burger+0.207
aster+0.195
Neg logit contributions
arl-0.229
imi-0.218
ixie-0.198
ig-0.198
icked-0.196
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
 distance+0.261
 hamb+0.253
 restaurants+0.251
 burger+0.241
handed+0.228
Neg logit contributions
arl-0.169
imi-0.158
ig-0.138
icked-0.137
ixie-0.135
Token table
Token ID3084
Feature activation+5.62e+00

Pos logit contributions
arl+0.002
imi+0.002
ig+0.001
ixie+0.001
icks+0.001
Neg logit contributions
 distance-0.001
 hamb-0.001
 restaurants-0.001
 burger-0.001
handed-0.001
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
 distance+0.447
 hamb+0.439
 restaurants+0.436
 burger+0.417
dogs+0.395
Neg logit contributions
arl-0.340
imi-0.305
ig-0.285
icked-0.282
 how-0.281
Token The
Token ID383
Feature activation+1.67e+00

Pos logit contributions
 distance+0.020
 restaurants+0.019
 hamb+0.019
 burger+0.018
aster+0.017
Neg logit contributions
arl-0.019
imi-0.018
ig-0.016
ixie-0.016
icked-0.016
Token table
Token ID3084
Feature activation+5.63e+00

Pos logit contributions
 distance+0.114
 restaurants+0.108
 hamb+0.108
 burger+0.102
aster+0.095
Neg logit contributions
arl-0.123
imi-0.117
ig-0.107
ixie-0.106
icked-0.106
Token was
Token ID373
Feature activation+2.15e-02

Pos logit contributions
 distance+0.446
 hamb+0.422
 restaurants+0.420
 burger+0.402
aster+0.379
Neg logit contributions
arl-0.361
imi-0.336
ixie-0.299
ig-0.298
icked-0.293
Token strong
Token ID1913
Feature activation+1.24e+00

Pos logit contributions
 distance+0.002
 hamb+0.001
 restaurants+0.001
 burger+0.001
aster+0.001
Neg logit contributions
arl-0.001
imi-0.001
ig-0.001
ixie-0.001
icked-0.001
Token put
Token ID1234
Feature activation-1.84e-01

Pos logit contributions
arl+0.141
imi+0.126
ig+0.116
ixie+0.115
 tweet+0.114
Neg logit contributions
 distance-0.105
 hamb-0.101
 restaurants-0.099
 burger-0.096
aster-0.087
Token the
Token ID262
Feature activation+6.67e-02

Pos logit contributions
arl+0.013
imi+0.013
anda+0.012
ixie+0.012
ig+0.011
Neg logit contributions
 distance-0.012
 hamb-0.012
 burger-0.012
 restaurants-0.012
dogs-0.011
Token bunny
Token ID44915
Feature activation-2.14e+00

Pos logit contributions
 distance+0.005
 restaurants+0.005
 hamb+0.005
 burger+0.005
handed+0.004
Neg logit contributions
arl-0.005
imi-0.004
icked-0.004
uttered-0.004
igg-0.004
Token in
Token ID287
Feature activation-5.77e-01

Pos logit contributions
arl+0.169
imi+0.151
ixie+0.135
igg+0.132
icked+0.130
Neg logit contributions
 distance-0.154
 restaurants-0.139
 hamb-0.137
 burger-0.136
handed-0.122
Token the
Token ID262
Feature activation+6.41e-02

Pos logit contributions
arl+0.032
imi+0.030
ig+0.026
ixie+0.026
icked+0.026
Neg logit contributions
 distance-0.051
 restaurants-0.049
 hamb-0.048
 burger-0.046
handed-0.044
Token middle
Token ID3504
Feature activation+8.24e+00

Pos logit contributions
 distance+0.004
 restaurants+0.004
 hamb+0.004
 burger+0.004
handed+0.003
Neg logit contributions
arl-0.005
imi-0.005
icked-0.005
uttered-0.005
anda-0.004
Token of
Token ID286
Feature activation+1.91e+00

Pos logit contributions
 distance+0.623
 restaurants+0.594
 hamb+0.592
 burger+0.561
handed+0.527
Neg logit contributions
arl-0.547
imi-0.516
ig-0.467
ixie-0.464
icked-0.463
Token the
Token ID262
Feature activation+1.20e-01

Pos logit contributions
 distance+0.151
 restaurants+0.146
 hamb+0.140
 burger+0.134
aster+0.131
Neg logit contributions
arl-0.125
imi-0.115
anda-0.111
ixie-0.109
icked-0.109
Token case
Token ID1339
Feature activation-2.99e+00

Pos logit contributions
 distance+0.007
 restaurants+0.006
 hamb+0.006
 burger+0.006
handed+0.005
Neg logit contributions
arl-0.010
imi-0.010
icked-0.010
uttered-0.009
ixie-0.009
Token,
Token ID11
Feature activation-2.80e-01

Pos logit contributions
arl+0.167
imi+0.156
ig+0.140
ixie+0.138
icked+0.138
Neg logit contributions
 distance-0.255
 hamb-0.248
 restaurants-0.247
 burger-0.235
aster-0.222
Token next
Token ID1306
Feature activation-4.39e+00

Pos logit contributions
arl+0.019
imi+0.018
ig+0.016
 tweet+0.016
icked+0.016
Neg logit contributions
 distance-0.020
 restaurants-0.020
 hamb-0.019
 burger-0.018
handed-0.018
Token with
Token ID351
Feature activation+1.13e+00

Pos logit contributions
 fries+0.474
 hamb+0.474
 french+0.469
 burger+0.462
 restaurants+0.454
Neg logit contributions
 how-0.164
arl-0.132
:-0.119
uff-0.117
uttered-0.117
Token his
Token ID465
Feature activation+1.01e+00

Pos logit contributions
 distance+0.083
 hamb+0.077
 restaurants+0.076
 burger+0.076
aster+0.072
Neg logit contributions
arl-0.072
imi-0.069
 how-0.067
icked-0.066
 eight-0.063
Token mom
Token ID1995
Feature activation-2.85e-01

Pos logit contributions
 distance+0.083
 hamb+0.079
 restaurants+0.078
 burger+0.075
handed+0.070
Neg logit contributions
arl-0.061
imi-0.055
icked-0.052
 tweet-0.051
ig-0.051
Token.
Token ID13
Feature activation+9.79e-02

Pos logit contributions
 how+0.009
imi+0.008
el+0.008
arl+0.008
:+0.007
Neg logit contributions
 restaurants-0.029
 distance-0.029
handed-0.028
 hamb-0.028
aster-0.028
Token He
Token ID679
Feature activation+3.77e-01

Pos logit contributions
 distance+0.008
 restaurants+0.008
 hamb+0.008
handed+0.008
 burger+0.007
Neg logit contributions
arl-0.005
imi-0.005
uttered-0.005
 how-0.005
icked-0.005
Token noticed
Token ID6810
Feature activation+5.90e+00

Pos logit contributions
 distance+0.026
 restaurants+0.026
 hamb+0.025
 burger+0.024
asts+0.024
Neg logit contributions
imi-0.025
arl-0.024
anda-0.023
ixie-0.023
ig-0.023
Token a
Token ID257
Feature activation-1.28e-01

Pos logit contributions
 distance+0.515
 restaurants+0.484
 hamb+0.482
 burger+0.468
asts+0.452
Neg logit contributions
 how-0.306
imi-0.299
arl-0.286
icked-0.262
uttered-0.249
Token big
Token ID1263
Feature activation-9.37e-01

Pos logit contributions
arl+0.007
imi+0.006
ixie+0.006
anda+0.006
ig+0.006
Neg logit contributions
 restaurants-0.011
 distance-0.011
 hamb-0.010
handed-0.010
asts-0.010
Token billboard
Token ID41448
Feature activation+2.71e+00

Pos logit contributions
arl+0.060
imi+0.057
ig+0.052
 tweet+0.051
anda+0.050
Neg logit contributions
 distance-0.070
 restaurants-0.068
 hamb-0.067
handed-0.064
 burger-0.063
Token.
Token ID13
Feature activation+7.89e-02

Pos logit contributions
 restaurants+0.248
 hamb+0.241
 distance+0.239
 french+0.228
 burger+0.228
Neg logit contributions
arl-0.119
 how-0.115
imi-0.110
 bud-0.103
ig-0.100
Token On
Token ID1550
Feature activation+7.14e-02

Pos logit contributions
 distance+0.007
 restaurants+0.006
 hamb+0.006
handed+0.006
 burger+0.006
Neg logit contributions
arl-0.004
imi-0.004
uttered-0.004
 how-0.004
icked-0.004
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
arl+0.059
imi+0.056
ig+0.053
anda+0.052
ixie+0.051
Neg logit contributions
 distance-0.050
 hamb-0.049
 restaurants-0.049
 burger-0.046
handed-0.043
Token The
Token ID383
Feature activation+1.69e+00

Pos logit contributions
 distance+0.002
 restaurants+0.002
 hamb+0.002
atmeal+0.002
handed+0.002
Neg logit contributions
arl-0.001
uttered-0.001
imi-0.001
 how-0.001
ixie-0.001
Token family
Token ID1641
Feature activation+3.11e+00

Pos logit contributions
 distance+0.130
 hamb+0.128
 restaurants+0.128
 burger+0.121
aster+0.121
Neg logit contributions
arl-0.103
imi-0.095
anda-0.086
ixie-0.085
 eight-0.084
Token had
Token ID550
Feature activation+9.09e-01

Pos logit contributions
 distance+0.247
 restaurants+0.238
 hamb+0.236
 burger+0.226
handed+0.215
Neg logit contributions
arl-0.183
imi-0.178
ig-0.166
icked-0.164
anda-0.164
Token to
Token ID284
Feature activation-2.32e-01

Pos logit contributions
 distance+0.075
 restaurants+0.072
 hamb+0.072
 burger+0.069
aster+0.068
Neg logit contributions
arl-0.048
 how-0.045
imi-0.044
icked-0.042
 eight-0.042
Token choose
Token ID3853
Feature activation+7.35e+00

Pos logit contributions
arl+0.012
imi+0.011
 how+0.011
 bud+0.011
 tweet+0.010
Neg logit contributions
 hamb-0.019
 distance-0.019
 burger-0.019
 restaurants-0.018
dogs-0.018
Token one
Token ID530
Feature activation+1.61e-01

Pos logit contributions
 distance+0.579
 restaurants+0.559
 hamb+0.545
 burger+0.518
 vendors+0.491
Neg logit contributions
arl-0.478
imi-0.452
ig-0.405
igg-0.403
ixie-0.397
Token of
Token ID286
Feature activation+1.92e+00

Pos logit contributions
 distance+0.013
 hamb+0.012
 restaurants+0.012
 burger+0.012
aster+0.011
Neg logit contributions
arl-0.010
imi-0.010
icked-0.009
ixie-0.008
igg-0.008
Token the
Token ID262
Feature activation+6.20e-02

Pos logit contributions
 distance+0.150
 restaurants+0.145
 hamb+0.143
 burger+0.136
aster+0.129
Neg logit contributions
arl-0.128
imi-0.119
ixie-0.109
anda-0.108
ig-0.108
Token roads
Token ID9725
Feature activation+1.19e+00

Pos logit contributions
 hamb+0.004
 distance+0.004
 restaurants+0.004
 burger+0.004
aster+0.004
Neg logit contributions
arl-0.004
imi-0.004
anda-0.004
 bud-0.004
ig-0.004
Token,
Token ID11
Feature activation-4.79e-01

Pos logit contributions
 distance+0.102
 restaurants+0.100
 hamb+0.100
 burger+0.095
handed+0.090
Neg logit contributions
arl-0.064
imi-0.058
ig-0.053
 how-0.052
icked-0.052
Token mom
Token ID1995
Feature activation-1.56e-01

Pos logit contributions
 distance+0.321
 restaurants+0.308
 hamb+0.308
 burger+0.295
aster+0.279
Neg logit contributions
arl-0.192
imi-0.177
ig-0.157
ixie-0.155
icked-0.155
Token made
Token ID925
Feature activation+2.21e-01

Pos logit contributions
imi+0.009
arl+0.009
 how+0.008
icked+0.008
 knows+0.008
Neg logit contributions
 distance-0.013
 restaurants-0.012
 hamb-0.011
handed-0.011
 burger-0.011
Token him
Token ID683
Feature activation+1.85e+00

Pos logit contributions
 distance+0.017
 restaurants+0.014
 hamb+0.013
 burger+0.013
handed+0.013
Neg logit contributions
imi-0.015
arl-0.015
 how-0.014
icked-0.014
igg-0.014
Token spaghetti
Token ID48241
Feature activation+6.40e+00

Pos logit contributions
 distance+0.157
 restaurants+0.142
 hamb+0.137
 burger+0.130
 vendors+0.130
Neg logit contributions
 how-0.101
imi-0.099
arl-0.097
icks-0.093
icked-0.092
Token for
Token ID329
Feature activation+4.86e-01

Pos logit contributions
 distance+0.529
 hamb+0.510
 restaurants+0.510
 burger+0.487
aster+0.461
Neg logit contributions
arl-0.378
imi-0.351
ixie-0.314
ig-0.312
anda-0.311
Token dinner
Token ID8073
Feature activation+5.59e+00

Pos logit contributions
 distance+0.027
 restaurants+0.021
 hamb+0.021
 burger+0.020
handed+0.020
Neg logit contributions
arl-0.043
imi-0.043
 how-0.040
icked-0.038
 eight-0.038
Token.
Token ID13
Feature activation+2.60e-01

Pos logit contributions
 distance+0.517
 restaurants+0.477
 hamb+0.465
handed+0.448
 burger+0.444
Neg logit contributions
arl-0.259
imi-0.250
 how-0.243
ig-0.213
 bud-0.202
Token He
Token ID679
Feature activation+4.10e-01

Pos logit contributions
 distance+0.022
 restaurants+0.021
 hamb+0.020
 burger+0.020
asts+0.019
Neg logit contributions
arl-0.015
imi-0.015
 how-0.013
ixie-0.012
uttered-0.012
Token was
Token ID373
Feature activation+1.45e-01

Pos logit contributions
 distance+0.029
 restaurants+0.027
 hamb+0.027
 burger+0.025
handed+0.024
Neg logit contributions
arl-0.029
imi-0.028
icked-0.026
ig-0.025
ixie-0.025
Token so
Token ID523
Feature activation+1.09e+00

Pos logit contributions
 distance+0.013
 restaurants+0.012
handed+0.011
 hamb+0.011
 burger+0.011
Neg logit contributions
arl-0.008
 how-0.008
imi-0.007
icked-0.007
 eight-0.007
Token excited
Token ID6568
Feature activation+6.39e-01

Pos logit contributions
 distance+0.073
 restaurants+0.068
 hamb+0.067
 burger+0.063
handed+0.059
Neg logit contributions
arl-0.084
imi-0.080
icked-0.073
ig-0.073
ixie-0.073
Token is
Token ID318
Feature activation-3.35e+00

Pos logit contributions
arl+0.317
imi+0.258
ixie+0.237
uttered+0.230
icked+0.228
Neg logit contributions
 distance-0.578
 hamb-0.531
 restaurants-0.523
 burger-0.512
handed-0.495
Token not
Token ID407
Feature activation-1.62e+00

Pos logit contributions
arl+0.236
imi+0.224
uttered+0.218
ixie+0.218
ig+0.214
Neg logit contributions
 distance-0.238
 hamb-0.226
 burger-0.211
 restaurants-0.197
 burgers-0.197
Token a
Token ID257
Feature activation-7.16e-02

Pos logit contributions
arl+0.115
imi+0.110
uttered+0.104
ixie+0.104
ig+0.101
Neg logit contributions
 distance-0.113
 hamb-0.110
 restaurants-0.106
 burger-0.103
 burgers-0.096
Token map
Token ID3975
Feature activation-3.68e+00

Pos logit contributions
arl+0.005
icked+0.004
imi+0.004
uttered+0.004
ixie+0.004
Neg logit contributions
 distance-0.006
 hamb-0.005
 restaurants-0.005
 burger-0.005
 barbecue-0.005
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
arl+0.235
imi+0.218
ig+0.199
anda+0.196
icked+0.196
Neg logit contributions
 distance-0.286
 restaurants-0.276
 hamb-0.274
 burger-0.258
aster-0.245
Token It
Token ID632
Feature activation+1.46e-01

Pos logit contributions
 distance+0.014
 hamb+0.013
 restaurants+0.013
 burger+0.013
aster+0.012
Neg logit contributions
arl-0.012
imi-0.012
ig-0.011
ixie-0.011
anda-0.011
Token is
Token ID318
Feature activation-3.33e+00

Pos logit contributions
 distance+0.012
 weighed+0.011
 hamb+0.011
 restaurants+0.010
 burger+0.010
Neg logit contributions
arl-0.010
imi-0.008
uff-0.008
ixie-0.008
uttered-0.007
Token just
Token ID655
Feature activation+3.94e-01

Pos logit contributions
arl+0.233
imi+0.218
uttered+0.207
ig+0.207
ixie+0.206
Neg logit contributions
 distance-0.240
 hamb-0.234
 burger-0.220
 restaurants-0.217
 burgers-0.204
Token a
Token ID257
Feature activation-5.21e-02

Pos logit contributions
 distance+0.026
 hamb+0.025
 restaurants+0.024
 burger+0.023
 burgers+0.021
Neg logit contributions
arl-0.030
imi-0.029
ixie-0.027
ig-0.027
uttered-0.026
Token leaf
Token ID12835
Feature activation-6.03e+00

Pos logit contributions
arl+0.003
imi+0.003
icked+0.003
uttered+0.003
ig+0.003
Neg logit contributions
 distance-0.004
 hamb-0.004
 restaurants-0.004
 burger-0.004
aster-0.003
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
arl+0.398
imi+0.376
ig+0.342
icked+0.337
ixie+0.336
Neg logit contributions
 distance-0.456
 restaurants-0.437
 hamb-0.434
 burger-0.410
aster-0.387
Token friends
Token ID2460
Feature activation-1.80e+00

Pos logit contributions
arl+0.123
imi+0.111
uttered+0.098
ig+0.095
ixie+0.094
Neg logit contributions
 distance-0.279
 restaurants-0.270
 hamb-0.270
 burger-0.261
aster-0.249
Token again
Token ID757
Feature activation+7.44e-01

Pos logit contributions
arl+0.091
imi+0.086
 how+0.086
 tweet+0.075
ig+0.075
Neg logit contributions
 distance-0.162
 hamb-0.157
 restaurants-0.153
dogs-0.144
 burger-0.144
Token?"
Token ID1701
Feature activation+4.26e+00

Pos logit contributions
 distance+0.065
 hamb+0.062
 restaurants+0.061
handed+0.059
 burger+0.059
Neg logit contributions
arl-0.041
imi-0.039
 how-0.034
ig-0.034
 tweet-0.032
Token
Token ID198
Feature activation+1.65e-01

Pos logit contributions
 distance+0.398
 hamb+0.383
 restaurants+0.377
asts+0.357
 barbecue+0.357
Neg logit contributions
arl-0.203
imi-0.185
ia-0.170
 how-0.169
icks-0.152
Token
Token ID198
Feature activation+1.76e-01

Pos logit contributions
 distance+0.011
 restaurants+0.011
 hamb+0.011
 burger+0.010
handed+0.009
Neg logit contributions
arl-0.012
imi-0.012
ig-0.011
ixie-0.011
icked-0.011
TokenS
Token ID50
Feature activation+8.39e-01

Pos logit contributions
 distance+0.013
 restaurants+0.013
 hamb+0.013
 burger+0.012
dogs+0.012
Neg logit contributions
arl-0.012
imi-0.011
anda-0.010
ixie-0.010
ig-0.010
Tokenara
Token ID3301
Feature activation+6.29e-01

Pos logit contributions
 distance+0.062
 hamb+0.062
 restaurants+0.061
 burger+0.061
aster+0.061
Neg logit contributions
arl-0.050
imi-0.047
uttered-0.046
 tweet-0.045
 wand-0.045
Token and
Token ID290
Feature activation+6.60e-01

Pos logit contributions
 burger+0.047
 hamb+0.046
 distance+0.046
 restaurants+0.045
 cafe+0.044
Neg logit contributions
arl-0.042
imi-0.035
chie-0.034
uttered-0.032
upid-0.031
Token Tom
Token ID4186
Feature activation+2.09e+00

Pos logit contributions
 distance+0.041
 restaurants+0.040
 hamb+0.039
 burger+0.038
aster+0.036
Neg logit contributions
arl-0.053
imi-0.050
uttered-0.049
ixie-0.048
ig-0.048
Token smile
Token ID8212
Feature activation-9.79e-02

Pos logit contributions
 distance+0.138
 restaurants+0.124
 hamb+0.124
 burger+0.118
aster+0.110
Neg logit contributions
arl-0.163
imi-0.163
ig-0.143
ixie-0.143
uttered-0.142
Token at
Token ID379
Feature activation-9.94e-01

Pos logit contributions
imi+0.008
arl+0.008
ixie+0.008
anda+0.008
ig+0.008
Neg logit contributions
 distance-0.005
 hamb-0.005
 burger-0.005
 restaurants-0.005
dream-0.005
Token.
Token ID13
Feature activation-2.02e-01

Pos logit contributions
arl+0.015
imi+0.014
ig+0.012
ixie+0.012
icked+0.012
Neg logit contributions
 distance-0.020
 restaurants-0.019
 hamb-0.019
 burger-0.018
aster-0.017
Token The
Token ID383
Feature activation+1.40e+00

Pos logit contributions
arl+0.013
imi+0.012
ig+0.011
ixie+0.011
uttered+0.011
Neg logit contributions
 distance-0.015
 hamb-0.015
 restaurants-0.015
 burger-0.014
aster-0.013
Token bird
Token ID6512
Feature activation-1.86e+00

Pos logit contributions
 distance+0.104
 hamb+0.098
 restaurants+0.098
 burger+0.093
aster+0.087
Neg logit contributions
arl-0.095
imi-0.090
icked-0.083
ig-0.082
ixie-0.082
Token will
Token ID481
Feature activation-2.98e+00

Pos logit contributions
arl+0.102
imi+0.085
ixie+0.078
uttered+0.076
uff+0.072
Neg logit contributions
 distance-0.169
 hamb-0.161
 restaurants-0.154
 burger-0.153
 weighed-0.152
Token be
Token ID307
Feature activation-3.12e+00

Pos logit contributions
arl+0.197
imi+0.190
ig+0.182
ixie+0.181
uttered+0.179
Neg logit contributions
 distance-0.234
 hamb-0.208
 restaurants-0.205
aster-0.195
 burger-0.194
Token sad
Token ID6507
Feature activation-1.45e-01

Pos logit contributions
arl+0.205
imi+0.193
ig+0.174
ixie+0.174
icked+0.172
Neg logit contributions
 distance-0.240
 hamb-0.228
 restaurants-0.228
 burger-0.216
aster-0.204
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
arl+0.010
imi+0.009
ixie+0.009
uttered+0.008
ig+0.008
Neg logit contributions
 distance-0.011
 restaurants-0.010
 hamb-0.010
 burger-0.010
asts-0.009
Token And
Token ID843
Feature activation+6.27e-01

Pos logit contributions
arl+0.009
imi+0.008
ig+0.008
ixie+0.008
uttered+0.007
Neg logit contributions
 distance-0.011
 hamb-0.011
 restaurants-0.011
 burger-0.010
aster-0.010
Token the
Token ID262
Feature activation-1.92e-01

Pos logit contributions
 distance+0.050
 restaurants+0.049
 hamb+0.049
 burger+0.047
aster+0.044
Neg logit contributions
arl-0.040
imi-0.036
igg-0.033
uttered-0.033
ixie-0.033
Token baby
Token ID5156
Feature activation-1.40e+00

Pos logit contributions
arl+0.012
imi+0.012
icked+0.011
ixie+0.010
uttered+0.010
Neg logit contributions
 distance-0.015
 hamb-0.014
 restaurants-0.014
 burger-0.014
aster-0.013
Token birds
Token ID10087
Feature activation+1.08e+00

Pos logit contributions
arl+0.091
imi+0.081
ixie+0.074
uttered+0.073
icked+0.072
Neg logit contributions
 distance-0.113
 hamb-0.108
 restaurants-0.107
 burger-0.103
aster-0.096
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
arl+0.098
imi+0.088
ixie+0.085
uttered+0.084
igg+0.084
Neg logit contributions
 distance-0.094
 hamb-0.087
 restaurants-0.087
 burger-0.083
aster-0.081
Token The
Token ID383
Feature activation+1.58e+00

Pos logit contributions
 distance+0.016
 restaurants+0.015
 hamb+0.015
 burger+0.014
aster+0.014
Neg logit contributions
arl-0.016
imi-0.015
ig-0.014
anda-0.014
ixie-0.014
Token net
Token ID2010
Feature activation-8.31e-01

Pos logit contributions
 distance+0.125
 restaurants+0.111
 hamb+0.111
 burger+0.109
aster+0.104
Neg logit contributions
arl-0.104
imi-0.098
icked-0.092
ixie-0.091
anda-0.090
Token got
Token ID1392
Feature activation+7.43e-02

Pos logit contributions
arl+0.069
imi+0.063
ixie+0.058
uff+0.054
upp+0.053
Neg logit contributions
 distance-0.056
 hamb-0.049
 weighed-0.047
 burger-0.045
 restaurants-0.044
Token stuck
Token ID7819
Feature activation-1.91e+00

Pos logit contributions
 distance+0.004
 hamb+0.004
 thirsty+0.004
 burgers+0.004
 burger+0.004
Neg logit contributions
arl-0.006
imi-0.006
ixie-0.006
ig-0.006
anda-0.005
Token on
Token ID319
Feature activation+1.99e+00

Pos logit contributions
arl+0.130
imi+0.127
ixie+0.115
 tweet+0.113
anda+0.113
Neg logit contributions
 distance-0.140
 hamb-0.128
 restaurants-0.126
 burger-0.120
handed-0.113
Token the
Token ID262
Feature activation-6.75e-04

Pos logit contributions
 distance+0.142
 restaurants+0.131
 hamb+0.131
aster+0.124
 burger+0.121
Neg logit contributions
arl-0.147
imi-0.131
anda-0.126
ixie-0.125
icked-0.124
Token branch
Token ID8478
Feature activation+8.51e-02

Pos logit contributions
arl+0.000
imi+0.000
ig+0.000
ixie+0.000
 tweet+0.000
Neg logit contributions
 distance-0.000
 restaurants-0.000
 hamb-0.000
 burger-0.000
handed-0.000
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
 distance+0.006
 hamb+0.006
 restaurants+0.005
 burger+0.005
aster+0.005
Neg logit contributions
arl-0.006
imi-0.006
ixie-0.005
ig-0.005
icked-0.005
Token L
Token ID406
Feature activation-3.61e+00

Pos logit contributions
 distance+0.016
 restaurants+0.015
 hamb+0.015
 burger+0.014
aster+0.013
Neg logit contributions
arl-0.015
imi-0.015
ig-0.014
anda-0.014
ixie-0.013
Tokenila
Token ID10102
Feature activation-1.95e+00

Pos logit contributions
arl+0.201
imi+0.182
 tweet+0.179
 fears+0.167
 wand+0.167
Neg logit contributions
 distance-0.282
 restaurants-0.277
aster-0.270
 hamb-0.262
line-0.261
Token Max
Token ID5436
Feature activation-6.63e-01

Pos logit contributions
 distance+0.009
 restaurants+0.008
 hamb+0.008
 burger+0.008
handed+0.007
Neg logit contributions
arl-0.008
imi-0.008
icked-0.007
ixie-0.007
ig-0.007
Token wanted
Token ID2227
Feature activation-8.62e-01

Pos logit contributions
arl+0.055
imi+0.048
uttered+0.044
ig+0.044
icked+0.043
Neg logit contributions
 distance-0.044
 hamb-0.043
 restaurants-0.041
 burger-0.040
 cafĂ©-0.036
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
imi+0.034
arl+0.034
 how+0.033
 eight+0.028
 tweet+0.028
Neg logit contributions
 distance-0.082
 restaurants-0.079
 hamb-0.078
handed-0.075
 burger-0.075
Token cord
Token ID15050
Feature activation-3.50e+00

Pos logit contributions
arl+0.007
icked+0.007
anda+0.007
imi+0.007
ig+0.006
Neg logit contributions
 distance-0.010
 restaurants-0.009
 hamb-0.009
 burger-0.008
 barbecue-0.008
Token too
Token ID1165
Feature activation-2.85e+00

Pos logit contributions
arl+0.204
imi+0.191
ixie+0.169
icked+0.167
uttered+0.166
Neg logit contributions
 distance-0.301
 restaurants-0.282
 hamb-0.282
 burger-0.270
handed-0.255
Token,
Token ID11
Feature activation-4.52e-01

Pos logit contributions
arl+0.157
imi+0.146
ig+0.129
ixie+0.128
icked+0.128
Neg logit contributions
 distance-0.249
 restaurants-0.239
 hamb-0.238
 burger-0.228
aster-0.216
Token but
Token ID475
Feature activation+4.83e+00

Pos logit contributions
arl+0.033
imi+0.032
 how+0.031
 tweet+0.031
ig+0.030
Neg logit contributions
 distance-0.029
 restaurants-0.028
 hamb-0.027
handed-0.026
 burger-0.026
Token he
Token ID339
Feature activation+2.25e+00

Pos logit contributions
 distance+0.320
 hamb+0.300
 restaurants+0.299
 burger+0.282
aster+0.260
Neg logit contributions
arl-0.380
imi-0.354
ig-0.330
ixie-0.325
igg-0.324
Token did
Token ID750
Feature activation-2.88e+00

Pos logit contributions
 hamb+0.151
 distance+0.149
 burger+0.141
 restaurants+0.137
 weighed+0.133
Neg logit contributions
arl-0.187
imi-0.152
uttered-0.142
icks-0.142
 tweet-0.140
Token not
Token ID407
Feature activation-1.64e+00

Pos logit contributions
arl+0.163
imi+0.150
ixie+0.134
uttered+0.131
ig+0.131
Neg logit contributions
 distance-0.251
 restaurants-0.243
 hamb-0.240
 burger-0.230
aster-0.223
Token want
Token ID765
Feature activation-5.72e-01

Pos logit contributions
arl+0.102
imi+0.094
ig+0.085
 tweet+0.085
icked+0.084
Neg logit contributions
 distance-0.130
 hamb-0.126
 restaurants-0.124
 burger-0.120
handed-0.113
Token as
Token ID355
Feature activation-2.90e+00

Pos logit contributions
arl+0.106
imi+0.099
ig+0.090
ixie+0.089
icked+0.089
Neg logit contributions
 distance-0.143
 restaurants-0.138
 hamb-0.137
 burger-0.130
aster-0.124
Token he
Token ID339
Feature activation+2.35e+00

Pos logit contributions
arl+0.206
imi+0.198
ig+0.181
 how+0.180
icked+0.180
Neg logit contributions
 distance-0.201
 restaurants-0.189
 hamb-0.188
 burger-0.178
handed-0.168
Token could
Token ID714
Feature activation-2.84e-01

Pos logit contributions
 distance+0.158
 hamb+0.149
 restaurants+0.147
 burger+0.139
aster+0.130
Neg logit contributions
arl-0.181
imi-0.171
ixie-0.154
uttered-0.152
anda-0.152
Token use
Token ID779
Feature activation-5.12e-01

Pos logit contributions
arl+0.019
imi+0.018
ig+0.016
icked+0.016
ixie+0.016
Neg logit contributions
 distance-0.021
 restaurants-0.020
 hamb-0.020
 burger-0.019
handed-0.018
Token it
Token ID340
Feature activation+1.35e+00

Pos logit contributions
arl+0.035
imi+0.033
ig+0.030
ixie+0.030
icked+0.030
Neg logit contributions
 distance-0.038
 restaurants-0.036
 hamb-0.035
 burger-0.034
aster-0.032
Token as
Token ID355
Feature activation-2.91e+00

Pos logit contributions
 distance+0.101
 hamb+0.093
 restaurants+0.093
 burger+0.087
handed+0.084
Neg logit contributions
arl-0.095
imi-0.091
ixie-0.080
anda-0.079
igg-0.079
Token a
Token ID257
Feature activation-9.46e-02

Pos logit contributions
arl+0.200
imi+0.189
ig+0.172
icked+0.171
ixie+0.171
Neg logit contributions
 distance-0.213
 restaurants-0.202
 hamb-0.201
 burger-0.190
handed-0.179
Token blanket
Token ID18447
Feature activation-5.17e-02

Pos logit contributions
arl+0.006
imi+0.006
ig+0.005
ixie+0.005
anda+0.005
Neg logit contributions
 distance-0.007
 restaurants-0.007
 hamb-0.007
 burger-0.007
handed-0.006
Token or
Token ID393
Feature activation+3.16e+00

Pos logit contributions
arl+0.003
imi+0.003
ig+0.003
icked+0.003
ixie+0.003
Neg logit contributions
 distance-0.004
 restaurants-0.004
 hamb-0.004
 burger-0.004
aster-0.004
Token cushion
Token ID33579
Feature activation-5.10e+00

Pos logit contributions
 distance+0.246
 restaurants+0.236
 hamb+0.234
 burger+0.222
handed+0.213
Neg logit contributions
arl-0.195
imi-0.184
 tweet-0.167
icked-0.167
 how-0.166
Token whenever
Token ID8797
Feature activation+1.33e+00

Pos logit contributions
arl+0.334
imi+0.314
ig+0.285
ixie+0.282
icked+0.282
Neg logit contributions
 distance-0.389
 restaurants-0.371
 hamb-0.369
 burger-0.350
aster-0.330
Token went
Token ID1816
Feature activation-7.94e-01

Pos logit contributions
 distance+0.027
 restaurants+0.026
 hamb+0.025
 burger+0.024
handed+0.023
Neg logit contributions
arl-0.026
imi-0.026
ig-0.024
ixie-0.023
icked-0.023
Token to
Token ID284
Feature activation-2.63e-01

Pos logit contributions
arl+0.052
imi+0.049
icked+0.044
ig+0.044
ixie+0.043
Neg logit contributions
 distance-0.062
 restaurants-0.059
 hamb-0.059
 burger-0.055
aster-0.052
Token his
Token ID465
Feature activation+1.08e+00

Pos logit contributions
 how+0.012
arl+0.011
imi+0.011
 my+0.010
 me+0.010
Neg logit contributions
 distance-0.024
 hamb-0.023
 burger-0.023
dogs-0.022
Ice-0.022
Token desk
Token ID6915
Feature activation+2.65e+00

Pos logit contributions
 distance+0.057
 restaurants+0.055
 hamb+0.052
 burger+0.049
 cafĂ©+0.044
Neg logit contributions
arl-0.102
imi-0.091
icked-0.091
 tweet-0.089
uttered-0.089
Token and
Token ID290
Feature activation+3.71e-01

Pos logit contributions
 distance+0.235
 hamb+0.233
 restaurants+0.228
 burger+0.221
Ice+0.213
Neg logit contributions
arl-0.127
imi-0.120
 how-0.115
ig-0.114
ixie-0.105
Token looked
Token ID3114
Feature activation-7.58e-01

Pos logit contributions
 distance+0.029
 restaurants+0.027
 hamb+0.027
 burger+0.026
asts+0.025
Neg logit contributions
 how-0.021
imi-0.021
arl-0.021
icked-0.021
ig-0.020
Token around
Token ID1088
Feature activation-1.73e+00

Pos logit contributions
arl+0.039
imi+0.035
ig+0.032
 how+0.032
icked+0.031
Neg logit contributions
 distance-0.068
 restaurants-0.066
 hamb-0.065
 burger-0.063
handed-0.061
Token for
Token ID329
Feature activation+4.38e-01

Pos logit contributions
arl+0.081
 how+0.078
imi+0.076
ig+0.069
 wand+0.068
Neg logit contributions
 distance-0.152
 restaurants-0.150
 hamb-0.148
 burger-0.144
handed-0.143
Token an
Token ID281
Feature activation+1.94e+00

Pos logit contributions
 distance+0.037
 hamb+0.033
 burger+0.033
ding+0.033
Ice+0.033
Neg logit contributions
 how-0.025
 eyes-0.022
 bud-0.020
 eight-0.019
 luck-0.019
Token idea
Token ID2126
Feature activation-1.78e+00

Pos logit contributions
 distance+0.150
 hamb+0.141
 restaurants+0.140
 burger+0.135
handed+0.126
Neg logit contributions
arl-0.121
imi-0.121
ig-0.110
 how-0.109
anda-0.107
Token.
Token ID13
Feature activation+2.28e-01

Pos logit contributions
arl+0.092
 how+0.089
imi+0.086
icked+0.083
ig+0.081
Neg logit contributions
 distance-0.154
 restaurants-0.148
 hamb-0.146
 burger-0.141
aster-0.135
Token the
Token ID262
Feature activation+8.86e-02

Pos logit contributions
 distance+0.190
 restaurants+0.179
 hamb+0.177
 burger+0.170
handed+0.169
Neg logit contributions
arl-0.101
imi-0.099
 how-0.093
ig-0.085
ra-0.082
Token stove
Token ID27635
Feature activation+6.16e+00

Pos logit contributions
 distance+0.004
 restaurants+0.004
handed+0.004
 hamb+0.004
John+0.004
Neg logit contributions
arl-0.007
imi-0.007
 wand-0.007
ig-0.007
anda-0.007
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
 distance+0.508
 restaurants+0.487
 hamb+0.486
 burger+0.461
handed+0.444
Neg logit contributions
arl-0.353
imi-0.332
ig-0.298
 how-0.296
icked-0.293
Token They
Token ID1119
Feature activation-4.34e-01

Pos logit contributions
 distance+0.034
 restaurants+0.032
 hamb+0.032
 burger+0.030
handed+0.030
Neg logit contributions
arl-0.027
imi-0.025
icked-0.022
ixie-0.022
uttered-0.022
Token are
Token ID389
Feature activation-4.70e-01

Pos logit contributions
arl+0.034
imi+0.032
uttered+0.030
ixie+0.030
ig+0.029
Neg logit contributions
 distance-0.029
 restaurants-0.028
 hamb-0.028
 burger-0.026
aster-0.025
Token careful
Token ID8161
Feature activation-1.19e+00

Pos logit contributions
arl+0.029
imi+0.028
 how+0.025
icked+0.025
icks+0.024
Neg logit contributions
 distance-0.038
 restaurants-0.035
 hamb-0.035
handed-0.033
 burger-0.033
Token not
Token ID407
Feature activation-1.47e+00

Pos logit contributions
arl+0.065
imi+0.062
 how+0.054
ig+0.054
anda+0.053
Neg logit contributions
 distance-0.103
 hamb-0.099
 restaurants-0.099
 burger-0.094
handed-0.090
Token to
Token ID284
Feature activation-1.97e-01

Pos logit contributions
arl+0.089
imi+0.084
ig+0.076
ixie+0.076
uttered+0.074
Neg logit contributions
 distance-0.120
 hamb-0.115
 restaurants-0.115
 burger-0.110
aster-0.103
Token touch
Token ID3638
Feature activation+4.75e-01

Pos logit contributions
arl+0.013
imi+0.012
 how+0.011
 bud+0.011
 tweet+0.011
Neg logit contributions
 distance-0.015
 hamb-0.014
 restaurants-0.014
 burger-0.013
handed-0.013
Token the
Token ID262
Feature activation+6.47e-02

Pos logit contributions
 distance+0.039
 restaurants+0.037
 hamb+0.037
 burger+0.035
handed+0.034
Neg logit contributions
arl-0.028
imi-0.026
ig-0.024
 how-0.023
icked-0.023
Token hot
Token ID3024
Feature activation+4.12e+00

Pos logit contributions
 distance+0.004
 restaurants+0.004
handed+0.004
John+0.004
aster+0.004
Neg logit contributions
arl-0.004
imi-0.004
anda-0.004
 wand-0.004
 bud-0.004
Token again
Token ID757
Feature activation+6.50e-01

Pos logit contributions
arl+0.103
uttered+0.097
ra+0.096
ixie+0.094
anda+0.094
Neg logit contributions
 distance-0.081
 hamb-0.074
 restaurants-0.069
dream-0.069
 burger-0.068
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 distance+0.052
 hamb+0.050
 restaurants+0.050
 burger+0.048
dogs+0.045
Neg logit contributions
arl-0.039
imi-0.038
ig-0.034
 how-0.033
 tweet-0.033
Token But
Token ID887
Feature activation+2.25e+00

Pos logit contributions
 distance+0.009
 restaurants+0.008
 hamb+0.008
 burger+0.007
Ice+0.007
Neg logit contributions
arl-0.011
ig-0.010
imi-0.009
anda-0.009
ixie-0.009
Token then
Token ID788
Feature activation-1.15e+00

Pos logit contributions
 distance+0.136
 hamb+0.123
 restaurants+0.122
aster+0.114
 burger+0.110
Neg logit contributions
arl-0.210
imi-0.175
anda-0.169
igg-0.168
ixie-0.167
Token,
Token ID11
Feature activation-4.47e-01

Pos logit contributions
arl+0.102
imi+0.079
igg+0.075
anda+0.074
ixie+0.073
Neg logit contributions
 restaurants-0.078
 hamb-0.075
 distance-0.074
 burger-0.069
aster-0.066
Token her
Token ID607
Feature activation-1.51e+00

Pos logit contributions
arl+0.031
imi+0.029
ig+0.026
ixie+0.026
icked+0.026
Neg logit contributions
 distance-0.033
 restaurants-0.031
 hamb-0.031
 burger-0.029
aster-0.028
Token mom
Token ID1995
Feature activation-2.30e-01

Pos logit contributions
arl+0.096
imi+0.090
icked+0.084
uttered+0.079
igg+0.079
Neg logit contributions
 distance-0.121
 hamb-0.112
 burger-0.109
 restaurants-0.108
asts-0.100
Tokenmy
Token ID1820
Feature activation+3.22e-01

Pos logit contributions
arl+0.017
imi+0.016
ig+0.015
icked+0.015
ixie+0.015
Neg logit contributions
 distance-0.016
 restaurants-0.015
 hamb-0.015
 burger-0.014
handed-0.013
Token came
Token ID1625
Feature activation+1.05e+00

Pos logit contributions
 distance+0.023
 restaurants+0.022
 hamb+0.021
 burger+0.020
handed+0.019
Neg logit contributions
arl-0.023
imi-0.022
icked-0.020
ig-0.020
ixie-0.020
Token outside
Token ID2354
Feature activation+2.04e+00

Pos logit contributions
 distance+0.085
 restaurants+0.080
 hamb+0.080
 burger+0.076
aster+0.072
Neg logit contributions
arl-0.066
imi-0.062
ig-0.055
ixie-0.055
icked-0.055
Token and
Token ID290
Feature activation+3.30e-01

Pos logit contributions
 distance+0.173
 restaurants+0.169
 hamb+0.168
 burger+0.161
handed+0.153
Neg logit contributions
arl-0.112
imi-0.104
 how-0.095
ig-0.090
 tweet-0.090
TokenOnce
Token ID7454
Feature activation-2.47e+00

Pos logit contributions
arl+0.102
imi+0.097
uttered+0.086
ig+0.086
icked+0.085
Neg logit contributions
 hamb-0.155
 distance-0.150
 restaurants-0.146
 burger-0.146
 french-0.141
Token upon
Token ID2402
Feature activation-3.14e+00

Pos logit contributions
arl+0.169
 tweet+0.154
ra+0.150
ps+0.148
imi+0.147
Neg logit contributions
 distance-0.177
 restaurants-0.161
 hamb-0.150
 burger-0.147
aster-0.143
Token a
Token ID257
Feature activation-1.70e-01

Pos logit contributions
 how+0.105
imi+0.088
arl+0.082
 tweet+0.074
 is+0.072
Neg logit contributions
 distance-0.328
 hamb-0.323
dogs-0.316
 restaurants-0.314
 burger-0.308
Token time
Token ID640
Feature activation+6.64e-01

Pos logit contributions
arl+0.013
imi+0.012
ig+0.011
ixie+0.011
anda+0.011
Neg logit contributions
 distance-0.011
 restaurants-0.011
 hamb-0.011
 burger-0.010
handed-0.009
Token,
Token ID11
Feature activation-4.61e-01

Pos logit contributions
 distance+0.043
 restaurants+0.043
 hamb+0.040
dream+0.037
 burger+0.037
Neg logit contributions
arl-0.048
imi-0.042
icked-0.041
ps-0.041
igg-0.041
Token there
Token ID612
Feature activation-1.35e+00

Pos logit contributions
arl+0.040
imi+0.037
ixie+0.036
 fears+0.034
igg+0.034
Neg logit contributions
 distance-0.025
 restaurants-0.022
 hamb-0.021
 burger-0.020
aster-0.020
Token was
Token ID373
Feature activation+1.42e-01

Pos logit contributions
arl+0.083
imi+0.078
ig+0.070
ixie+0.069
icked+0.069
Neg logit contributions
 distance-0.109
 restaurants-0.104
 hamb-0.104
 burger-0.099
aster-0.093
Token a
Token ID257
Feature activation-1.73e-01

Pos logit contributions
 distance+0.014
 restaurants+0.013
 hamb+0.013
 burger+0.013
line+0.012
Neg logit contributions
imi-0.006
arl-0.006
 how-0.005
igg-0.005
icked-0.005
Token girl
Token ID2576
Feature activation+6.90e-01

Pos logit contributions
arl+0.009
imi+0.008
ig+0.007
anda+0.007
ixie+0.007
Neg logit contributions
 distance-0.016
 restaurants-0.015
 hamb-0.015
 burger-0.014
handed-0.014
Token named
Token ID3706
Feature activation+3.63e+00

Pos logit contributions
 distance+0.047
 restaurants+0.046
 hamb+0.045
 burger+0.042
dogs+0.040
Neg logit contributions
imi-0.050
arl-0.049
ig-0.046
anda-0.045
 how-0.044
Token Lily
Token ID20037
Feature activation-1.12e+00

Pos logit contributions
 distance+0.258
 restaurants+0.248
 hamb+0.244
 burger+0.228
aster+0.221
Neg logit contributions
arl-0.252
imi-0.232
ig-0.228
igg-0.218
icked-0.216
Token happy
Token ID3772
Feature activation+6.64e-01

Pos logit contributions
 distance+0.052
 hamb+0.043
 restaurants+0.042
handed+0.039
 burger+0.038
Neg logit contributions
imi-0.079
arl-0.079
ig-0.073
ixie-0.070
icked-0.069
Token to
Token ID284
Feature activation-2.28e-01

Pos logit contributions
 distance+0.054
 hamb+0.051
 restaurants+0.051
 burger+0.049
handed+0.046
Neg logit contributions
arl-0.042
imi-0.038
ixie-0.034
ig-0.034
icked-0.034
Token escape
Token ID6654
Feature activation-1.50e+00

Pos logit contributions
arl+0.016
imi+0.016
ixie+0.014
ig+0.014
anda+0.014
Neg logit contributions
 distance-0.016
 restaurants-0.015
 hamb-0.015
 burger-0.014
aster-0.014
Token.
Token ID13
Feature activation-3.41e-02

Pos logit contributions
arl+0.092
imi+0.086
ig+0.078
ixie+0.077
icked+0.077
Neg logit contributions
 distance-0.120
 restaurants-0.115
 hamb-0.115
 burger-0.109
aster-0.104
Token It
Token ID632
Feature activation+1.99e-01

Pos logit contributions
arl+0.003
imi+0.003
ig+0.002
anda+0.002
icked+0.002
Neg logit contributions
 distance-0.002
 restaurants-0.002
 hamb-0.002
 burger-0.002
handed-0.002
Token fl
Token ID781
Feature activation-8.63e+00

Pos logit contributions
 distance+0.014
 hamb+0.011
 weighed+0.011
 restaurants+0.011
 burger+0.010
Neg logit contributions
arl-0.018
imi-0.016
uff-0.014
icks-0.014
ig-0.014
Tokenapped
Token ID6320
Feature activation-2.51e+00

Pos logit contributions
imi+0.449
 appearances+0.408
 thinks+0.392
 bud+0.386
 how+0.377
Neg logit contributions
 distance-0.681
line-0.632
dream-0.623
asts-0.609
handed-0.605
Token its
Token ID663
Feature activation-8.01e-01

Pos logit contributions
arl+0.186
imi+0.173
anda+0.163
ixie+0.162
igg+0.146
Neg logit contributions
 distance-0.187
 hamb-0.159
 burger-0.152
 restaurants-0.150
dream-0.147
Token wings
Token ID12098
Feature activation-6.62e+00

Pos logit contributions
arl+0.050
imi+0.050
ixie+0.037
uff+0.036
icked+0.036
Neg logit contributions
 distance-0.074
 hamb-0.060
ns-0.057
dream-0.057
 weighed-0.056
Token and
Token ID290
Feature activation+2.25e-01

Pos logit contributions
arl+0.454
imi+0.437
anda+0.383
ixie+0.379
icked+0.376
Neg logit contributions
 distance-0.515
 hamb-0.464
 restaurants-0.453
 burger-0.437
handed-0.412
Token buzz
Token ID14713
Feature activation-4.11e+00

Pos logit contributions
 distance+0.019
 hamb+0.018
 restaurants+0.018
 burger+0.017
aster+0.016
Neg logit contributions
arl-0.014
imi-0.013
anda-0.011
ig-0.011
ixie-0.011
Token to
Token ID284
Feature activation-2.86e-01

Pos logit contributions
arl+0.220
imi+0.209
ixie+0.193
anda+0.181
icked+0.180
Neg logit contributions
 distance-0.170
 restaurants-0.133
 hamb-0.130
 burger-0.128
 especially-0.128
Token each
Token ID1123
Feature activation-1.88e+00

Pos logit contributions
arl+0.011
imi+0.010
ig+0.009
 how+0.009
 tweet+0.009
Neg logit contributions
 distance-0.029
 restaurants-0.028
 hamb-0.028
 burger-0.027
handed-0.026
Token other
Token ID584
Feature activation+6.03e+00

Pos logit contributions
arl+0.143
imi+0.138
ig+0.128
 how+0.125
ixie+0.124
Neg logit contributions
 distance-0.118
 hamb-0.117
 restaurants-0.115
 burger-0.108
handed-0.101
Token and
Token ID290
Feature activation+4.55e-01

Pos logit contributions
 distance+0.461
 restaurants+0.435
 hamb+0.430
 burger+0.411
aster+0.386
Neg logit contributions
arl-0.403
imi-0.380
icked-0.343
ixie-0.341
uttered-0.339
Token see
Token ID766
Feature activation+4.19e+00

Pos logit contributions
 distance+0.034
 hamb+0.034
 restaurants+0.034
handed+0.032
 burger+0.032
Neg logit contributions
arl-0.027
imi-0.025
 how-0.025
 tweet-0.025
ig-0.024
Token how
Token ID703
Feature activation-1.07e+01

Pos logit contributions
 distance+0.377
 restaurants+0.368
 hamb+0.363
 burger+0.351
 burgers+0.331
Neg logit contributions
arl-0.220
imi-0.203
ixie-0.179
uttered-0.178
anda-0.174
Token far
Token ID1290
Feature activation-1.69e+00

Pos logit contributions
arl+0.723
imi+0.703
ig+0.627
ixie+0.616
icked+0.607
Neg logit contributions
 distance-0.812
 hamb-0.756
 restaurants-0.753
 burger-0.728
Ice-0.685
Token it
Token ID340
Feature activation+1.30e+00

Pos logit contributions
arl+0.117
imi+0.114
ixie+0.106
anda+0.102
uttered+0.102
Neg logit contributions
 distance-0.127
 restaurants-0.115
 hamb-0.113
 burger-0.113
aster-0.104
Token could
Token ID714
Feature activation-4.20e-01

Pos logit contributions
 distance+0.094
 restaurants+0.086
 hamb+0.085
 burger+0.080
 weighed+0.077
Neg logit contributions
arl-0.098
imi-0.090
ixie-0.082
ig-0.080
anda-0.079
Token go
Token ID467
Feature activation-5.54e-01

Pos logit contributions
arl+0.028
imi+0.026
ixie+0.024
ig+0.024
icked+0.023
Neg logit contributions
 distance-0.033
 restaurants-0.030
 hamb-0.030
 burger-0.029
aster-0.027
Token.
Token ID13
Feature activation+3.50e-01

Pos logit contributions
arl+0.037
imi+0.034
ig+0.031
ixie+0.031
anda+0.031
Neg logit contributions
 distance-0.042
 restaurants-0.040
 hamb-0.040
 burger-0.038
aster-0.036
Token grow
Token ID1663
Feature activation-5.97e+00

Pos logit contributions
arl+0.189
imi+0.184
uttered+0.169
ig+0.169
ixie+0.169
Neg logit contributions
 distance-0.234
 restaurants-0.217
 hamb-0.214
 burger-0.202
aster-0.196
Token up
Token ID510
Feature activation-4.86e-01

Pos logit contributions
arl+0.379
imi+0.360
ig+0.322
ixie+0.321
icked+0.317
Neg logit contributions
 distance-0.469
 restaurants-0.449
 hamb-0.446
 burger-0.424
asts-0.396
Token well
Token ID880
Feature activation-8.48e-01

Pos logit contributions
arl+0.027
imi+0.025
 how+0.023
ig+0.022
icked+0.022
Neg logit contributions
 distance-0.041
 restaurants-0.040
 hamb-0.039
 burger-0.037
handed-0.037
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
arl+0.053
imi+0.049
ig+0.045
icked+0.044
 how+0.044
Neg logit contributions
 distance-0.067
 restaurants-0.064
 hamb-0.064
 burger-0.060
aster-0.058
Token He
Token ID679
Feature activation+6.78e-01

Pos logit contributions
 distance+0.030
 hamb+0.028
 restaurants+0.028
 burger+0.027
aster+0.025
Neg logit contributions
arl-0.029
imi-0.027
ixie-0.024
icked-0.024
 how-0.024
Token thinks
Token ID6834
Feature activation-8.37e+00

Pos logit contributions
 hamb+0.056
 distance+0.056
 burger+0.054
 weighed+0.054
 restaurants+0.053
Neg logit contributions
arl-0.042
imi-0.036
uttered-0.032
uff-0.030
ixie-0.029
Token they
Token ID484
Feature activation+6.67e-01

Pos logit contributions
arl+0.562
imi+0.524
ig+0.477
ixie+0.475
anda+0.471
Neg logit contributions
 distance-0.627
 restaurants-0.604
 hamb-0.603
 burger-0.572
aster-0.542
Token are
Token ID389
Feature activation-4.57e-01

Pos logit contributions
 distance+0.064
 restaurants+0.062
 hamb+0.061
 burger+0.059
handed+0.056
Neg logit contributions
arl-0.032
imi-0.029
ixie-0.024
ig-0.024
uttered-0.024
Token smart
Token ID4451
Feature activation-6.44e-01

Pos logit contributions
arl+0.029
imi+0.027
ig+0.025
ixie+0.025
uttered+0.025
Neg logit contributions
 distance-0.036
 restaurants-0.035
 hamb-0.035
 burger-0.033
aster-0.031
Token.
Token ID13
Feature activation+4.03e-01

Pos logit contributions
arl+0.043
imi+0.041
ixie+0.037
ig+0.037
uttered+0.036
Neg logit contributions
 distance-0.048
 restaurants-0.046
 hamb-0.046
 burger-0.044
handed-0.041
Token He
Token ID679
Feature activation+6.69e-01

Pos logit contributions
 distance+0.029
 hamb+0.028
 restaurants+0.028
 burger+0.027
aster+0.025
Neg logit contributions
arl-0.029
imi-0.027
ixie-0.024
icked-0.024
 how-0.024
Token Sam
Token ID3409
Feature activation-1.57e+00

Pos logit contributions
arl+0.013
imi+0.011
uttered+0.011
icked+0.011
ixie+0.011
Neg logit contributions
 distance-0.016
 hamb-0.016
 restaurants-0.016
 burger-0.015
aster-0.015
Token."
Token ID526
Feature activation+2.13e+00

Pos logit contributions
arl+0.123
imi+0.113
uttered+0.110
ixie+0.107
icked+0.106
Neg logit contributions
 distance-0.101
 restaurants-0.095
 hamb-0.094
 burger-0.090
handed-0.082
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
 distance+0.173
 hamb+0.169
 restaurants+0.168
 burger+0.159
dogs+0.151
Neg logit contributions
arl-0.130
imi-0.126
 how-0.109
 tweet-0.107
icked-0.107
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
 distance+0.008
 restaurants+0.007
 hamb+0.007
 burger+0.007
aster+0.007
Neg logit contributions
arl-0.009
imi-0.009
ig-0.008
ixie-0.008
icked-0.008
TokenSam
Token ID16305
Feature activation+2.08e-01

Pos logit contributions
 distance+0.008
 restaurants+0.008
 hamb+0.008
 burger+0.007
aster+0.007
Neg logit contributions
arl-0.009
imi-0.008
ig-0.007
icked-0.007
ixie-0.007
Token thinks
Token ID6834
Feature activation-8.36e+00

Pos logit contributions
 distance+0.017
 restaurants+0.016
 hamb+0.016
 burger+0.016
 cafĂ©+0.015
Neg logit contributions
arl-0.014
imi-0.011
uttered-0.011
chie-0.011
icked-0.010
Token for
Token ID329
Feature activation+4.99e-01

Pos logit contributions
arl+0.567
imi+0.499
uttered+0.493
ra+0.492
 tweet+0.483
Neg logit contributions
 hamb-0.578
 distance-0.569
 restaurants-0.567
aster-0.554
 burger-0.549
Token a
Token ID257
Feature activation+7.66e-02

Pos logit contributions
 distance+0.041
 restaurants+0.038
 hamb+0.038
 burger+0.037
handed+0.035
Neg logit contributions
arl-0.030
imi-0.029
 how-0.026
icked-0.025
ixie-0.025
Token moment
Token ID2589
Feature activation-1.29e+00

Pos logit contributions
 distance+0.005
 restaurants+0.004
 hamb+0.004
 burger+0.004
handed+0.004
Neg logit contributions
arl-0.006
imi-0.006
ig-0.005
ixie-0.005
icked-0.005
Token.
Token ID13
Feature activation+3.84e-01

Pos logit contributions
arl+0.081
imi+0.076
ig+0.069
 how+0.068
icked+0.068
Neg logit contributions
 distance-0.102
 restaurants-0.099
 hamb-0.097
 burger-0.092
handed-0.088
Token He
Token ID679
Feature activation+6.00e-01

Pos logit contributions
 distance+0.019
 hamb+0.018
 restaurants+0.018
 burger+0.017
dogs+0.015
Neg logit contributions
arl-0.035
imi-0.034
ig-0.032
anda-0.032
ixie-0.032
Token."
Token ID526
Feature activation+2.01e+00

Pos logit contributions
arl+0.280
imi+0.250
ig+0.233
 how+0.230
icked+0.227
Neg logit contributions
 distance-0.384
 hamb-0.373
 restaurants-0.367
 burger-0.350
dogs-0.333
Token
Token ID198
Feature activation-6.36e-02

Pos logit contributions
 distance+0.154
 hamb+0.146
 restaurants+0.146
 burger+0.138
handed+0.130
Neg logit contributions
arl-0.135
imi-0.127
ixie-0.114
icked-0.114
ig-0.114
Token
Token ID198
Feature activation-5.42e-02

Pos logit contributions
arl+0.005
imi+0.005
ig+0.004
ixie+0.004
icked+0.004
Neg logit contributions
 distance-0.004
 restaurants-0.004
 hamb-0.004
 burger-0.004
aster-0.003
TokenS
Token ID50
Feature activation+7.51e-01

Pos logit contributions
arl+0.004
imi+0.004
ixie+0.003
ig+0.003
anda+0.003
Neg logit contributions
 distance-0.004
 restaurants-0.004
 hamb-0.004
 burger-0.004
asts-0.003
Tokenara
Token ID3301
Feature activation+5.82e-01

Pos logit contributions
 hamb+0.053
 restaurants+0.052
 distance+0.052
 burger+0.051
aster+0.049
Neg logit contributions
arl-0.048
imi-0.045
 tweet-0.045
 wand-0.044
icked-0.043
Token thinks
Token ID6834
Feature activation-8.39e+00

Pos logit contributions
 hamb+0.043
 restaurants+0.042
 burger+0.041
 weighed+0.041
 distance+0.040
Neg logit contributions
arl-0.042
imi-0.035
chie-0.031
uttered-0.031
 tweet-0.030
Token for
Token ID329
Feature activation+4.69e-01

Pos logit contributions
arl+0.574
imi+0.496
 tweet+0.487
ixie+0.482
anda+0.478
Neg logit contributions
 hamb-0.589
 restaurants-0.585
 distance-0.575
aster-0.565
 burger-0.561
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
 distance+0.042
 hamb+0.040
 restaurants+0.040
 burger+0.038
handed+0.037
Neg logit contributions
arl-0.022
 how-0.022
imi-0.022
icked-0.019
ixie-0.019
Token moment
Token ID2589
Feature activation-1.31e+00

Pos logit contributions
 distance+0.001
 restaurants+0.001
 hamb+0.001
 burger+0.001
handed+0.001
Neg logit contributions
arl-0.001
imi-0.001
ig-0.001
ixie-0.001
 tweet-0.001
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
arl+0.083
imi+0.079
ig+0.071
 how+0.071
icked+0.070
Neg logit contributions
 distance-0.102
 restaurants-0.099
 hamb-0.098
 burger-0.093
handed-0.089
Token She
Token ID1375
Feature activation-1.65e+00

Pos logit contributions
 distance+0.017
 restaurants+0.017
 hamb+0.016
 burger+0.016
Ice+0.015
Neg logit contributions
arl-0.016
imi-0.015
ig-0.015
anda-0.014
ixie-0.013