TOP ACTIVATIONS
MAX = 0.283

to eat and so he slur ped up the apples au
at Tom . He was slur ping his soup and making
worm with her eye and slur ped it up with a
with one bite , he slur ped it all up .
bled it all up , slur ping and sav oring every
he lun ged forward and slur ped it all up .
soon he heard a big slur p and a splash coming
to receive it . He slur ped it up with delight
into his cup . He slur ped it up and felt
it , and then you slur p it down your mouth
. She recognizes the bird c age . It belongs to
goodbye . They said " c aw , c aw ,
made a c aw - c aw sound . They tried
the door of the bird c age . It is locked
. She carried the bird c age to the window sill
He shows Mom the bird c age . "
bird in a black bird c age . The bird was
They look inside the bird c age . They see a
is sitting in the bird c age . That 's why
age . Inside the bird c age , there was a

MIN ACTIVATIONS
MIN = -0.322

to remain until another weary traveler needed it . <|endoftext|> Once
oak tree , their secret portal . They never told anyone
forgot the lion and his mercy . He always tried to
said the mirror was a portal to another world . He
very scared and begged for mercy . He said , '
. The light was a portal . It took her to
top had become a magic portal that took her straight to
your fault and ask for mercy . I will spare you
. She skipped into the portal and began her story .
her wand and a magical portal opened . The
! He stepped through the portal , and landed in the
He said he was a traveler and he liked to play
day , Max thanked the Wizard as he handed him his
. It is called The Wizard of Oz . Do you
him his match . The Wizard smiled and said , "
Tom cried and begged for mercy . They wished they had
" I 'm a traveler ," the man said .
Wow , you are a traveler ? Where did you come
Rabbit took Max to the Wizard who said , " I
" I am a traveler ," the person said .

INTERVAL 0.132 - 0.283
CONTAINS 3.436%

can say sorry to each other and give each other a
. They both went down to the river
and explore the woods . His mother was worried so before
Yes , Sam . It 's very pretty . Do you
be wealthy to buy your favorite ice cream ."

INTERVAL -0.019 - 0.132
CONTAINS 60.142%

can squeeze the car the most ". Bob liked the idea
to say sorry again . He wanted to be friends with
saw in person ! It was an incredible experience . <|endoftext|>
the hot dogs . Sarah was so excited , she couldn
them up into the air . Pat said , " Let

INTERVAL -0.171 - -0.019
CONTAINS 35.451%

cried . Mom came to see what happened .
They ran to the park with their k ite .
, a cat , a butterfly and a flower . They
cup board . The little mouse nervously replied , " I
was a girl called M illy . M illy was 3

INTERVAL -0.322 - -0.171
CONTAINS 0.971%

and all the other things that make up a landscape .
do not hear the door that opens downstairs . They do
there . She was happy that she could have some healthy
. She felt so relieved that she remembered and so happy
up , and he understood that it was better to stay
Token to
Token ID284
Feature activation-6.66e-02

Pos logit contributions
 traveler+0.102
 portal+0.100
 troll+0.098
 spies+0.096
fire+0.093
Neg logit contributions
ags-0.157
rell-0.144
opl-0.144
aces-0.143
c-0.138
Token eat
Token ID4483
Feature activation+3.74e-02

Pos logit contributions
 traveler+0.087
 spies+0.082
 setup+0.081
 troll+0.079
 detectives+0.079
Neg logit contributions
ags-0.152
opl-0.136
rell-0.136
icer-0.133
aters-0.131
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
ags+0.093
opl+0.085
rell+0.084
aters+0.083
icer+0.082
Neg logit contributions
 traveler-0.040
 spies-0.038
 setup-0.038
 troll-0.037
 detectives-0.036
Token so
Token ID523
Feature activation-7.51e-03

Pos logit contributions
ags+0.205
rell+0.190
opl+0.186
c+0.183
aces+0.176
Neg logit contributions
 traveler-0.156
 spies-0.151
 portal-0.149
 detectives-0.143
 troll-0.143
Token he
Token ID339
Feature activation-3.23e-02

Pos logit contributions
 traveler+0.009
 spies+0.008
 setup+0.008
 detectives+0.008
 portal+0.008
Neg logit contributions
ags-0.018
opl-0.016
rell-0.016
icer-0.016
c-0.016
Token slur
Token ID40066
Feature activation+2.81e-01

Pos logit contributions
 traveler+0.044
 spies+0.042
 setup+0.041
 portal+0.041
 detectives+0.041
Neg logit contributions
ags-0.071
rell-0.064
opl-0.063
icer-0.062
aters-0.061
Tokenped
Token ID9124
Feature activation+3.49e-02

Pos logit contributions
ags+0.363
icer+0.325
opl+0.317
aters+0.308
rell+0.293
Neg logit contributions
 traveler-0.601
 troll-0.580
 spies-0.577
 rescue-0.564
 criminal-0.564
Token up
Token ID510
Feature activation+1.35e-02

Pos logit contributions
ags+0.078
opl+0.070
rell+0.070
icer+0.068
aters+0.067
Neg logit contributions
 traveler-0.047
 spies-0.045
 setup-0.044
 troll-0.043
 detectives-0.043
Token the
Token ID262
Feature activation+5.07e-02

Pos logit contributions
ags+0.030
rell+0.027
opl+0.027
icer+0.026
aters+0.026
Neg logit contributions
 traveler-0.019
 spies-0.018
 setup-0.017
 portal-0.017
 detectives-0.017
Token apples
Token ID22514
Feature activation+8.50e-02

Pos logit contributions
ags+0.107
opl+0.093
aters+0.091
agged+0.090
ash+0.089
Neg logit contributions
 setup-0.075
 rescue-0.075
 traveler-0.074
 troll-0.072
 criminal-0.070
Tokenau
Token ID559
Feature activation-7.65e-02

Pos logit contributions
ags+0.198
opl+0.177
icer+0.174
rell+0.173
aters+0.170
Neg logit contributions
 traveler-0.109
 spies-0.103
 setup-0.103
 troll-0.100
 scientist-0.098
Token at
Token ID379
Feature activation+6.91e-02

Pos logit contributions
 traveler+0.044
 spies+0.043
 detectives+0.042
 setup+0.042
 troll+0.041
Neg logit contributions
ags-0.058
opl-0.055
rell-0.054
icer-0.052
aters-0.052
Token Tom
Token ID4186
Feature activation+5.75e-03

Pos logit contributions
ags+0.164
opl+0.148
rell+0.147
icer+0.144
aters+0.142
Neg logit contributions
 traveler-0.086
 spies-0.080
 setup-0.079
 detectives-0.077
 troll-0.077
Token.
Token ID13
Feature activation+7.21e-03

Pos logit contributions
ags+0.013
rell+0.012
icer+0.012
agged+0.012
opl+0.012
Neg logit contributions
 that-0.008
 setup-0.007
 rescue-0.007
 spies-0.007
 university-0.007
Token He
Token ID679
Feature activation-2.00e-03

Pos logit contributions
ags+0.013
opl+0.011
rell+0.011
aters+0.011
aces+0.011
Neg logit contributions
 traveler-0.013
 setup-0.013
 spies-0.012
 portal-0.012
 troll-0.012
Token was
Token ID373
Feature activation+1.38e-02

Pos logit contributions
 traveler+0.003
 spies+0.003
 detectives+0.003
 portal+0.003
 university+0.003
Neg logit contributions
ags-0.004
opl-0.004
rell-0.004
icer-0.004
c-0.004
Token slur
Token ID40066
Feature activation+2.81e-01

Pos logit contributions
ags+0.031
opl+0.028
rell+0.027
icer+0.027
aces+0.027
Neg logit contributions
 traveler-0.019
 spies-0.017
 portal-0.017
 setup-0.017
 troll-0.017
Tokenping
Token ID13886
Feature activation+8.46e-02

Pos logit contributions
icer+0.385
ags+0.380
opl+0.361
aters+0.359
c+0.337
Neg logit contributions
 troll-0.516
 spies-0.506
 traveler-0.496
 granted-0.486
 rescue-0.485
Token his
Token ID465
Feature activation+1.37e-01

Pos logit contributions
ags+0.198
opl+0.177
rell+0.176
icer+0.174
aters+0.171
Neg logit contributions
 traveler-0.105
 spies-0.101
 troll-0.098
 setup-0.098
 detectives-0.096
Token soup
Token ID17141
Feature activation+6.90e-02

Pos logit contributions
opl+0.287
rell+0.280
ags+0.279
aters+0.263
akes+0.251
Neg logit contributions
 troll-0.194
 traveler-0.193
 spies-0.192
 rescue-0.190
 clues-0.189
Token and
Token ID290
Feature activation+1.09e-01

Pos logit contributions
ags+0.170
opl+0.154
rell+0.153
icer+0.151
agged+0.147
Neg logit contributions
 traveler-0.076
 spies-0.073
 setup-0.072
fire-0.071
 troll-0.071
Token making
Token ID1642
Feature activation+7.19e-02

Pos logit contributions
ags+0.209
opl+0.188
rell+0.188
aces+0.181
c+0.180
Neg logit contributions
 traveler-0.179
 portal-0.167
 spies-0.165
 university-0.165
 criminal-0.162
Token worm
Token ID23849
Feature activation-2.88e-02

Pos logit contributions
ags+0.125
opl+0.115
rell+0.114
aters+0.111
icer+0.109
Neg logit contributions
 traveler-0.056
 troll-0.053
 spies-0.052
 setup-0.051
 detectives-0.049
Token with
Token ID351
Feature activation+2.70e-02

Pos logit contributions
 traveler+0.035
 that+0.034
 rescue+0.034
 spies+0.033
 troll+0.033
Neg logit contributions
ags-0.069
icer-0.062
opl-0.062
agged-0.060
rell-0.059
Token her
Token ID607
Feature activation+1.16e-01

Pos logit contributions
ags+0.057
opl+0.052
icer+0.052
rell+0.051
aters+0.050
Neg logit contributions
 traveler-0.039
 spies-0.038
 setup-0.037
 troll-0.036
 detectives-0.036
Token eye
Token ID4151
Feature activation-6.36e-02

Pos logit contributions
ags+0.270
rell+0.251
opl+0.247
agged+0.236
icer+0.232
Neg logit contributions
 rescue-0.146
 traveler-0.143
 setup-0.140
 that-0.140
 spies-0.138
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 traveler+0.072
 that+0.068
 setup+0.066
 spies+0.066
 troll+0.065
Neg logit contributions
ags-0.158
opl-0.143
icer-0.142
rell-0.139
akes-0.134
Token slur
Token ID40066
Feature activation+2.81e-01

Pos logit contributions
ags+0.194
rell+0.182
c+0.178
aces+0.174
opl+0.173
Neg logit contributions
 traveler-0.169
 university-0.163
 portal-0.163
 criminal-0.158
 spies-0.158
Tokenped
Token ID9124
Feature activation+3.39e-02

Pos logit contributions
ags+0.376
icer+0.366
opl+0.346
aters+0.326
rell+0.310
Neg logit contributions
 traveler-0.550
 troll-0.548
 granted-0.537
 spies-0.534
 that-0.528
Token it
Token ID340
Feature activation+5.34e-02

Pos logit contributions
ags+0.081
opl+0.073
rell+0.073
icer+0.071
aters+0.070
Neg logit contributions
 traveler-0.041
 spies-0.038
 setup-0.038
 troll-0.037
 detectives-0.037
Token up
Token ID510
Feature activation+1.32e-02

Pos logit contributions
ags+0.124
opl+0.113
icer+0.112
rell+0.108
aters+0.106
Neg logit contributions
 traveler-0.066
 spies-0.063
 setup-0.062
 portal-0.062
 rescue-0.061
Token with
Token ID351
Feature activation+2.72e-02

Pos logit contributions
ags+0.029
rell+0.026
opl+0.026
icer+0.025
aters+0.025
Neg logit contributions
 traveler-0.018
 spies-0.017
 setup-0.017
 detectives-0.017
 troll-0.017
Token a
Token ID257
Feature activation-1.49e-01

Pos logit contributions
ags+0.059
opl+0.053
rell+0.053
icer+0.052
aters+0.051
Neg logit contributions
 traveler-0.039
 spies-0.037
 setup-0.036
 troll-0.036
 detectives-0.036
Token with
Token ID351
Feature activation+2.59e-02

Pos logit contributions
ags+0.187
rell+0.173
c+0.173
opl+0.166
icer+0.163
Neg logit contributions
 traveler-0.155
 spies-0.153
 university-0.150
 portal-0.150
 detectives-0.146
Token one
Token ID530
Feature activation+2.12e-03

Pos logit contributions
ags+0.059
opl+0.052
rell+0.052
icer+0.050
c+0.050
Neg logit contributions
 traveler-0.034
 spies-0.033
 setup-0.032
 detectives-0.032
 scientist-0.032
Token bite
Token ID13197
Feature activation+5.53e-02

Pos logit contributions
ags+0.005
rell+0.004
opl+0.004
icer+0.004
aters+0.004
Neg logit contributions
 traveler-0.003
 spies-0.003
 setup-0.003
 troll-0.003
 portal-0.003
Token,
Token ID11
Feature activation-8.95e-02

Pos logit contributions
ags+0.136
opl+0.123
rell+0.123
icer+0.121
aters+0.119
Neg logit contributions
 traveler-0.062
 spies-0.058
 setup-0.057
 troll-0.056
 detectives-0.056
Token he
Token ID339
Feature activation-3.31e-02

Pos logit contributions
 traveler+0.102
 portal+0.097
 spies+0.095
 detectives+0.093
 setup+0.092
Neg logit contributions
ags-0.211
rell-0.193
opl-0.192
icer-0.188
aces-0.186
Token slur
Token ID40066
Feature activation+2.80e-01

Pos logit contributions
 traveler+0.046
 spies+0.045
 university+0.043
 portal+0.043
 detectives+0.043
Neg logit contributions
ags-0.071
rell-0.064
opl-0.063
icer-0.063
c-0.062
Tokenped
Token ID9124
Feature activation+3.21e-02

Pos logit contributions
ags+0.386
icer+0.371
opl+0.345
aters+0.339
rell+0.315
Neg logit contributions
 traveler-0.553
 troll-0.549
 granted-0.535
 rescue-0.523
 spies-0.520
Token it
Token ID340
Feature activation+5.22e-02

Pos logit contributions
ags+0.072
rell+0.064
opl+0.064
icer+0.062
aters+0.062
Neg logit contributions
 traveler-0.043
 spies-0.041
 setup-0.040
 troll-0.040
 detectives-0.040
Token all
Token ID477
Feature activation+7.50e-02

Pos logit contributions
ags+0.126
opl+0.113
icer+0.111
rell+0.109
agged+0.107
Neg logit contributions
 traveler-0.061
 setup-0.058
 spies-0.057
 rescue-0.057
 portal-0.057
Token up
Token ID510
Feature activation+1.16e-02

Pos logit contributions
ags+0.177
aters+0.166
icer+0.166
ittens+0.162
aces+0.160
Neg logit contributions
 that-0.100
 possible-0.083
 suddenly-0.080
 setup-0.080
 ranger-0.078
Token.
Token ID13
Feature activation-6.81e-03

Pos logit contributions
ags+0.025
rell+0.022
opl+0.022
icer+0.022
aters+0.021
Neg logit contributions
 traveler-0.017
 spies-0.016
 setup-0.015
 detectives-0.015
 portal-0.015
Tokenbled
Token ID9342
Feature activation-5.74e-02

Pos logit contributions
ags+0.223
opl+0.212
rell+0.212
aters+0.200
c+0.198
Neg logit contributions
 traveler-0.164
 spies-0.159
 detectives-0.154
 troll-0.154
 setup-0.154
Token it
Token ID340
Feature activation+5.50e-02

Pos logit contributions
 traveler+0.051
 setup+0.048
 that+0.048
 spies+0.048
 troll+0.047
Neg logit contributions
ags-0.153
opl-0.141
rell-0.139
aters-0.138
aces-0.134
Token all
Token ID477
Feature activation+7.50e-02

Pos logit contributions
ags+0.117
opl+0.109
rell+0.104
aters+0.103
icer+0.101
Neg logit contributions
 traveler-0.078
 setup-0.075
 that-0.074
 portal-0.072
 spies-0.072
Token up
Token ID510
Feature activation+1.14e-02

Pos logit contributions
ags+0.168
aters+0.160
ittens+0.157
aces+0.156
opl+0.153
Neg logit contributions
 that-0.103
 possible-0.091
 traveler-0.087
fire-0.085
 setup-0.085
Token,
Token ID11
Feature activation-8.41e-02

Pos logit contributions
ags+0.024
opl+0.022
rell+0.022
icer+0.021
aters+0.021
Neg logit contributions
 traveler-0.017
 spies-0.016
 setup-0.016
 troll-0.016
 detectives-0.015
Token slur
Token ID40066
Feature activation+2.80e-01

Pos logit contributions
 traveler+0.107
 setup+0.105
 spies+0.101
 troll+0.099
 detectives+0.097
Neg logit contributions
ags-0.199
rell-0.179
opl-0.177
aters-0.174
icer-0.174
Tokenping
Token ID13886
Feature activation+8.15e-02

Pos logit contributions
ags+0.372
opl+0.331
icer+0.331
aters+0.323
rell+0.312
Neg logit contributions
 troll-0.579
 traveler-0.579
 granted-0.555
fire-0.549
 Wizard-0.548
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
ags+0.184
opl+0.165
rell+0.163
icer+0.161
aters+0.158
Neg logit contributions
 traveler-0.109
 spies-0.104
 setup-0.102
 troll-0.101
 detectives-0.100
Token sav
Token ID6799
Feature activation+2.06e-02

Pos logit contributions
ags+0.213
rell+0.191
opl+0.190
icer+0.184
aters+0.181
Neg logit contributions
 traveler-0.163
 spies-0.155
 portal-0.151
 university-0.151
 detectives-0.150
Tokenoring
Token ID3255
Feature activation-2.65e-03

Pos logit contributions
ags+0.034
opl+0.030
rell+0.030
aters+0.028
icer+0.028
Neg logit contributions
 traveler-0.039
 setup-0.038
 spies-0.037
 troll-0.037
 detectives-0.037
Token every
Token ID790
Feature activation+8.59e-02

Pos logit contributions
 traveler+0.003
 spies+0.003
 setup+0.003
 detectives+0.003
 troll+0.003
Neg logit contributions
ags-0.006
opl-0.006
rell-0.006
icer-0.005
aters-0.005
Token he
Token ID339
Feature activation-3.37e-02

Pos logit contributions
 traveler+0.239
 spies+0.227
 detectives+0.220
 portal+0.219
 setup+0.215
Neg logit contributions
ags-0.400
opl-0.386
c-0.381
rell-0.376
icer-0.367
Token lun
Token ID14678
Feature activation-7.99e-02

Pos logit contributions
 traveler+0.046
 spies+0.045
 setup+0.043
 university+0.043
 portal+0.043
Neg logit contributions
ags-0.073
rell-0.066
opl-0.065
icer-0.064
aces-0.063
Tokenged
Token ID2004
Feature activation-5.27e-03

Pos logit contributions
 traveler+0.128
 setup+0.125
 spies+0.123
 troll+0.120
 portal+0.118
Neg logit contributions
ags-0.157
opl-0.146
rell-0.138
icer-0.135
aters-0.134
Token forward
Token ID2651
Feature activation+3.65e-02

Pos logit contributions
 spies+0.007
 traveler+0.007
 setup+0.007
 criminal+0.006
 clues+0.006
Neg logit contributions
ags-0.012
opl-0.011
rell-0.011
icer-0.011
aters-0.010
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
ags+0.070
rell+0.063
aces+0.061
opl+0.060
c+0.059
Neg logit contributions
 traveler-0.059
 setup-0.057
 portal-0.056
 spies-0.055
 detectives-0.055
Token slur
Token ID40066
Feature activation+2.80e-01

Pos logit contributions
ags+0.195
rell+0.176
opl+0.174
c+0.174
ag+0.163
Neg logit contributions
 traveler-0.152
 spies-0.145
 portal-0.143
 scientist-0.142
 troll-0.141
Tokenped
Token ID9124
Feature activation+3.14e-02

Pos logit contributions
ags+0.345
icer+0.304
opl+0.304
aters+0.287
rell+0.286
Neg logit contributions
 traveler-0.614
 spies-0.606
 troll-0.600
 setup-0.584
 detectives-0.584
Token it
Token ID340
Feature activation+5.27e-02

Pos logit contributions
ags+0.070
rell+0.062
opl+0.062
icer+0.061
aters+0.060
Neg logit contributions
 traveler-0.043
 spies-0.040
 setup-0.040
 portal-0.039
 troll-0.039
Token all
Token ID477
Feature activation+7.41e-02

Pos logit contributions
ags+0.119
opl+0.106
icer+0.105
rell+0.103
aters+0.102
Neg logit contributions
 traveler-0.069
 spies-0.067
 setup-0.066
 portal-0.065
 detectives-0.064
Token up
Token ID510
Feature activation+1.26e-02

Pos logit contributions
ags+0.169
icer+0.161
aters+0.159
ittens+0.152
aces+0.151
Neg logit contributions
 that-0.097
 possible-0.086
 setup-0.085
 ranger-0.084
 unexpected-0.081
Token.
Token ID13
Feature activation-5.89e-03

Pos logit contributions
ags+0.028
rell+0.024
opl+0.024
icer+0.024
aters+0.023
Neg logit contributions
 traveler-0.018
 spies-0.017
 setup-0.016
 detectives-0.016
 portal-0.016
Token soon
Token ID2582
Feature activation-9.53e-02

Pos logit contributions
ags+0.063
opl+0.057
rell+0.057
icer+0.057
c+0.056
Neg logit contributions
 traveler-0.029
 spies-0.027
 portal-0.027
 detectives-0.026
 troll-0.026
Token he
Token ID339
Feature activation-3.65e-02

Pos logit contributions
 portal+0.151
 scientist+0.149
 reporter+0.146
 traveler+0.145
 professor+0.144
Neg logit contributions
aces-0.145
rell-0.142
c-0.138
icer-0.134
ags-0.132
Token heard
Token ID2982
Feature activation-1.02e-01

Pos logit contributions
 traveler+0.051
 university+0.051
 spies+0.050
 detectives+0.049
 portal+0.049
Neg logit contributions
ags-0.071
c-0.068
icer-0.067
opl-0.067
rell-0.066
Token a
Token ID257
Feature activation-1.53e-01

Pos logit contributions
 traveler+0.124
 setup+0.115
 detectives+0.114
 spies+0.113
 scientist+0.111
Neg logit contributions
ags-0.233
opl-0.215
rell-0.215
c-0.210
aces-0.205
Token big
Token ID1263
Feature activation-5.53e-02

Pos logit contributions
 spies+0.221
 detectives+0.216
 traveler+0.212
 setup+0.193
 explorers+0.193
Neg logit contributions
ags-0.297
rell-0.278
opl-0.275
c-0.271
icer-0.267
Token slur
Token ID40066
Feature activation+2.78e-01

Pos logit contributions
 traveler+0.083
 spies+0.081
 detectives+0.081
eware+0.080
 setup+0.077
Neg logit contributions
ags-0.110
opl-0.100
rell-0.099
icer-0.095
c-0.093
Tokenp
Token ID79
Feature activation+2.80e-02

Pos logit contributions
ags+0.401
icer+0.363
aters+0.349
opl+0.328
rell+0.314
Neg logit contributions
 traveler-0.570
 troll-0.562
fire-0.554
 granted-0.540
 portal-0.536
Token and
Token ID290
Feature activation+1.01e-01

Pos logit contributions
ags+0.065
icer+0.057
rell+0.057
opl+0.057
aters+0.056
Neg logit contributions
 traveler-0.035
 troll-0.033
 spies-0.033
 setup-0.033
 portal-0.033
Token a
Token ID257
Feature activation-1.53e-01

Pos logit contributions
ags+0.167
rell+0.165
c+0.164
allows+0.163
opl+0.157
Neg logit contributions
 traveler-0.153
 criminal-0.152
 university-0.146
 portal-0.146
 spies-0.146
Token splash
Token ID22870
Feature activation+3.93e-02

Pos logit contributions
 spies+0.223
 detectives+0.220
 traveler+0.218
 criminal+0.199
 Wizard+0.199
Neg logit contributions
ags-0.287
rell-0.273
c-0.269
opl-0.269
icer-0.266
Token coming
Token ID2406
Feature activation-4.43e-02

Pos logit contributions
ags+0.086
rell+0.077
opl+0.077
icer+0.075
aters+0.074
Neg logit contributions
 traveler-0.055
 spies-0.052
 setup-0.051
 detectives-0.050
 portal-0.050
Token to
Token ID284
Feature activation-6.87e-02

Pos logit contributions
ags+0.326
opl+0.293
rell+0.292
icer+0.286
aters+0.280
Neg logit contributions
 traveler-0.180
 spies-0.170
 setup-0.167
 troll-0.164
 portal-0.164
Token receive
Token ID3328
Feature activation-1.44e-01

Pos logit contributions
 traveler+0.097
 spies+0.093
 setup+0.091
 detectives+0.090
 portal+0.090
Neg logit contributions
ags-0.150
opl-0.133
rell-0.133
icer-0.130
aters-0.127
Token it
Token ID340
Feature activation+5.16e-02

Pos logit contributions
 traveler+0.162
 spies+0.151
 setup+0.147
 portal+0.145
 troll+0.145
Neg logit contributions
ags-0.356
opl-0.321
rell-0.320
icer-0.315
aters-0.310
Token.
Token ID13
Feature activation-6.60e-03

Pos logit contributions
ags+0.118
opl+0.105
rell+0.105
icer+0.103
aters+0.101
Neg logit contributions
 traveler-0.068
 spies-0.064
 setup-0.063
 troll-0.062
 detectives-0.062
Token He
Token ID679
Feature activation-5.29e-03

Pos logit contributions
 traveler+0.009
 spies+0.009
 setup+0.009
 detectives+0.009
 troll+0.009
Neg logit contributions
ags-0.014
opl-0.013
rell-0.013
icer-0.013
aters-0.012
Token slur
Token ID40066
Feature activation+2.78e-01

Pos logit contributions
 traveler+0.007
 spies+0.007
 setup+0.007
 portal+0.007
 detectives+0.007
Neg logit contributions
ags-0.011
rell-0.010
opl-0.010
icer-0.010
aters-0.010
Tokenped
Token ID9124
Feature activation+3.23e-02

Pos logit contributions
ags+0.370
icer+0.359
opl+0.344
aters+0.321
c+0.312
Neg logit contributions
 troll-0.533
 traveler-0.525
 spies-0.519
 rescue-0.518
 scientist-0.514
Token it
Token ID340
Feature activation+5.22e-02

Pos logit contributions
ags+0.072
rell+0.065
opl+0.065
icer+0.063
aters+0.062
Neg logit contributions
 traveler-0.043
 spies-0.041
 setup-0.040
 portal-0.040
 troll-0.040
Token up
Token ID510
Feature activation+1.15e-02

Pos logit contributions
ags+0.115
opl+0.102
icer+0.099
rell+0.098
aters+0.096
Neg logit contributions
 traveler-0.073
 setup-0.070
 spies-0.069
 portal-0.068
 scientist-0.067
Token with
Token ID351
Feature activation+2.72e-02

Pos logit contributions
ags+0.022
rell+0.019
opl+0.019
icer+0.019
aters+0.018
Neg logit contributions
 traveler-0.019
 spies-0.019
 detectives-0.018
 portal-0.018
 troll-0.018
Token delight
Token ID10974
Feature activation-1.11e-02

Pos logit contributions
ags+0.061
opl+0.054
rell+0.054
icer+0.053
aters+0.052
Neg logit contributions
 traveler-0.037
 spies-0.035
 setup-0.035
 troll-0.034
 detectives-0.034
Token into
Token ID656
Feature activation-4.27e-02

Pos logit contributions
ags+0.269
opl+0.247
rell+0.238
icer+0.237
agged+0.232
Neg logit contributions
 traveler-0.140
 spies-0.137
 setup-0.134
 detectives-0.129
 scientist-0.128
Token his
Token ID465
Feature activation+1.33e-01

Pos logit contributions
 traveler+0.060
 portal+0.055
 setup+0.054
 spies+0.054
 detectives+0.053
Neg logit contributions
ags-0.094
icer-0.085
rell-0.083
opl-0.083
aters-0.081
Token cup
Token ID6508
Feature activation+1.54e-01

Pos logit contributions
ags+0.263
opl+0.252
rell+0.245
agged+0.232
aces+0.232
Neg logit contributions
 rescue-0.207
 traveler-0.196
 spies-0.187
 ranger-0.186
 troll-0.186
Token.
Token ID13
Feature activation+1.24e-03

Pos logit contributions
ags+0.381
opl+0.346
icer+0.343
rell+0.336
aters+0.326
Neg logit contributions
 traveler-0.160
 spies-0.155
 ranger-0.150
 troll-0.150
 scientist-0.149
Token He
Token ID679
Feature activation-5.85e-03

Pos logit contributions
ags+0.002
opl+0.002
rell+0.002
icer+0.002
aters+0.002
Neg logit contributions
 traveler-0.002
 spies-0.002
 troll-0.002
 university-0.002
 detectives-0.002
Token slur
Token ID40066
Feature activation+2.78e-01

Pos logit contributions
 traveler+0.007
 setup+0.007
 spies+0.007
 troll+0.007
 detectives+0.007
Neg logit contributions
ags-0.014
opl-0.013
rell-0.012
icer-0.012
aters-0.012
Tokenped
Token ID9124
Feature activation+2.85e-02

Pos logit contributions
ags+0.389
icer+0.375
opl+0.363
aters+0.350
c+0.324
Neg logit contributions
 troll-0.508
 traveler-0.506
 spies-0.495
 rescue-0.493
 granted-0.492
Token it
Token ID340
Feature activation+5.03e-02

Pos logit contributions
ags+0.073
opl+0.067
rell+0.065
icer+0.065
aters+0.064
Neg logit contributions
 traveler-0.029
 spies-0.029
 setup-0.028
 detectives-0.027
 troll-0.027
Token up
Token ID510
Feature activation+9.85e-03

Pos logit contributions
ags+0.122
opl+0.110
icer+0.108
agged+0.107
aters+0.103
Neg logit contributions
 that-0.057
 rescue-0.055
 setup-0.055
 traveler-0.054
 spies-0.052
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
ags+0.019
opl+0.017
rell+0.017
icer+0.016
aters+0.016
Neg logit contributions
 traveler-0.016
 spies-0.016
 setup-0.015
 troll-0.015
 detectives-0.015
Token felt
Token ID2936
Feature activation-4.53e-02

Pos logit contributions
ags+0.210
rell+0.188
opl+0.184
c+0.179
aters+0.178
Neg logit contributions
 traveler-0.167
 spies-0.158
 portal-0.155
 university-0.154
 troll-0.154
Token it
Token ID340
Feature activation+4.44e-02

Pos logit contributions
ags+0.179
rell+0.163
opl+0.161
agged+0.156
aters+0.155
Neg logit contributions
 traveler-0.094
 spies-0.092
 university-0.090
 troll-0.089
 setup-0.089
Token,
Token ID11
Feature activation-9.09e-02

Pos logit contributions
ags+0.105
opl+0.095
icer+0.094
rell+0.093
agged+0.091
Neg logit contributions
 traveler-0.051
 spies-0.049
 university-0.048
 setup-0.048
 portal-0.047
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
 spies+0.108
 traveler+0.106
 setup+0.105
 detectives+0.102
 troll+0.099
Neg logit contributions
ags-0.215
opl-0.196
rell-0.195
icer-0.191
agged-0.187
Token then
Token ID788
Feature activation-1.36e-01

Pos logit contributions
ags+0.244
rell+0.220
opl+0.219
icer+0.215
c+0.214
Neg logit contributions
 traveler-0.130
 spies-0.120
 portal-0.119
 troll-0.117
 setup-0.116
Token you
Token ID345
Feature activation-2.41e-02

Pos logit contributions
 traveler+0.181
 portal+0.170
 spies+0.167
 criminal+0.164
 scientist+0.163
Neg logit contributions
ags-0.292
rell-0.270
opl-0.265
c-0.263
aces-0.263
Token slur
Token ID40066
Feature activation+2.74e-01

Pos logit contributions
 traveler+0.037
 spies+0.035
 setup+0.034
 detectives+0.034
 troll+0.034
Neg logit contributions
ags-0.050
rell-0.044
opl-0.044
icer-0.043
aters-0.042
Tokenp
Token ID79
Feature activation+2.70e-02

Pos logit contributions
ags+0.395
icer+0.373
opl+0.364
aters+0.350
rell+0.340
Neg logit contributions
 troll-0.476
 spies-0.472
 granted-0.469
 that-0.462
 someone-0.457
Token it
Token ID340
Feature activation+4.61e-02

Pos logit contributions
ags+0.065
opl+0.059
icer+0.056
rell+0.056
agged+0.056
Neg logit contributions
 that-0.031
 granted-0.027
 setup-0.027
fire-0.027
 troll-0.026
Token down
Token ID866
Feature activation+2.51e-02

Pos logit contributions
ags+0.106
opl+0.095
agged+0.093
icer+0.091
rell+0.090
Neg logit contributions
 setup-0.055
 traveler-0.054
 spies-0.054
 that-0.053
 rescue-0.052
Token your
Token ID534
Feature activation+6.08e-02

Pos logit contributions
ags+0.055
opl+0.049
rell+0.049
icer+0.048
aters+0.048
Neg logit contributions
 traveler-0.034
 spies-0.033
 setup-0.032
 troll-0.032
 detectives-0.032
Token mouth
Token ID5422
Feature activation+1.34e-01

Pos logit contributions
ags+0.103
opl+0.096
rell+0.093
icer+0.088
agged+0.087
Neg logit contributions
 rescue-0.106
 spies-0.106
 troll-0.105
 setup-0.104
 traveler-0.104
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
 traveler+0.054
 portal+0.050
 setup+0.049
 troll+0.049
 spies+0.049
Neg logit contributions
ags-0.067
rell-0.057
opl-0.056
aters-0.056
icer-0.055
Token She
Token ID1375
Feature activation-1.55e-02

Pos logit contributions
ags+0.037
opl+0.032
rell+0.032
icer+0.031
aters+0.031
Neg logit contributions
 traveler-0.031
 setup-0.030
 spies-0.029
 portal-0.029
 troll-0.029
Token recognizes
Token ID21570
Feature activation-6.78e-02

Pos logit contributions
 traveler+0.028
 spies+0.027
 setup+0.027
 detectives+0.026
 criminal+0.026
Neg logit contributions
ags-0.027
opl-0.024
rell-0.023
icer-0.023
aters-0.022
Token the
Token ID262
Feature activation+5.87e-02

Pos logit contributions
 traveler+0.080
 spies+0.073
 setup+0.072
 portal+0.072
 detectives+0.071
Neg logit contributions
ags-0.166
opl-0.150
rell-0.149
icer-0.148
aters-0.144
Token bird
Token ID6512
Feature activation-1.79e-02

Pos logit contributions
ags+0.129
opl+0.116
rell+0.115
icer+0.114
c+0.111
Neg logit contributions
 traveler-0.080
 spies-0.076
 setup-0.075
 detectives-0.075
 portal-0.074
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 traveler+0.041
 that+0.041
 troll+0.041
 granted+0.040
 spies+0.040
Neg logit contributions
ags-0.025
agged-0.018
rell-0.018
icer-0.018
opl-0.017
Tokenage
Token ID496
Feature activation-3.84e-02

Pos logit contributions
ags+0.491
icer+0.481
ash+0.445
wy+0.438
aters+0.422
Neg logit contributions
 rescue-0.394
 troll-0.387
fire-0.384
 that-0.382
 guide-0.381
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 traveler+0.055
 spies+0.052
 setup+0.051
 detectives+0.051
 troll+0.051
Neg logit contributions
ags-0.083
opl-0.074
rell-0.074
icer-0.071
aters-0.071
Token It
Token ID632
Feature activation+3.92e-02

Pos logit contributions
ags+0.037
opl+0.032
rell+0.032
icer+0.031
aters+0.031
Neg logit contributions
 traveler-0.032
 setup-0.031
 spies-0.030
 portal-0.030
 troll-0.030
Token belongs
Token ID14448
Feature activation+8.83e-03

Pos logit contributions
ags+0.089
rell+0.079
opl+0.078
icer+0.077
aters+0.076
Neg logit contributions
 traveler-0.053
 spies-0.050
 setup-0.050
 troll-0.050
 portal-0.049
Token to
Token ID284
Feature activation-5.83e-02

Pos logit contributions
ags+0.019
aces+0.017
c+0.017
aters+0.017
icer+0.016
Neg logit contributions
 traveler-0.012
fire-0.012
 criminal-0.011
 university-0.011
 scientist-0.011
Token goodbye
Token ID24829
Feature activation+1.57e-02

Pos logit contributions
 traveler+0.060
 scientist+0.057
 troll+0.057
 setup+0.057
 portal+0.056
Neg logit contributions
ags-0.069
rell-0.057
opl-0.056
aters-0.056
aces-0.056
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
ags+0.036
rell+0.032
aces+0.031
c+0.031
opl+0.031
Neg logit contributions
 traveler-0.020
 troll-0.019
 setup-0.019
 scientist-0.018
 spies-0.018
Token They
Token ID1119
Feature activation-2.14e-02

Pos logit contributions
ags+0.026
aters+0.020
rell+0.020
agged+0.020
aces+0.019
Neg logit contributions
 traveler-0.032
 setup-0.031
 troll-0.031
 criminal-0.030
 scientist-0.029
Token said
Token ID531
Feature activation+4.57e-02

Pos logit contributions
 traveler+0.031
 spies+0.030
 university+0.029
 scientist+0.029
 portal+0.029
Neg logit contributions
ags-0.044
rell-0.040
opl-0.039
icer-0.038
c-0.038
Token "
Token ID366
Feature activation-1.66e-02

Pos logit contributions
ags+0.083
rell+0.072
aces+0.071
opl+0.069
agged+0.068
Neg logit contributions
 traveler-0.076
 scientist-0.073
 criminal-0.073
 troll-0.072
 portal-0.071
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 traveler+0.018
 spies+0.016
 setup+0.016
 portal+0.016
 detectives+0.015
Neg logit contributions
ags-0.043
opl-0.038
rell-0.038
icer-0.038
aters-0.037
Tokenaw
Token ID707
Feature activation-5.54e-02

Pos logit contributions
ags+0.526
icer+0.479
aters+0.452
opl+0.436
ash+0.433
Neg logit contributions
 traveler-0.427
 troll-0.420
 portal-0.407
 that-0.401
 spies-0.399
Token,
Token ID11
Feature activation-7.31e-02

Pos logit contributions
 traveler+0.073
 spies+0.069
 detectives+0.067
 university+0.067
 portal+0.067
Neg logit contributions
ags-0.125
rell-0.115
opl-0.114
icer-0.111
c-0.110
Token c
Token ID269
Feature activation+9.11e-02

Pos logit contributions
 traveler+0.081
 setup+0.074
 spies+0.073
 portal+0.072
 criminal+0.072
Neg logit contributions
ags-0.178
opl-0.163
rell-0.161
icer-0.159
c-0.156
Tokenaw
Token ID707
Feature activation-5.47e-02

Pos logit contributions
ags+0.139
rell+0.122
opl+0.119
icer+0.117
aces+0.114
Neg logit contributions
 traveler-0.184
 portal-0.176
 spies-0.175
 setup-0.173
 troll-0.171
Token,
Token ID11
Feature activation-7.32e-02

Pos logit contributions
 traveler+0.066
 spies+0.061
 detectives+0.060
 setup+0.060
 portal+0.060
Neg logit contributions
ags-0.130
opl-0.119
rell-0.119
icer-0.116
aters-0.114
Token made
Token ID925
Feature activation+8.13e-03

Pos logit contributions
ags+0.083
rell+0.075
opl+0.074
c+0.074
icer+0.073
Neg logit contributions
 traveler-0.056
 spies-0.054
 university-0.052
 detectives-0.052
 setup-0.052
Token a
Token ID257
Feature activation-1.43e-01

Pos logit contributions
ags+0.021
rell+0.019
opl+0.019
icer+0.018
c+0.018
Neg logit contributions
 traveler-0.008
 setup-0.007
 scientist-0.007
 portal-0.007
 criminal-0.007
Token c
Token ID269
Feature activation+9.06e-02

Pos logit contributions
 traveler+0.195
 spies+0.191
 detectives+0.188
 criminal+0.183
 Wizard+0.181
Neg logit contributions
ags-0.303
opl-0.271
icer-0.269
rell-0.266
c-0.263
Tokenaw
Token ID707
Feature activation-5.71e-02

Pos logit contributions
ags+0.148
rell+0.133
opl+0.126
icer+0.126
aters+0.126
Neg logit contributions
 traveler-0.174
 portal-0.166
 troll-0.165
 spies-0.164
 university-0.160
Token-
Token ID12
Feature activation-4.68e-02

Pos logit contributions
 traveler+0.084
 spies+0.080
 detectives+0.079
 setup+0.079
 scientist+0.079
Neg logit contributions
ags-0.119
opl-0.107
rell-0.106
icer-0.102
c-0.102
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 traveler+0.078
 troll+0.072
fire+0.071
 portal+0.071
 spies+0.070
Neg logit contributions
ags-0.095
opl-0.081
aters-0.081
icer-0.081
rell-0.081
Tokenaw
Token ID707
Feature activation-5.66e-02

Pos logit contributions
ags+0.484
icer+0.462
aters+0.423
wy+0.410
rell+0.410
Neg logit contributions
 traveler-0.450
 portal-0.437
 troll-0.435
fire-0.433
 that-0.433
Token sound
Token ID2128
Feature activation-8.57e-02

Pos logit contributions
 traveler+0.085
 spies+0.081
 setup+0.080
 detectives+0.079
 troll+0.079
Neg logit contributions
ags-0.118
opl-0.105
rell-0.105
icer-0.102
aters-0.100
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
 traveler+0.134
 spies+0.127
 setup+0.126
 detectives+0.124
 troll+0.124
Neg logit contributions
ags-0.175
opl-0.154
rell-0.154
icer-0.149
aters-0.147
Token They
Token ID1119
Feature activation-2.18e-02

Pos logit contributions
ags+0.019
rell+0.016
opl+0.015
icer+0.015
aters+0.015
Neg logit contributions
 traveler-0.022
 setup-0.021
 troll-0.021
 portal-0.020
 spies-0.020
Token tried
Token ID3088
Feature activation+2.32e-03

Pos logit contributions
 traveler+0.031
 spies+0.030
 criminal+0.030
 scientist+0.030
 portal+0.030
Neg logit contributions
ags-0.045
rell-0.040
c-0.039
aces-0.039
opl-0.039
Token the
Token ID262
Feature activation+5.54e-02

Pos logit contributions
 traveler+0.047
 spies+0.046
 setup+0.044
 university+0.043
 detectives+0.043
Neg logit contributions
ags-0.140
opl-0.130
rell-0.129
icer-0.125
aters-0.124
Token door
Token ID3420
Feature activation-1.26e-01

Pos logit contributions
ags+0.127
opl+0.113
rell+0.113
icer+0.111
aters+0.109
Neg logit contributions
 traveler-0.073
 spies-0.069
 setup-0.068
 troll-0.067
 portal-0.067
Token of
Token ID286
Feature activation-2.26e-02

Pos logit contributions
 traveler+0.181
 spies+0.171
 setup+0.167
 troll+0.167
 detectives+0.166
Neg logit contributions
ags-0.273
rell-0.242
opl-0.240
icer-0.235
aters-0.234
Token the
Token ID262
Feature activation+5.58e-02

Pos logit contributions
 traveler+0.027
 spies+0.025
 setup+0.024
 portal+0.024
 detectives+0.024
Neg logit contributions
ags-0.055
opl-0.049
rell-0.049
icer-0.048
aters-0.048
Token bird
Token ID6512
Feature activation-1.83e-02

Pos logit contributions
ags+0.123
rell+0.109
opl+0.109
icer+0.107
aters+0.105
Neg logit contributions
 traveler-0.079
 spies-0.075
 setup-0.074
 portal-0.073
 troll-0.073
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 that+0.028
 rescue+0.028
 ranger+0.027
 traveler+0.027
fire+0.027
Neg logit contributions
ags-0.040
agged-0.035
affles-0.033
opl-0.033
icer-0.033
Tokenage
Token ID496
Feature activation-3.83e-02

Pos logit contributions
icer+0.497
wy+0.497
ags+0.485
affles+0.470
ash+0.449
Neg logit contributions
 ranger-0.392
 rescue-0.384
 portal-0.368
 keeper-0.367
 troll-0.366
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
 traveler+0.047
 spies+0.045
 setup+0.044
 portal+0.043
 university+0.043
Neg logit contributions
ags-0.090
opl-0.082
rell-0.081
icer-0.080
aters-0.078
Token It
Token ID632
Feature activation+3.87e-02

Pos logit contributions
ags+0.032
rell+0.027
opl+0.026
aces+0.026
c+0.026
Neg logit contributions
 traveler-0.031
 setup-0.029
 criminal-0.028
 troll-0.028
 spies-0.028
Token is
Token ID318
Feature activation+1.85e-02

Pos logit contributions
ags+0.085
opl+0.073
rell+0.073
agged+0.072
aters+0.072
Neg logit contributions
 traveler-0.057
 portal-0.055
 setup-0.054
 troll-0.054
 spies-0.053
Token locked
Token ID8970
Feature activation-1.37e-01

Pos logit contributions
ags+0.042
opl+0.038
rell+0.038
icer+0.037
aters+0.036
Neg logit contributions
 traveler-0.024
 spies-0.023
 setup-0.023
 troll-0.022
 detectives-0.022
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
ags+0.099
rell+0.088
opl+0.086
aces+0.084
aters+0.084
Neg logit contributions
 traveler-0.063
 spies-0.059
 setup-0.058
 troll-0.058
 portal-0.058
Token She
Token ID1375
Feature activation-1.82e-02

Pos logit contributions
ags+0.023
rell+0.020
opl+0.019
aters+0.019
aces+0.019
Neg logit contributions
 traveler-0.024
 setup-0.023
 portal-0.022
 troll-0.022
 criminal-0.022
Token carried
Token ID5281
Feature activation-2.56e-02

Pos logit contributions
 traveler+0.027
 spies+0.026
 university+0.026
 troll+0.026
 detectives+0.026
Neg logit contributions
ags-0.037
rell-0.033
icer-0.032
opl-0.032
aces-0.032
Token the
Token ID262
Feature activation+5.33e-02

Pos logit contributions
 traveler+0.032
 troll+0.030
 spies+0.029
 portal+0.029
 setup+0.029
Neg logit contributions
ags-0.061
rell-0.054
opl-0.053
icer-0.053
aters-0.052
Token bird
Token ID6512
Feature activation-2.02e-02

Pos logit contributions
ags+0.120
opl+0.107
rell+0.107
aters+0.103
icer+0.103
Neg logit contributions
 traveler-0.074
 spies-0.071
 setup-0.070
 university-0.068
 troll-0.068
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 traveler+0.026
 rescue+0.024
 that+0.024
fire+0.023
 explorers+0.023
Neg logit contributions
ags-0.049
opl-0.046
rell-0.044
agged-0.043
affles-0.043
Tokenage
Token ID496
Feature activation-4.05e-02

Pos logit contributions
ags+0.498
icer+0.494
wy+0.472
affles+0.464
opl+0.450
Neg logit contributions
 rescue-0.388
 that-0.384
fire-0.382
 traveler-0.378
 ranger-0.377
Token to
Token ID284
Feature activation-6.08e-02

Pos logit contributions
 traveler+0.055
 spies+0.053
 portal+0.052
 setup+0.051
 university+0.050
Neg logit contributions
ags-0.089
opl-0.085
icer-0.083
rell-0.082
aters-0.079
Token the
Token ID262
Feature activation+5.33e-02

Pos logit contributions
eware+0.098
 traveler+0.097
 portal+0.090
 criminal+0.090
 troll+0.090
Neg logit contributions
ags-0.123
opl-0.100
rell-0.098
icer-0.094
c-0.094
Token window
Token ID4324
Feature activation+1.54e-02

Pos logit contributions
ags+0.098
icer+0.085
opl+0.085
c+0.083
rell+0.081
Neg logit contributions
 traveler-0.090
 spies-0.085
 detectives-0.084
 setup-0.084
 portal-0.084
Token sill
Token ID49276
Feature activation+3.03e-02

Pos logit contributions
ags+0.032
rell+0.029
opl+0.029
icer+0.028
aters+0.027
Neg logit contributions
 traveler-0.023
 spies-0.022
 setup-0.022
 portal-0.021
 detectives-0.021
Token He
Token ID679
Feature activation+1.60e-03

Pos logit contributions
ags+0.032
rell+0.026
icer+0.025
aters+0.025
agged+0.025
Neg logit contributions
 traveler-0.036
 portal-0.035
 setup-0.035
 reality-0.033
 clues-0.033
Token shows
Token ID2523
Feature activation+2.44e-02

Pos logit contributions
ags+0.003
rell+0.003
opl+0.003
icer+0.003
aters+0.003
Neg logit contributions
 traveler-0.003
 spies-0.003
 setup-0.003
 detectives-0.003
 troll-0.003
Token Mom
Token ID11254
Feature activation+5.03e-02

Pos logit contributions
ags+0.048
rell+0.042
opl+0.041
icer+0.040
aters+0.040
Neg logit contributions
 traveler-0.041
 spies-0.039
 setup-0.039
 portal-0.038
 troll-0.038
Token the
Token ID262
Feature activation+5.71e-02

Pos logit contributions
ags+0.131
opl+0.120
rell+0.117
icer+0.115
akes+0.115
Neg logit contributions
 traveler-0.050
 spies-0.049
 setup-0.048
 detectives-0.046
 security-0.046
Token bird
Token ID6512
Feature activation-1.86e-02

Pos logit contributions
ags+0.125
opl+0.111
rell+0.111
icer+0.110
c+0.107
Neg logit contributions
 traveler-0.079
 spies-0.074
 setup-0.073
 detectives-0.072
 portal-0.072
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 traveler+0.021
 troll+0.021
 spies+0.020
 that+0.020
 ranger+0.020
Neg logit contributions
ags-0.047
opl-0.041
agged-0.041
rell-0.041
icer-0.041
Tokenage
Token ID496
Feature activation-3.91e-02

Pos logit contributions
ags+0.496
icer+0.478
wy+0.439
ash+0.427
aters+0.423
Neg logit contributions
 troll-0.416
 that-0.416
fire-0.416
 rescue-0.411
 traveler-0.410
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
 traveler+0.045
 spies+0.043
 setup+0.042
 troll+0.041
 portal+0.041
Neg logit contributions
ags-0.095
opl-0.087
icer-0.087
rell-0.086
aters-0.083
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
ags+0.042
opl+0.038
rell+0.038
icer+0.037
aters+0.036
Neg logit contributions
 traveler-0.026
 spies-0.025
 setup-0.024
 troll-0.024
 detectives-0.024
Token
Token ID198
Feature activation+1.50e-02

Pos logit contributions
ags+0.036
rell+0.033
opl+0.033
aces+0.032
icer+0.032
Neg logit contributions
 traveler-0.017
 spies-0.017
 university-0.016
 portal-0.016
 setup-0.016
Token"
Token ID1
Feature activation-3.07e-02

Pos logit contributions
ags+0.033
icer+0.029
rell+0.029
opl+0.029
aters+0.028
Neg logit contributions
 traveler-0.021
 spies-0.019
 troll-0.019
 setup-0.019
 portal-0.019
Token bird
Token ID6512
Feature activation-2.13e-02

Pos logit contributions
 setup+0.095
 traveler+0.093
 portal+0.092
 spies+0.090
Where+0.090
Neg logit contributions
ags-0.146
icer-0.134
rell-0.131
c-0.130
aters-0.128
Token in
Token ID287
Feature activation+3.18e-02

Pos logit contributions
 traveler+0.021
 spies+0.019
 troll+0.019
 portal+0.019
 university+0.018
Neg logit contributions
ags-0.056
rell-0.051
opl-0.050
icer-0.049
aters-0.049
Token a
Token ID257
Feature activation-1.43e-01

Pos logit contributions
ags+0.089
rell+0.082
opl+0.082
icer+0.080
aters+0.079
Neg logit contributions
 traveler-0.025
 spies-0.023
 setup-0.022
 troll-0.022
 detectives-0.022
Token black
Token ID2042
Feature activation+8.72e-02

Pos logit contributions
 traveler+0.183
 spies+0.175
 setup+0.174
 detectives+0.168
 troll+0.166
Neg logit contributions
ags-0.326
opl-0.295
rell-0.291
icer-0.290
c-0.288
Token bird
Token ID6512
Feature activation-2.13e-02

Pos logit contributions
ags+0.216
rell+0.194
opl+0.193
aces+0.191
icer+0.189
Neg logit contributions
 traveler-0.097
 troll-0.089
 portal-0.089
 university-0.089
 spies-0.087
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 that+0.022
 traveler+0.020
 portal+0.018
 suddenly+0.018
 granted+0.018
Neg logit contributions
ags-0.055
rell-0.050
icer-0.049
opl-0.048
aters-0.048
Tokenage
Token ID496
Feature activation-4.07e-02

Pos logit contributions
ags+0.462
icer+0.426
rell+0.397
ash+0.390
aters+0.387
Neg logit contributions
 traveler-0.462
 troll-0.444
 portal-0.440
 that-0.436
 rescue-0.432
Token.
Token ID13
Feature activation+3.59e-03

Pos logit contributions
 traveler+0.049
 spies+0.046
 setup+0.044
 portal+0.044
 troll+0.044
Neg logit contributions
ags-0.097
rell-0.087
opl-0.087
icer-0.086
aters-0.085
Token The
Token ID383
Feature activation+8.20e-02

Pos logit contributions
ags+0.008
opl+0.007
ash+0.007
icer+0.007
aters+0.007
Neg logit contributions
 university-0.005
 traveler-0.004
 troll-0.004
 spies-0.004
 portal-0.004
Token bird
Token ID6512
Feature activation-2.06e-02

Pos logit contributions
ags+0.222
rell+0.202
opl+0.201
aters+0.198
icer+0.196
Neg logit contributions
 traveler-0.078
 spies-0.070
 setup-0.070
 troll-0.070
 portal-0.068
Token was
Token ID373
Feature activation+1.50e-02

Pos logit contributions
 that+0.020
 granted+0.018
 suddenly+0.018
 saw+0.017
 traveler+0.016
Neg logit contributions
ags-0.061
rell-0.051
aters-0.050
opl-0.050
icer-0.049
TokenThey
Token ID2990
Feature activation-6.84e-02

Pos logit contributions
ags+0.025
icer+0.022
opl+0.022
rell+0.022
aters+0.022
Neg logit contributions
 traveler-0.017
 troll-0.016
 setup-0.016
 portal-0.016
 spies-0.015
Token look
Token ID804
Feature activation-7.86e-02

Pos logit contributions
 traveler+0.093
 spies+0.088
 setup+0.087
 portal+0.085
 troll+0.085
Neg logit contributions
ags-0.153
opl-0.137
rell-0.137
icer-0.134
aters-0.132
Token inside
Token ID2641
Feature activation-1.18e-01

Pos logit contributions
 traveler+0.123
 spies+0.123
 detectives+0.118
 setup+0.116
 university+0.115
Neg logit contributions
ags-0.156
opl-0.146
rell-0.142
icer-0.141
aters-0.137
Token the
Token ID262
Feature activation+5.17e-02

Pos logit contributions
 traveler+0.116
 spies+0.114
 setup+0.114
 university+0.108
 detectives+0.106
Neg logit contributions
ags-0.303
rell-0.283
opl-0.282
icer-0.273
aces-0.272
Token bird
Token ID6512
Feature activation-2.08e-02

Pos logit contributions
ags+0.101
opl+0.089
rell+0.088
icer+0.087
aters+0.084
Neg logit contributions
 traveler-0.085
 spies-0.081
 setup-0.080
 detectives-0.080
 troll-0.079
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 that+0.036
 portal+0.033
 rescue+0.033
 traveler+0.033
fire+0.033
Neg logit contributions
ags-0.043
agged-0.037
rell-0.035
affles-0.034
aces-0.034
Tokenage
Token ID496
Feature activation-4.05e-02

Pos logit contributions
icer+0.481
ags+0.479
wy+0.478
affles+0.458
ash+0.446
Neg logit contributions
 rescue-0.381
 ranger-0.375
 that-0.375
fire-0.374
 portal-0.368
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
 traveler+0.050
 spies+0.049
 setup+0.048
 portal+0.048
 university+0.047
Neg logit contributions
ags-0.094
rell-0.085
opl-0.085
icer-0.084
aters-0.081
Token They
Token ID1119
Feature activation-2.03e-02

Pos logit contributions
ags+0.019
aters+0.015
rell+0.015
icer+0.014
opl+0.014
Neg logit contributions
 traveler-0.028
 setup-0.027
 troll-0.026
 criminal-0.026
 portal-0.025
Token see
Token ID766
Feature activation-1.54e-01

Pos logit contributions
 traveler+0.020
 spies+0.018
 setup+0.018
 troll+0.018
 detectives+0.017
Neg logit contributions
ags-0.053
opl-0.048
rell-0.048
icer-0.047
aters-0.047
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
 traveler+0.182
 scientist+0.165
 setup+0.165
 criminal+0.164
 portal+0.163
Neg logit contributions
ags-0.371
opl-0.338
rell-0.331
c-0.327
icer-0.323
Token is
Token ID318
Feature activation+1.39e-02

Pos logit contributions
 traveler+0.031
 portal+0.029
 troll+0.029
 spies+0.029
 setup+0.029
Neg logit contributions
ags-0.046
rell-0.040
icer-0.039
opl-0.039
aters-0.039
Token sitting
Token ID5586
Feature activation+7.46e-02

Pos logit contributions
ags+0.029
opl+0.027
c+0.027
rell+0.026
icer+0.024
Neg logit contributions
 traveler-0.020
 criminal-0.019
 setup-0.018
 clues-0.018
 spies-0.018
Token in
Token ID287
Feature activation+2.84e-02

Pos logit contributions
ags+0.157
rell+0.145
opl+0.143
aters+0.139
icer+0.136
Neg logit contributions
 traveler-0.108
 spies-0.105
 detectives-0.101
 setup-0.101
 troll-0.100
Token the
Token ID262
Feature activation+5.26e-02

Pos logit contributions
ags+0.069
opl+0.062
rell+0.062
icer+0.061
aters+0.060
Neg logit contributions
 traveler-0.034
 spies-0.032
 setup-0.031
 detectives-0.031
 troll-0.030
Token bird
Token ID6512
Feature activation-2.09e-02

Pos logit contributions
ags+0.111
aces+0.096
rell+0.096
aters+0.096
agged+0.095
Neg logit contributions
 traveler-0.079
 portal-0.077
 university-0.075
 rescue-0.074
 troll-0.074
Tokenc
Token ID66
Feature activation+2.72e-01

Pos logit contributions
 that+0.052
 granted+0.051
 traveler+0.051
 portal+0.051
 rescue+0.050
Neg logit contributions
ags-0.024
agged-0.016
icer-0.015
affles-0.014
rell-0.014
Tokenage
Token ID496
Feature activation-4.13e-02

Pos logit contributions
icer+0.484
ags+0.474
wy+0.460
affles+0.439
ash+0.427
Neg logit contributions
 portal-0.372
 rescue-0.364
 troll-0.360
 that-0.352
 traveler-0.350
Token.
Token ID13
Feature activation+1.37e-03

Pos logit contributions
 traveler+0.052
 spies+0.049
 setup+0.049
 portal+0.048
 detectives+0.048
Neg logit contributions
ags-0.096
opl-0.086
rell-0.085
icer-0.084
aters-0.083
Token That
Token ID1320
Feature activation-7.10e-03

Pos logit contributions
ags+0.003
rell+0.003
opl+0.003
aters+0.003
icer+0.003
Neg logit contributions
 traveler-0.002
 spies-0.002
 setup-0.002
 troll-0.002
 portal-0.002
Token's
Token ID338
Feature activation+1.77e-01

Pos logit contributions
 traveler+0.011
 spies+0.010
 setup+0.010
 detectives+0.010
 troll+0.010
Neg logit contributions
ags-0.015
opl-0.013
rell-0.013
icer-0.013
aters-0.013
Token why
Token ID1521
Feature activation-1.35e-01

Pos logit contributions
ags+0.406
opl+0.364
rell+0.355
c+0.355
icer+0.345
Neg logit contributions
 traveler-0.232
 spies-0.217
 scientist-0.216
 detectives-0.213
 troll-0.213
Tokenage
Token ID496
Feature activation-4.06e-02

Pos logit contributions
ags+0.445
icer+0.440
wy+0.391
aters+0.390
ash+0.386
Neg logit contributions
 traveler-0.461
 troll-0.459
 portal-0.452
 spies-0.450
 that-0.437
Token.
Token ID13
Feature activation+4.93e-03

Pos logit contributions
 traveler+0.060
 spies+0.057
 setup+0.057
 portal+0.056
 troll+0.056
Neg logit contributions
ags-0.086
rell-0.076
opl-0.076
icer-0.074
aters-0.073
Token Inside
Token ID14384
Feature activation-9.71e-02

Pos logit contributions
ags+0.010
opl+0.009
rell+0.009
icer+0.009
aters+0.008
Neg logit contributions
 traveler-0.008
 spies-0.007
 troll-0.007
 setup-0.007
 detectives-0.007
Token the
Token ID262
Feature activation+5.86e-02

Pos logit contributions
 traveler+0.070
 university+0.068
 that+0.067
 setup+0.064
 troll+0.062
Neg logit contributions
ags-0.280
aters-0.250
opl-0.249
rell-0.249
aces-0.248
Token bird
Token ID6512
Feature activation-1.85e-02

Pos logit contributions
ags+0.131
opl+0.117
rell+0.117
icer+0.114
aters+0.112
Neg logit contributions
 traveler-0.081
 spies-0.077
 setup-0.075
 portal-0.074
 troll-0.074
Tokenc
Token ID66
Feature activation+2.71e-01

Pos logit contributions
 portal+0.042
 that+0.041
 traveler+0.041
 troll+0.041
 university+0.041
Neg logit contributions
ags-0.026
rell-0.018
agged-0.018
aters-0.018
icer-0.018
Tokenage
Token ID496
Feature activation-3.99e-02

Pos logit contributions
ags+0.453
icer+0.449
wy+0.437
ash+0.398
ksh+0.397
Neg logit contributions
 troll-0.429
 portal-0.425
 university-0.416
 rescue-0.416
 traveler-0.412
Token,
Token ID11
Feature activation-7.55e-02

Pos logit contributions
 traveler+0.029
 spies+0.028
 portal+0.027
 university+0.027
 troll+0.027
Neg logit contributions
ags-0.112
opl-0.103
icer-0.103
rell-0.102
aters-0.101
Token there
Token ID612
Feature activation-4.23e-02

Pos logit contributions
 traveler+0.075
 criminal+0.068
 setup+0.067
 portal+0.067
 scientist+0.067
Neg logit contributions
ags-0.193
icer-0.173
c-0.173
aces-0.172
rell-0.172
Token was
Token ID373
Feature activation+1.80e-02

Pos logit contributions
 traveler+0.045
 spies+0.043
 troll+0.043
 setup+0.041
 university+0.041
Neg logit contributions
ags-0.109
opl-0.097
rell-0.095
icer-0.094
aters-0.094
Token a
Token ID257
Feature activation-1.41e-01

Pos logit contributions
ags+0.045
c+0.041
rell+0.041
icer+0.040
aces+0.040
Neg logit contributions
 traveler-0.020
 scientist-0.018
 criminal-0.017
 portal-0.017
 setup-0.017
Token to
Token ID284
Feature activation-5.90e-02

Pos logit contributions
ags+0.120
rell+0.103
opl+0.103
aces+0.101
aters+0.100
Neg logit contributions
 traveler-0.099
 spies-0.097
 detectives-0.096
 portal-0.095
 scientist-0.095
Token remain
Token ID3520
Feature activation-2.78e-03

Pos logit contributions
 traveler+0.078
 spies+0.077
 detectives+0.074
 troll+0.073
 portal+0.073
Neg logit contributions
ags-0.132
opl-0.117
rell-0.115
c-0.111
aces-0.109
Token until
Token ID1566
Feature activation+6.07e-02

Pos logit contributions
 traveler+0.004
 spies+0.004
 setup+0.004
 detectives+0.004
 portal+0.004
Neg logit contributions
ags-0.006
rell-0.005
opl-0.005
icer-0.005
aters-0.005
Token another
Token ID1194
Feature activation-1.37e-01

Pos logit contributions
ags+0.137
opl+0.122
rell+0.121
icer+0.118
c+0.118
Neg logit contributions
 traveler-0.078
 spies-0.075
 troll-0.073
 portal-0.073
 detectives-0.073
Token weary
Token ID34730
Feature activation-1.01e-01

Pos logit contributions
 spies+0.280
 detectives+0.276
 traveler+0.274
 portal+0.266
 setup+0.266
Neg logit contributions
ags-0.192
opl-0.173
rell-0.167
c-0.163
aters-0.161
Token traveler
Token ID40168
Feature activation-3.22e-01

Pos logit contributions
 spies+0.156
 detectives+0.155
 traveler+0.146
 portal+0.146
 setup+0.143
Neg logit contributions
ags-0.190
opl-0.176
rell-0.175
aters-0.170
c-0.167
Token needed
Token ID2622
Feature activation-8.93e-02

Pos logit contributions
 traveler+0.449
 spies+0.442
 detectives+0.430
 setup+0.427
 portal+0.423
Neg logit contributions
ags-0.674
opl-0.608
rell-0.604
c-0.588
aces-0.585
Token it
Token ID340
Feature activation+4.66e-02

Pos logit contributions
 traveler+0.111
 spies+0.105
 troll+0.103
 setup+0.102
 detectives+0.102
Neg logit contributions
ags-0.207
rell-0.184
opl-0.183
aces-0.177
aters-0.177
Token.
Token ID13
Feature activation-7.77e-03

Pos logit contributions
ags+0.112
rell+0.101
opl+0.101
icer+0.098
aters+0.097
Neg logit contributions
 traveler-0.055
 spies-0.051
 setup-0.050
 troll-0.050
 detectives-0.050
Token<|endoftext|>
Token ID50256
Feature activation+1.02e-01

Pos logit contributions
 traveler+0.012
 spies+0.012
 setup+0.012
 troll+0.011
 detectives+0.011
Neg logit contributions
ags-0.016
rell-0.014
opl-0.014
icer-0.013
aters-0.013
TokenOnce
Token ID7454
Feature activation+5.93e-02

Pos logit contributions
ags+0.239
opl+0.213
rell+0.213
aters+0.206
icer+0.205
Neg logit contributions
 traveler-0.124
 spies-0.119
 portal-0.116
 university-0.116
 troll-0.115
Token oak
Token ID26210
Feature activation-3.96e-02

Pos logit contributions
ags+0.091
icer+0.086
opl+0.081
c+0.078
aters+0.071
Neg logit contributions
eware-0.078
 detectives-0.075
 traveler-0.075
 criminal-0.074
 spies-0.074
Token tree
Token ID5509
Feature activation+1.21e-02

Pos logit contributions
 traveler+0.061
 spies+0.059
 setup+0.057
 detectives+0.057
 troll+0.056
Neg logit contributions
ags-0.080
opl-0.072
rell-0.072
icer-0.070
c-0.068
Token,
Token ID11
Feature activation-8.27e-02

Pos logit contributions
ags+0.023
opl+0.020
rell+0.020
icer+0.020
c+0.019
Neg logit contributions
 traveler-0.020
 spies-0.019
 setup-0.019
 detectives-0.019
 troll-0.019
Token their
Token ID511
Feature activation+6.86e-02

Pos logit contributions
 traveler+0.094
 spies+0.091
 setup+0.089
 detectives+0.085
 university+0.084
Neg logit contributions
ags-0.208
rell-0.188
opl-0.187
icer-0.185
aters-0.181
Token secret
Token ID3200
Feature activation-6.27e-02

Pos logit contributions
ags+0.143
opl+0.126
icer+0.124
rell+0.122
aters+0.121
Neg logit contributions
 traveler-0.099
 spies-0.094
 detectives-0.093
 troll-0.092
 scientist-0.092
Token portal
Token ID17898
Feature activation-3.20e-01

Pos logit contributions
 traveler+0.092
 spies+0.091
 detectives+0.090
 setup+0.089
 criminal+0.089
Neg logit contributions
ags-0.122
aces-0.111
opl-0.109
c-0.109
aters-0.107
Token.
Token ID13
Feature activation+6.20e-03

Pos logit contributions
 traveler+0.433
 spies+0.423
 detectives+0.421
Who+0.411
 scientist+0.410
Neg logit contributions
ags-0.683
rell-0.612
c-0.612
aces-0.609
opl-0.602
Token They
Token ID1119
Feature activation-2.36e-02

Pos logit contributions
ags+0.012
opl+0.011
rell+0.011
icer+0.010
aters+0.010
Neg logit contributions
 traveler-0.010
 setup-0.010
 troll-0.010
 spies-0.010
 detectives-0.009
Token never
Token ID1239
Feature activation-6.80e-02

Pos logit contributions
 traveler+0.034
 spies+0.033
 detectives+0.032
 scientist+0.032
 criminal+0.032
Neg logit contributions
ags-0.048
rell-0.044
opl-0.043
c-0.042
icer-0.042
Token told
Token ID1297
Feature activation+3.74e-02

Pos logit contributions
 scientist+0.111
 traveler+0.107
 spies+0.106
 university+0.103
 detectives+0.102
Neg logit contributions
c-0.121
rell-0.118
ags-0.110
my-0.107
 pads-0.107
Token anyone
Token ID2687
Feature activation-1.11e-01

Pos logit contributions
ags+0.080
opl+0.069
rell+0.069
aters+0.065
aces+0.065
Neg logit contributions
 traveler-0.056
 scientist-0.052
 troll-0.051
 portal-0.051
 detectives-0.051
Token forgot
Token ID16453
Feature activation-1.03e-01

Pos logit contributions
 traveler+0.119
 scientist+0.117
fire+0.114
 spies+0.114
 university+0.112
Neg logit contributions
c-0.113
rell-0.109
my-0.105
ags-0.105
 pads-0.102
Token the
Token ID262
Feature activation+4.99e-02

Pos logit contributions
 traveler+0.139
 portal+0.128
 setup+0.128
 spies+0.127
 troll+0.126
Neg logit contributions
ags-0.231
opl-0.206
rell-0.205
aces-0.197
icer-0.196
Token lion
Token ID18744
Feature activation-7.96e-02

Pos logit contributions
ags+0.107
opl+0.096
rell+0.095
icer+0.093
c+0.090
Neg logit contributions
 traveler-0.067
 spies-0.065
 detectives-0.064
 portal-0.062
 setup-0.062
Token and
Token ID290
Feature activation+1.08e-01

Pos logit contributions
 traveler+0.114
 spies+0.108
 setup+0.108
 portal+0.106
 detectives+0.106
Neg logit contributions
ags-0.165
opl-0.150
rell-0.149
c-0.145
aces-0.144
Token his
Token ID465
Feature activation+1.36e-01

Pos logit contributions
ags+0.199
opl+0.182
aces+0.178
c+0.177
rell+0.175
Neg logit contributions
 traveler-0.164
 portal-0.158
eware-0.155
 spies-0.154
 setup-0.153
Token mercy
Token ID17703
Feature activation-3.20e-01

Pos logit contributions
ags+0.295
rell+0.263
opl+0.263
icer+0.257
aters+0.253
Neg logit contributions
 traveler-0.193
 spies-0.183
 setup-0.180
 troll-0.177
 detectives-0.177
Token.
Token ID13
Feature activation+7.53e-03

Pos logit contributions
 traveler+0.466
 spies+0.452
 setup+0.448
 detectives+0.446
 portal+0.444
Neg logit contributions
ags-0.645
opl-0.587
rell-0.580
c-0.576
icer-0.563
Token He
Token ID679
Feature activation-2.03e-03

Pos logit contributions
ags+0.014
opl+0.012
rell+0.011
icer+0.011
aters+0.011
Neg logit contributions
 traveler-0.014
 setup-0.013
 portal-0.013
 spies-0.013
 detectives-0.013
Token always
Token ID1464
Feature activation+2.43e-02

Pos logit contributions
 traveler+0.003
 spies+0.003
 setup+0.003
 detectives+0.003
 portal+0.003
Neg logit contributions
ags-0.005
opl-0.004
rell-0.004
icer-0.004
aters-0.004
Token tried
Token ID3088
Feature activation-2.37e-03

Pos logit contributions
ags+0.048
rell+0.043
c+0.043
opl+0.042
aces+0.041
Neg logit contributions
 traveler-0.036
 university-0.036
 spies-0.036
 criminal-0.035
 scientist-0.035
Token to
Token ID284
Feature activation-6.79e-02

Pos logit contributions
 traveler+0.003
 portal+0.003
 spies+0.003
 setup+0.003
 troll+0.003
Neg logit contributions
ags-0.006
opl-0.005
rell-0.005
icer-0.005
c-0.005
Token said
Token ID531
Feature activation+4.50e-02

Pos logit contributions
 traveler+0.003
 spies+0.003
 university+0.003
 portal+0.003
 criminal+0.003
Neg logit contributions
ags-0.004
c-0.004
icer-0.004
aces-0.004
rell-0.004
Token the
Token ID262
Feature activation+5.61e-02

Pos logit contributions
aces+0.089
ags+0.087
rell+0.086
c+0.083
allows+0.082
Neg logit contributions
 portal-0.059
 traveler-0.054
fire-0.052
 setup-0.052
 Wizard-0.050
Token mirror
Token ID10162
Feature activation+5.42e-02

Pos logit contributions
ags+0.112
opl+0.098
rell+0.097
icer+0.096
c+0.094
Neg logit contributions
 traveler-0.087
 spies-0.084
 setup-0.083
 detectives-0.082
 criminal-0.082
Token was
Token ID373
Feature activation+1.69e-02

Pos logit contributions
ags+0.114
rell+0.102
opl+0.102
icer+0.099
aces+0.098
Neg logit contributions
 traveler-0.078
 spies-0.075
 setup-0.074
 detectives-0.073
 portal-0.073
Token a
Token ID257
Feature activation-1.43e-01

Pos logit contributions
ags+0.038
opl+0.033
rell+0.033
icer+0.033
aces+0.032
Neg logit contributions
 traveler-0.022
 portal-0.021
 spies-0.020
 troll-0.020
 setup-0.020
Token portal
Token ID17898
Feature activation-3.18e-01

Pos logit contributions
 spies+0.211
eware+0.207
 detectives+0.206
 traveler+0.206
 setup+0.201
Neg logit contributions
ags-0.250
opl-0.241
icer-0.228
c-0.227
rell-0.212
Token to
Token ID284
Feature activation-6.14e-02

Pos logit contributions
Who+0.453
 detectives+0.440
 traveler+0.440
 spies+0.437
 troll+0.433
Neg logit contributions
ags-0.614
opl-0.572
c-0.570
rell-0.562
aces-0.559
Token another
Token ID1194
Feature activation-1.25e-01

Pos logit contributions
 traveler+0.080
 setup+0.077
 scientist+0.076
 criminal+0.075
 spies+0.075
Neg logit contributions
ags-0.141
opl-0.122
rell-0.120
c-0.118
ag-0.115
Token world
Token ID995
Feature activation+1.01e-02

Pos logit contributions
eware+0.191
 detectives+0.190
 spies+0.189
 setup+0.189
 criminal+0.185
Neg logit contributions
opl-0.219
c-0.218
ags-0.206
icer-0.199
rell-0.187
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
ags+0.019
opl+0.018
rell+0.018
c+0.017
aces+0.017
Neg logit contributions
 traveler-0.016
 setup-0.015
 spies-0.015
 detectives-0.015
 troll-0.015
Token He
Token ID679
Feature activation-2.24e-03

Pos logit contributions
ags+0.023
rell+0.019
opl+0.019
aces+0.019
icer+0.018
Neg logit contributions
 traveler-0.026
 setup-0.025
 portal-0.024
 troll-0.024
fire-0.023
Token very
Token ID845
Feature activation+2.69e-02

Pos logit contributions
ags+0.035
opl+0.031
rell+0.031
icer+0.030
aters+0.030
Neg logit contributions
 traveler-0.020
 spies-0.019
 setup-0.018
 portal-0.018
 detectives-0.018
Token scared
Token ID12008
Feature activation-3.90e-02

Pos logit contributions
ags+0.055
opl+0.049
rell+0.048
icer+0.047
c+0.047
Neg logit contributions
 traveler-0.040
 spies-0.038
 portal-0.038
 university-0.038
 setup-0.038
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
 traveler+0.069
 setup+0.066
 portal+0.065
 detectives+0.064
 troll+0.064
Neg logit contributions
ags-0.072
rell-0.063
opl-0.062
aters-0.060
aces-0.060
Token begged
Token ID32946
Feature activation+2.02e-02

Pos logit contributions
ags+0.231
rell+0.206
opl+0.203
c+0.199
aces+0.198
Neg logit contributions
 traveler-0.161
 spies-0.151
 university-0.150
 scientist-0.150
 portal-0.150
Token for
Token ID329
Feature activation+2.94e-03

Pos logit contributions
ags+0.040
rell+0.034
opl+0.034
icer+0.033
aces+0.033
Neg logit contributions
 traveler-0.033
 portal-0.032
 spies-0.031
 setup-0.031
 detectives-0.031
Token mercy
Token ID17703
Feature activation-3.17e-01

Pos logit contributions
ags+0.006
ag+0.005
allows+0.004
rell+0.004
aces+0.004
Neg logit contributions
 portal-0.005
eware-0.004
 traveler-0.004
 scientist-0.004
 Wizard-0.004
Token.
Token ID13
Feature activation+7.49e-03

Pos logit contributions
 traveler+0.480
 portal+0.458
 spies+0.451
 setup+0.449
 detectives+0.447
Neg logit contributions
ags-0.640
opl-0.572
rell-0.563
aces-0.552
c-0.551
Token He
Token ID679
Feature activation-3.38e-03

Pos logit contributions
ags+0.015
rell+0.013
opl+0.013
icer+0.013
aters+0.012
Neg logit contributions
 traveler-0.012
 spies-0.012
 setup-0.011
 portal-0.011
 detectives-0.011
Token said
Token ID531
Feature activation+4.32e-02

Pos logit contributions
 traveler+0.005
 university+0.005
 spies+0.005
 portal+0.005
 detectives+0.005
Neg logit contributions
ags-0.007
rell-0.006
c-0.006
opl-0.006
icer-0.006
Token,
Token ID11
Feature activation-7.89e-02

Pos logit contributions
ags+0.095
aces+0.085
rell+0.084
opl+0.083
c+0.082
Neg logit contributions
 traveler-0.057
 portal-0.056
 setup-0.052
 scientist-0.052
 spies-0.052
Token '
Token ID705
Feature activation-4.08e-02

Pos logit contributions
 traveler+0.061
 spies+0.058
 setup+0.057
 scientist+0.055
 university+0.054
Neg logit contributions
ags-0.222
opl-0.209
rell-0.207
icer-0.206
aters-0.206
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
 detectives+0.018
 traveler+0.018
 spies+0.018
 criminal+0.018
 setup+0.017
Neg logit contributions
ags-0.020
c-0.018
opl-0.018
rell-0.018
aters-0.017
Token The
Token ID383
Feature activation+8.04e-02

Pos logit contributions
ags+0.017
aces+0.014
rell+0.014
opl+0.014
aters+0.013
Neg logit contributions
 traveler-0.022
 setup-0.021
 diver-0.020
 criminal-0.020
 troll-0.020
Token light
Token ID1657
Feature activation-1.08e-02

Pos logit contributions
ags+0.148
opl+0.129
rell+0.128
icer+0.125
aters+0.122
Neg logit contributions
 traveler-0.142
 spies-0.137
 setup-0.135
 detectives-0.133
 troll-0.133
Token was
Token ID373
Feature activation+1.76e-02

Pos logit contributions
 traveler+0.016
 spies+0.016
 detectives+0.016
 criminal+0.015
 troll+0.015
Neg logit contributions
ags-0.021
rell-0.019
opl-0.019
c-0.019
aters-0.018
Token a
Token ID257
Feature activation-1.43e-01

Pos logit contributions
ags+0.038
opl+0.033
rell+0.033
aters+0.032
icer+0.032
Neg logit contributions
 traveler-0.026
 criminal-0.024
 troll-0.024
 spies-0.024
 scientist-0.024
Token portal
Token ID17898
Feature activation-3.17e-01

Pos logit contributions
eware+0.231
 detectives+0.230
 spies+0.229
 traveler+0.219
 criminal+0.219
Neg logit contributions
opl-0.213
c-0.209
ags-0.208
icer-0.204
rell-0.189
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
 detectives+0.532
 spies+0.524
 traveler+0.522
Who+0.515
 scientist+0.511
Neg logit contributions
ags-0.523
c-0.505
rell-0.494
aces-0.487
opl-0.473
Token It
Token ID632
Feature activation+3.66e-02

Pos logit contributions
ags+0.020
opl+0.017
rell+0.017
aces+0.017
aters+0.016
Neg logit contributions
 traveler-0.018
 setup-0.017
 troll-0.017
 diver-0.017
 spies-0.017
Token took
Token ID1718
Feature activation+4.31e-02

Pos logit contributions
ags+0.062
c+0.060
rell+0.058
opl+0.055
aces+0.054
Neg logit contributions
 traveler-0.062
 criminal-0.062
 spies-0.062
 detectives-0.061
 scientist-0.060
Token her
Token ID607
Feature activation+1.20e-01

Pos logit contributions
ags+0.093
rell+0.081
aces+0.079
c+0.078
opl+0.077
Neg logit contributions
 traveler-0.063
 setup-0.059
 scientist-0.059
 troll-0.058
 criminal-0.058
Token to
Token ID284
Feature activation-6.12e-02

Pos logit contributions
ags+0.283
rell+0.261
icer+0.256
opl+0.255
agged+0.249
Neg logit contributions
 traveler-0.152
 university-0.148
 spies-0.148
 portal-0.146
 clues-0.142
Token top
Token ID1353
Feature activation-2.09e-02

Pos logit contributions
ags+0.154
opl+0.136
rell+0.135
icer+0.132
aters+0.128
Neg logit contributions
 traveler-0.131
 spies-0.126
 setup-0.124
 detectives-0.123
 troll-0.123
Token had
Token ID550
Feature activation-2.08e-02

Pos logit contributions
 traveler+0.033
 spies+0.032
 detectives+0.032
 troll+0.031
 scientist+0.030
Neg logit contributions
ags-0.040
rell-0.037
opl-0.036
c-0.036
icer-0.035
Token become
Token ID1716
Feature activation-1.48e-02

Pos logit contributions
 traveler+0.030
 scientist+0.029
 spies+0.029
 detectives+0.028
 criminal+0.028
Neg logit contributions
ags-0.043
rell-0.042
aces-0.039
opl-0.038
c-0.038
Token a
Token ID257
Feature activation-1.48e-01

Pos logit contributions
 traveler+0.020
 troll+0.019
 criminal+0.019
 spies+0.018
 scientist+0.018
Neg logit contributions
ags-0.032
rell-0.030
c-0.029
aters-0.029
aces-0.029
Token magic
Token ID5536
Feature activation-4.94e-02

Pos logit contributions
eware+0.246
 spies+0.240
 detectives+0.229
Where+0.221
Who+0.219
Neg logit contributions
opl-0.212
icer-0.209
c-0.201
ags-0.189
rell-0.177
Token portal
Token ID17898
Feature activation-3.17e-01

Pos logit contributions
 spies+0.073
 detectives+0.073
 setup+0.070
Where+0.069
 traveler+0.068
Neg logit contributions
ags-0.090
opl-0.088
aces-0.083
rell-0.083
c-0.082
Token that
Token ID326
Feature activation-1.88e-01

Pos logit contributions
 traveler+0.485
 spies+0.481
 detectives+0.474
 troll+0.464
Who+0.461
Neg logit contributions
ags-0.601
rell-0.571
opl-0.566
c-0.562
aces-0.553
Token took
Token ID1718
Feature activation+3.77e-02

Pos logit contributions
 traveler+0.254
 spies+0.237
 troll+0.229
 detectives+0.228
 criminal+0.225
Neg logit contributions
ags-0.414
rell-0.381
opl-0.373
c-0.364
aces-0.362
Token her
Token ID607
Feature activation+1.15e-01

Pos logit contributions
ags+0.073
rell+0.064
aces+0.062
opl+0.061
c+0.060
Neg logit contributions
 traveler-0.063
 setup-0.059
 troll-0.059
 spies-0.059
 scientist-0.059
Token straight
Token ID3892
Feature activation-5.16e-02

Pos logit contributions
ags+0.261
rell+0.234
opl+0.232
icer+0.231
agged+0.224
Neg logit contributions
 traveler-0.154
 spies-0.148
 portal-0.148
 university-0.146
 setup-0.144
Token to
Token ID284
Feature activation-6.62e-02

Pos logit contributions
 traveler+0.083
 spies+0.081
 setup+0.079
 detectives+0.078
 troll+0.078
Neg logit contributions
ags-0.103
rell-0.091
opl-0.090
aces-0.087
aters-0.087
Token your
Token ID534
Feature activation+6.77e-02

Pos logit contributions
 traveler+0.168
 portal+0.164
 troll+0.153
 clues+0.150
 spies+0.149
Neg logit contributions
ags-0.303
opl-0.284
rell-0.279
c-0.271
aces-0.267
Token fault
Token ID8046
Feature activation+4.57e-02

Pos logit contributions
ags+0.155
rell+0.141
opl+0.138
icer+0.137
aters+0.133
Neg logit contributions
 traveler-0.091
 spies-0.085
 setup-0.085
 troll-0.083
 university-0.083
Token and
Token ID290
Feature activation+1.13e-01

Pos logit contributions
ags+0.109
icer+0.098
opl+0.098
rell+0.097
c+0.094
Neg logit contributions
 traveler-0.054
 spies-0.053
 setup-0.051
 detectives-0.050
 scientist-0.049
Token ask
Token ID1265
Feature activation-4.77e-02

Pos logit contributions
ags+0.224
opl+0.200
rell+0.190
aces+0.190
c+0.189
Neg logit contributions
 traveler-0.172
 portal-0.164
 troll-0.159
 scientist-0.159
 spies-0.158
Token for
Token ID329
Feature activation-3.51e-04

Pos logit contributions
 traveler+0.074
 spies+0.070
 portal+0.069
 troll+0.068
 setup+0.068
Neg logit contributions
ags-0.099
opl-0.085
rell-0.085
icer-0.083
aters-0.081
Token mercy
Token ID17703
Feature activation-3.17e-01

Pos logit contributions
 traveler+0.000
 portal+0.000
 troll+0.000
eware+0.000
 scientist+0.000
Neg logit contributions
ags-0.001
opl-0.001
ag-0.001
rell-0.001
aces-0.001
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
 traveler+0.420
 spies+0.399
 setup+0.391
 detectives+0.384
 troll+0.383
Neg logit contributions
ags-0.717
rell-0.643
opl-0.641
icer-0.628
aters-0.617
Token I
Token ID314
Feature activation-1.37e-01

Pos logit contributions
ags+0.026
rell+0.023
opl+0.023
icer+0.023
aters+0.022
Neg logit contributions
 traveler-0.020
 spies-0.019
 setup-0.018
 detectives-0.018
 troll-0.018
Token will
Token ID481
Feature activation+7.02e-03

Pos logit contributions
 traveler+0.204
 spies+0.193
 setup+0.190
 troll+0.187
 detectives+0.187
Neg logit contributions
ags-0.292
opl-0.258
rell-0.258
icer-0.252
aters-0.248
Token spare
Token ID13952
Feature activation-6.29e-02

Pos logit contributions
ags+0.015
opl+0.014
rell+0.013
ag+0.012
aters+0.012
Neg logit contributions
 traveler-0.009
 spies-0.009
 detectives-0.009
 troll-0.009
 portal-0.009
Token you
Token ID345
Feature activation-2.02e-02

Pos logit contributions
 traveler+0.087
 spies+0.083
 setup+0.081
 detectives+0.080
 troll+0.080
Neg logit contributions
ags-0.139
rell-0.125
opl-0.125
icer-0.122
aters-0.120
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
ags+0.180
rell+0.158
opl+0.151
aces+0.151
aters+0.149
Neg logit contributions
 traveler-0.159
 scientist-0.147
 portal-0.146
 setup-0.145
 troll-0.145
Token She
Token ID1375
Feature activation-1.76e-02

Pos logit contributions
ags+0.022
opl+0.019
rell+0.019
aces+0.018
aters+0.018
Neg logit contributions
 traveler-0.025
 setup-0.024
 troll-0.023
 criminal-0.023
 spies-0.023
Token skipped
Token ID26684
Feature activation-1.79e-02

Pos logit contributions
 traveler+0.026
 spies+0.025
 scientist+0.025
 criminal+0.025
 university+0.025
Neg logit contributions
ags-0.034
c-0.030
aces-0.030
rell-0.030
opl-0.029
Token into
Token ID656
Feature activation-3.66e-02

Pos logit contributions
 traveler+0.029
 troll+0.029
 setup+0.028
 criminal+0.028
 spies+0.028
Neg logit contributions
ags-0.035
rell-0.030
aces-0.028
opl-0.028
aters-0.028
Token the
Token ID262
Feature activation+5.87e-02

Pos logit contributions
 traveler+0.057
 criminal+0.055
eware+0.055
fire+0.054
 Wizard+0.053
Neg logit contributions
ags-0.073
opl-0.061
icer-0.060
c-0.060
rell-0.059
Token portal
Token ID17898
Feature activation-3.16e-01

Pos logit contributions
ags+0.109
opl+0.093
c+0.092
rell+0.089
icer+0.089
Neg logit contributions
 traveler-0.100
 spies-0.099
 criminal-0.097
 troll-0.096
 detectives-0.095
Token and
Token ID290
Feature activation+1.15e-01

Pos logit contributions
 traveler+0.511
 spies+0.497
 detectives+0.487
 troll+0.482
 criminal+0.482
Neg logit contributions
ags-0.607
rell-0.537
opl-0.535
c-0.529
aces-0.528
Token began
Token ID2540
Feature activation-8.95e-02

Pos logit contributions
ags+0.215
c+0.205
rell+0.187
aces+0.183
opl+0.175
Neg logit contributions
 criminal-0.170
 scientist-0.167
 traveler-0.165
 troll-0.162
 spies-0.158
Token her
Token ID607
Feature activation+1.21e-01

Pos logit contributions
 traveler+0.127
 criminal+0.121
 troll+0.121
 scientist+0.118
 diver+0.117
Neg logit contributions
ags-0.183
c-0.169
opl-0.160
rell-0.160
aces-0.156
Token story
Token ID1621
Feature activation+3.34e-02

Pos logit contributions
ags+0.252
opl+0.224
rell+0.221
icer+0.215
aters+0.214
Neg logit contributions
 traveler-0.181
 spies-0.171
 troll-0.168
 setup-0.168
 detectives-0.167
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
ags+0.063
aces+0.056
rell+0.056
opl+0.055
c+0.054
Neg logit contributions
 traveler-0.055
 troll-0.054
 detectives-0.052
 spies-0.052
 scientist-0.052
Token her
Token ID607
Feature activation+1.17e-01

Pos logit contributions
 traveler+0.065
 setup+0.064
 spies+0.063
 portal+0.063
 troll+0.063
Neg logit contributions
ags-0.067
rell-0.057
opl-0.055
aces-0.054
c-0.054
Token wand
Token ID11569
Feature activation-6.26e-02

Pos logit contributions
ags+0.256
opl+0.242
rell+0.242
icer+0.237
c+0.231
Neg logit contributions
 traveler-0.166
 spies-0.154
 portal-0.153
 setup-0.150
 scientist-0.148
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 traveler+0.097
 setup+0.091
 spies+0.091
 portal+0.090
 troll+0.088
Neg logit contributions
ags-0.130
opl-0.116
icer-0.115
rell-0.114
aters-0.110
Token a
Token ID257
Feature activation-1.46e-01

Pos logit contributions
c+0.160
rell+0.148
ags+0.147
aces+0.142
et+0.140
Neg logit contributions
 criminal-0.173
 university-0.167
eware-0.166
 scientist-0.163
 spies-0.161
Token magical
Token ID10883
Feature activation-8.88e-02

Pos logit contributions
 spies+0.212
 detectives+0.206
 traveler+0.195
eware+0.191
 setup+0.189
Neg logit contributions
ags-0.274
c-0.258
rell-0.252
opl-0.250
icer-0.242
Token portal
Token ID17898
Feature activation-3.16e-01

Pos logit contributions
 spies+0.125
 detectives+0.121
Where+0.111
Who+0.110
 setup+0.109
Neg logit contributions
ags-0.183
c-0.169
rell-0.166
opl-0.166
ag-0.161
Token opened
Token ID4721
Feature activation-2.94e-02

Pos logit contributions
 spies+0.455
 detectives+0.452
 traveler+0.442
 scientist+0.436
 criminal+0.432
Neg logit contributions
ags-0.613
aces-0.573
rell-0.573
c-0.569
opl-0.546
Token.
Token ID13
Feature activation+8.59e-04

Pos logit contributions
 traveler+0.044
 spies+0.042
 setup+0.041
 criminal+0.041
 scientist+0.041
Neg logit contributions
ags-0.061
opl-0.054
rell-0.054
c-0.053
icer-0.052
Token
Token ID198
Feature activation+1.41e-03

Pos logit contributions
ags+0.002
rell+0.001
opl+0.001
aces+0.001
aters+0.001
Neg logit contributions
 traveler-0.002
 setup-0.001
 spies-0.001
 troll-0.001
 detectives-0.001
Token
Token ID198
Feature activation+1.72e-03

Pos logit contributions
ags+0.003
opl+0.003
rell+0.003
icer+0.002
aters+0.002
Neg logit contributions
 traveler-0.002
 spies-0.002
 setup-0.002
 detectives-0.002
 troll-0.002
TokenThe
Token ID464
Feature activation+3.78e-02

Pos logit contributions
ags+0.003
opl+0.003
rell+0.003
c+0.003
icer+0.003
Neg logit contributions
 traveler-0.003
 spies-0.003
 setup-0.003
 troll-0.003
 detectives-0.003
Token!
Token ID0
Feature activation-8.06e-02

Pos logit contributions
 traveler+0.015
 spies+0.015
 portal+0.014
 detectives+0.014
 troll+0.014
Neg logit contributions
ags-0.013
c-0.011
opl-0.010
rell-0.010
aters-0.010
Token He
Token ID679
Feature activation-3.17e-03

Pos logit contributions
 traveler+0.158
 setup+0.154
 spies+0.154
 detectives+0.149
 troll+0.147
Neg logit contributions
ags-0.130
rell-0.113
opl-0.111
aces-0.108
aters-0.105
Token stepped
Token ID10764
Feature activation-5.17e-02

Pos logit contributions
 spies+0.005
 detectives+0.005
 criminal+0.005
 traveler+0.005
 university+0.005
Neg logit contributions
ags-0.006
c-0.005
rell-0.005
opl-0.005
aces-0.005
Token through
Token ID832
Feature activation-2.12e-02

Pos logit contributions
 traveler+0.087
 spies+0.086
 troll+0.085
 setup+0.084
 criminal+0.083
Neg logit contributions
ags-0.098
rell-0.081
icer-0.079
agged-0.078
c-0.078
Token the
Token ID262
Feature activation+5.40e-02

Pos logit contributions
 traveler+0.033
eware+0.033
 diver+0.032
 setup+0.032
 criminal+0.032
Neg logit contributions
ags-0.038
c-0.034
opl-0.034
aces-0.033
rell-0.033
Token portal
Token ID17898
Feature activation-3.16e-01

Pos logit contributions
ags+0.114
opl+0.103
rell+0.102
c+0.101
icer+0.096
Neg logit contributions
 spies-0.077
 traveler-0.073
 detectives-0.072
 diver-0.071
 criminal-0.071
Token,
Token ID11
Feature activation-8.13e-02

Pos logit contributions
 traveler+0.535
 spies+0.526
 detectives+0.514
 troll+0.507
 setup+0.505
Neg logit contributions
ags-0.571
rell-0.513
opl-0.511
c-0.491
aces-0.487
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 traveler+0.115
 spies+0.111
 troll+0.108
 detectives+0.108
 criminal+0.108
Neg logit contributions
ags-0.171
opl-0.152
rell-0.152
c-0.147
icer-0.146
Token landed
Token ID11406
Feature activation-8.93e-02

Pos logit contributions
ags+0.191
c+0.187
aces+0.177
rell+0.177
et+0.171
Neg logit contributions
 spies-0.147
 criminal-0.147
 traveler-0.143
 detectives-0.141
 scientist-0.139
Token in
Token ID287
Feature activation+2.64e-02

Pos logit contributions
 traveler+0.151
 setup+0.147
 spies+0.145
 detectives+0.142
 criminal+0.142
Neg logit contributions
ags-0.168
opl-0.141
aces-0.141
rell-0.139
icer-0.138
Token the
Token ID262
Feature activation+5.44e-02

Pos logit contributions
ags+0.058
opl+0.050
icer+0.050
c+0.049
rell+0.048
Neg logit contributions
eware-0.035
 traveler-0.034
 detectives-0.033
 spies-0.033
 criminal-0.033
Token He
Token ID679
Feature activation-4.79e-03

Pos logit contributions
ags+0.015
icer+0.012
rell+0.012
aters+0.012
opl+0.012
Neg logit contributions
 setup-0.018
 traveler-0.018
 portal-0.017
 troll-0.016
 reality-0.016
Token said
Token ID531
Feature activation+4.18e-02

Pos logit contributions
 university+0.008
 traveler+0.007
 portal+0.007
 spies+0.007
 detectives+0.007
Neg logit contributions
ags-0.009
c-0.008
aces-0.008
rell-0.008
icer-0.008
Token he
Token ID339
Feature activation-3.50e-02

Pos logit contributions
ags+0.097
aces+0.092
rell+0.091
allows+0.089
c+0.088
Neg logit contributions
 portal-0.045
 traveler-0.043
 scientist-0.040
 setup-0.040
 criminal-0.039
Token was
Token ID373
Feature activation+1.12e-02

Pos logit contributions
 traveler+0.051
 spies+0.049
 setup+0.048
 portal+0.048
 university+0.048
Neg logit contributions
ags-0.073
rell-0.065
opl-0.065
icer-0.064
c-0.063
Token a
Token ID257
Feature activation-1.49e-01

Pos logit contributions
ags+0.025
opl+0.021
rell+0.021
icer+0.021
aces+0.021
Neg logit contributions
 traveler-0.015
 portal-0.015
 setup-0.014
eware-0.014
 troll-0.014
Token traveler
Token ID40168
Feature activation-3.13e-01

Pos logit contributions
eware+0.235
 occurred+0.206
 portal+0.203
 arose+0.197
cess+0.197
Neg logit contributions
ags-0.231
opl-0.216
c-0.214
icer-0.210
ish-0.193
Token and
Token ID290
Feature activation+1.09e-01

Pos logit contributions
 portal+0.484
 setup+0.478
eware+0.474
 detectives+0.468
 traveler+0.467
Neg logit contributions
ags-0.561
c-0.504
opl-0.495
aces-0.490
rell-0.480
Token he
Token ID339
Feature activation-3.48e-02

Pos logit contributions
ags+0.207
aces+0.201
c+0.195
rell+0.177
opl+0.177
Neg logit contributions
 portal-0.139
 setup-0.138
 traveler-0.133
eware-0.131
 criminal-0.130
Token liked
Token ID8288
Feature activation+7.47e-02

Pos logit contributions
 traveler+0.049
 spies+0.047
 setup+0.046
 detectives+0.046
 portal+0.046
Neg logit contributions
ags-0.074
opl-0.066
rell-0.066
icer-0.065
aters-0.063
Token to
Token ID284
Feature activation-6.57e-02

Pos logit contributions
ags+0.177
opl+0.160
rell+0.158
icer+0.155
aters+0.154
Neg logit contributions
 traveler-0.089
 portal-0.083
 setup-0.082
 spies-0.081
 detectives-0.079
Token play
Token ID711
Feature activation+5.90e-02

Pos logit contributions
 traveler+0.092
 setup+0.091
 spies+0.090
 detectives+0.087
 portal+0.087
Neg logit contributions
ags-0.142
opl-0.127
rell-0.125
c-0.122
aters-0.119
Token day
Token ID1110
Feature activation+5.35e-02

Pos logit contributions
ags+0.064
opl+0.058
rell+0.056
icer+0.056
c+0.054
Neg logit contributions
 traveler-0.072
 spies-0.071
 detectives-0.070
 troll-0.068
 portal-0.066
Token,
Token ID11
Feature activation-9.70e-02

Pos logit contributions
ags+0.112
rell+0.104
opl+0.101
aters+0.100
c+0.100
Neg logit contributions
 traveler-0.073
 troll-0.068
 setup-0.068
 portal-0.067
 spies-0.067
Token Max
Token ID5436
Feature activation-2.03e-03

Pos logit contributions
 portal+0.128
 traveler+0.128
 troll+0.120
 setup+0.120
 spies+0.118
Neg logit contributions
ags-0.196
icer-0.180
rell-0.179
c-0.175
aters-0.174
Token thanked
Token ID26280
Feature activation-2.65e-02

Pos logit contributions
 traveler+0.003
 spies+0.003
 setup+0.003
 detectives+0.003
 portal+0.003
Neg logit contributions
ags-0.004
rell-0.004
opl-0.004
icer-0.003
c-0.003
Token the
Token ID262
Feature activation+4.09e-02

Pos logit contributions
 traveler+0.042
 spies+0.040
 detectives+0.040
 setup+0.040
 portal+0.040
Neg logit contributions
ags-0.054
rell-0.046
opl-0.046
icer-0.043
aters-0.043
Token Wizard
Token ID16884
Feature activation-3.12e-01

Pos logit contributions
ags+0.087
opl+0.080
rell+0.077
icer+0.077
c+0.075
Neg logit contributions
 traveler-0.052
 spies-0.050
 detectives-0.050
 portal-0.049
 setup-0.047
Token as
Token ID355
Feature activation-1.09e-01

Pos logit contributions
 traveler+0.486
 spies+0.460
 setup+0.456
 troll+0.449
 detectives+0.445
Neg logit contributions
ags-0.644
rell-0.568
opl-0.568
icer-0.554
aters-0.544
Token he
Token ID339
Feature activation-3.79e-02

Pos logit contributions
 portal+0.142
 setup+0.139
 traveler+0.139
 spies+0.134
 criminal+0.133
Neg logit contributions
ags-0.216
opl-0.204
c-0.200
aces-0.196
rell-0.196
Token handed
Token ID10158
Feature activation-6.75e-02

Pos logit contributions
 traveler+0.054
 spies+0.054
 detectives+0.052
 university+0.052
 setup+0.051
Neg logit contributions
ags-0.078
rell-0.070
opl-0.069
c-0.069
icer-0.068
Token him
Token ID683
Feature activation+9.31e-02

Pos logit contributions
 traveler+0.102
 spies+0.097
 setup+0.095
 troll+0.094
 detectives+0.094
Neg logit contributions
ags-0.141
opl-0.126
rell-0.126
icer-0.122
aters-0.121
Token his
Token ID465
Feature activation+1.30e-01

Pos logit contributions
ags+0.231
opl+0.209
rell+0.209
icer+0.204
aters+0.201
Neg logit contributions
 traveler-0.104
 spies-0.097
 setup-0.095
 troll-0.093
 detectives-0.093
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
ags+0.088
rell+0.078
opl+0.078
aters+0.076
aces+0.076
Neg logit contributions
 traveler-0.065
 troll-0.062
 portal-0.061
 spies-0.061
 setup-0.061
Token It
Token ID632
Feature activation+3.69e-02

Pos logit contributions
ags+0.028
rell+0.025
opl+0.025
icer+0.025
aters+0.024
Neg logit contributions
 traveler-0.015
 spies-0.014
 detectives-0.014
 setup-0.014
 troll-0.014
Token is
Token ID318
Feature activation+1.48e-02

Pos logit contributions
ags+0.078
opl+0.066
icer+0.065
aters+0.064
rell+0.064
Neg logit contributions
 traveler-0.059
 setup-0.055
 spies-0.055
 troll-0.055
 portal-0.054
Token called
Token ID1444
Feature activation-7.51e-02

Pos logit contributions
ags+0.033
opl+0.030
rell+0.029
icer+0.029
aters+0.029
Neg logit contributions
 traveler-0.020
 spies-0.018
 setup-0.018
 troll-0.018
 portal-0.018
Token The
Token ID383
Feature activation+7.80e-02

Pos logit contributions
 traveler+0.094
 spies+0.088
 setup+0.087
 troll+0.086
 portal+0.085
Neg logit contributions
ags-0.176
rell-0.157
opl-0.157
icer-0.153
aters-0.152
Token Wizard
Token ID16884
Feature activation-3.12e-01

Pos logit contributions
ags+0.161
opl+0.140
icer+0.139
rell+0.137
aters+0.134
Neg logit contributions
 traveler-0.113
 spies-0.109
 portal-0.108
 detectives-0.108
 troll-0.107
Token of
Token ID286
Feature activation-2.97e-02

Pos logit contributions
 traveler+0.435
 spies+0.416
 setup+0.409
 portal+0.404
 detectives+0.403
Neg logit contributions
ags-0.673
opl-0.603
rell-0.601
icer-0.587
aters-0.579
Token Oz
Token ID18024
Feature activation+5.85e-03

Pos logit contributions
 traveler+0.039
 spies+0.036
 setup+0.036
 portal+0.036
 detectives+0.035
Neg logit contributions
ags-0.068
opl-0.061
rell-0.060
icer-0.059
aters-0.058
Token.
Token ID13
Feature activation+6.36e-03

Pos logit contributions
ags+0.013
rell+0.011
opl+0.011
icer+0.011
aters+0.011
Neg logit contributions
 traveler-0.008
 spies-0.008
 setup-0.008
 troll-0.008
 detectives-0.008
Token Do
Token ID2141
Feature activation-3.77e-02

Pos logit contributions
ags+0.013
opl+0.012
rell+0.012
icer+0.012
aters+0.011
Neg logit contributions
 traveler-0.009
 spies-0.009
 setup-0.009
 troll-0.009
 portal-0.009
Token you
Token ID345
Feature activation-2.16e-02

Pos logit contributions
 traveler+0.045
 spies+0.043
 setup+0.042
 troll+0.041
 detectives+0.041
Neg logit contributions
ags-0.091
opl-0.082
rell-0.082
icer-0.080
aters-0.079
Token him
Token ID683
Feature activation+9.31e-02

Pos logit contributions
 traveler+0.102
 spies+0.097
 setup+0.095
 troll+0.094
 detectives+0.094
Neg logit contributions
ags-0.141
opl-0.126
rell-0.126
icer-0.122
aters-0.121
Token his
Token ID465
Feature activation+1.30e-01

Pos logit contributions
ags+0.231
opl+0.209
rell+0.209
icer+0.204
aters+0.201
Neg logit contributions
 traveler-0.104
 spies-0.097
 setup-0.095
 troll-0.093
 detectives-0.093
Token match
Token ID2872
Feature activation+2.35e-02

Pos logit contributions
ags+0.277
opl+0.246
rell+0.246
icer+0.240
aters+0.236
Neg logit contributions
 traveler-0.189
 spies-0.180
 setup-0.177
 detectives-0.174
 troll-0.174
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
ags+0.052
opl+0.047
rell+0.047
icer+0.046
aters+0.045
Neg logit contributions
 traveler-0.032
 spies-0.030
 setup-0.030
 detectives-0.029
 portal-0.029
Token The
Token ID383
Feature activation+6.93e-02

Pos logit contributions
 traveler+0.021
 spies+0.020
 setup+0.020
 troll+0.020
 portal+0.020
Neg logit contributions
ags-0.021
rell-0.018
opl-0.018
aters-0.017
icer-0.017
Token Wizard
Token ID16884
Feature activation-3.11e-01

Pos logit contributions
ags+0.160
opl+0.147
icer+0.143
rell+0.141
at+0.137
Neg logit contributions
 spies-0.077
 detectives-0.069
 portal-0.066
Where-0.066
eware-0.066
Token smiled
Token ID13541
Feature activation+5.42e-02

Pos logit contributions
 traveler+0.486
 spies+0.481
 detectives+0.467
 university+0.465
 portal+0.463
Neg logit contributions
ags-0.593
rell-0.529
opl-0.523
c-0.516
icer-0.515
Token and
Token ID290
Feature activation+9.60e-02

Pos logit contributions
ags+0.087
rell+0.073
aters+0.073
opl+0.071
c+0.070
Neg logit contributions
 traveler-0.100
 troll-0.099
 university-0.099
 spies-0.099
 criminal-0.098
Token said
Token ID531
Feature activation+3.48e-02

Pos logit contributions
ags+0.146
rell+0.142
c+0.136
opl+0.131
aces+0.128
Neg logit contributions
 university-0.170
 spies-0.168
 portal-0.166
 detectives-0.164
eware-0.162
Token,
Token ID11
Feature activation-9.43e-02

Pos logit contributions
ags+0.062
aces+0.061
rell+0.058
ag+0.054
c+0.054
Neg logit contributions
 portal-0.053
 setup-0.052
 traveler-0.052
 spies-0.051
 troll-0.050
Token "
Token ID366
Feature activation-2.42e-02

Pos logit contributions
 traveler+0.119
 spies+0.114
 detectives+0.111
 portal+0.110
 troll+0.110
Neg logit contributions
ags-0.214
rell-0.195
opl-0.194
icer-0.186
aters-0.185
Token Tom
Token ID4186
Feature activation+8.21e-03

Pos logit contributions
ags+0.268
rell+0.238
opl+0.235
icer+0.228
aces+0.227
Neg logit contributions
 traveler-0.150
 portal-0.142
 scientist-0.137
 troll-0.136
 spies-0.135
Token cried
Token ID16896
Feature activation+4.56e-02

Pos logit contributions
ags+0.015
rell+0.013
opl+0.013
icer+0.013
aces+0.013
Neg logit contributions
 traveler-0.014
 spies-0.014
 detectives-0.013
 portal-0.013
 setup-0.013
Token and
Token ID290
Feature activation+1.15e-01

Pos logit contributions
ags+0.099
opl+0.089
rell+0.088
icer+0.087
aters+0.085
Neg logit contributions
 traveler-0.065
 spies-0.062
 setup-0.061
 detectives-0.060
 troll-0.060
Token begged
Token ID32946
Feature activation+2.25e-02

Pos logit contributions
ags+0.189
rell+0.152
aces+0.151
agged+0.150
ag+0.133
Neg logit contributions
 portal-0.218
 scientist-0.216
 reality-0.216
 traveler-0.214
eware-0.213
Token for
Token ID329
Feature activation+4.08e-03

Pos logit contributions
ags+0.049
rell+0.043
opl+0.042
icer+0.041
aters+0.041
Neg logit contributions
 traveler-0.033
 portal-0.031
 spies-0.030
 troll-0.030
 setup-0.030
Token mercy
Token ID17703
Feature activation-3.11e-01

Pos logit contributions
ags+0.008
ag+0.006
allows+0.006
rell+0.006
ittens+0.006
Neg logit contributions
 portal-0.007
 traveler-0.007
eware-0.006
 troll-0.006
 Wizard-0.006
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
 traveler+0.486
 portal+0.467
 troll+0.451
 detectives+0.451
 spies+0.451
Neg logit contributions
ags-0.616
opl-0.547
rell-0.531
icer-0.524
aces-0.521
Token They
Token ID1119
Feature activation-2.09e-02

Pos logit contributions
ags+0.022
rell+0.018
opl+0.018
agged+0.017
icer+0.017
Neg logit contributions
 traveler-0.029
 portal-0.028
 setup-0.027
 troll-0.027
 reality-0.027
Token wished
Token ID16555
Feature activation-1.23e-01

Pos logit contributions
 traveler+0.031
 spies+0.029
 portal+0.029
 university+0.029
 detectives+0.029
Neg logit contributions
ags-0.043
rell-0.039
opl-0.037
aces-0.037
icer-0.037
Token they
Token ID484
Feature activation-5.39e-02

Pos logit contributions
 traveler+0.091
 portal+0.083
 troll+0.079
 spies+0.076
 detectives+0.076
Neg logit contributions
ags-0.353
rell-0.321
opl-0.319
icer-0.313
aces-0.311
Token had
Token ID550
Feature activation-1.70e-02

Pos logit contributions
 traveler+0.077
 troll+0.073
 spies+0.073
 portal+0.073
 detectives+0.072
Neg logit contributions
ags-0.114
rell-0.102
opl-0.099
icer-0.099
aces-0.098
Token
Token ID198
Feature activation+7.14e-03

Pos logit contributions
ags+0.014
opl+0.013
rell+0.013
icer+0.012
aters+0.012
Neg logit contributions
 traveler-0.010
 spies-0.010
 setup-0.010
 detectives-0.010
 troll-0.010
Token"
Token ID1
Feature activation-3.61e-02

Pos logit contributions
ags+0.014
opl+0.012
c+0.012
rell+0.012
icer+0.012
Neg logit contributions
 traveler-0.011
 setup-0.011
 spies-0.011
 troll-0.011
 portal-0.011
TokenI
Token ID40
Feature activation-1.07e-01

Pos logit contributions
 traveler+0.050
 setup+0.048
 portal+0.047
 spies+0.047
 troll+0.047
Neg logit contributions
ags-0.078
opl-0.071
rell-0.070
icer-0.068
aters-0.067
Token'm
Token ID1101
Feature activation-5.81e-02

Pos logit contributions
 spies+0.163
 traveler+0.161
 setup+0.160
 portal+0.159
 detectives+0.157
Neg logit contributions
ags-0.191
c-0.190
opl-0.188
rell-0.184
icer-0.177
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
 portal+0.078
 traveler+0.078
 setup+0.074
eware+0.073
fire+0.073
Neg logit contributions
ags-0.126
opl-0.113
rell-0.110
aces-0.105
c-0.105
Token traveler
Token ID40168
Feature activation-3.10e-01

Pos logit contributions
 setup+0.197
eware+0.193
 portal+0.187
ocated+0.186
 spies+0.184
Neg logit contributions
opl-0.250
ags-0.248
c-0.230
icer-0.224
ab-0.213
Token,"
Token ID553
Feature activation-5.43e-02

Pos logit contributions
 portal+0.464
 setup+0.450
Who+0.439
Where+0.439
eware+0.434
Neg logit contributions
ags-0.587
c-0.528
opl-0.522
aces-0.517
rell-0.514
Token the
Token ID262
Feature activation+5.54e-02

Pos logit contributions
 traveler+0.085
 portal+0.084
 setup+0.079
fire+0.078
 detectives+0.078
Neg logit contributions
ags-0.100
c-0.092
rell-0.092
opl-0.086
icer-0.086
Token man
Token ID582
Feature activation-1.08e-01

Pos logit contributions
ags+0.104
opl+0.084
rell+0.078
ag+0.077
aces+0.076
Neg logit contributions
eware-0.081
 reality-0.080
 setup-0.074
Jud-0.072
 diver-0.072
Token said
Token ID531
Feature activation+4.37e-02

Pos logit contributions
 traveler+0.157
 spies+0.152
 setup+0.149
 university+0.149
 detectives+0.148
Neg logit contributions
ags-0.222
rell-0.202
c-0.199
opl-0.197
aces-0.194
Token.
Token ID13
Feature activation+9.96e-03

Pos logit contributions
ags+0.083
aces+0.072
rell+0.072
agged+0.068
ag+0.068
Neg logit contributions
 traveler-0.076
 portal-0.072
 university-0.071
 setup-0.070
 troll-0.070
TokenWow
Token ID22017
Feature activation-3.77e-02

Pos logit contributions
 traveler+0.049
 spies+0.047
 setup+0.046
 troll+0.045
 detectives+0.045
Neg logit contributions
ags-0.092
opl-0.082
rell-0.082
icer-0.081
aters-0.080
Token,
Token ID11
Feature activation-8.35e-02

Pos logit contributions
 traveler+0.056
 criminal+0.052
 troll+0.052
 scientist+0.052
fire+0.051
Neg logit contributions
ags-0.079
rell-0.073
icer-0.071
aces-0.070
aters-0.069
Token you
Token ID345
Feature activation-1.35e-02

Pos logit contributions
 portal+0.111
fire+0.110
 traveler+0.108
 criminal+0.108
 scientist+0.108
Neg logit contributions
ags-0.150
icer-0.135
c-0.133
aces-0.132
rell-0.131
Token are
Token ID389
Feature activation+7.50e-02

Pos logit contributions
 traveler+0.027
 criminal+0.025
 spies+0.025
 ranger+0.025
 setup+0.025
Neg logit contributions
ags-0.022
rell-0.019
icer-0.019
aces-0.018
opl-0.018
Token a
Token ID257
Feature activation-1.41e-01

Pos logit contributions
ags+0.141
rell+0.121
aces+0.115
opl+0.113
icer+0.109
Neg logit contributions
 traveler-0.141
 portal-0.130
fire-0.128
eware-0.127
 criminal-0.126
Token traveler
Token ID40168
Feature activation-3.10e-01

Pos logit contributions
eware+0.206
 occurred+0.199
 setup+0.195
fire+0.194
 entering+0.193
Neg logit contributions
ags-0.228
icer-0.227
opl-0.222
c-0.207
ish-0.201
Token?
Token ID30
Feature activation-3.96e-02

Pos logit contributions
 portal+0.454
Where+0.451
Who+0.443
 setup+0.441
 clues+0.437
Neg logit contributions
ags-0.569
c-0.529
opl-0.527
rell-0.512
icer-0.508
Token Where
Token ID6350
Feature activation-2.76e-01

Pos logit contributions
 setup+0.062
 traveler+0.059
 troll+0.058
 reality+0.057
 clues+0.057
Neg logit contributions
ags-0.075
c-0.068
opl-0.068
aters-0.067
rell-0.067
Token did
Token ID750
Feature activation+5.27e-02

Pos logit contributions
 traveler+0.535
 setup+0.509
 spies+0.507
 scientist+0.505
 troll+0.502
Neg logit contributions
ags-0.440
rell-0.377
opl-0.376
aces-0.373
c-0.371
Token you
Token ID345
Feature activation-1.89e-02

Pos logit contributions
c+0.081
rell+0.080
ags+0.080
opl+0.079
aces+0.074
Neg logit contributions
 scientist-0.090
 traveler-0.086
 setup-0.086
 diver-0.085
 Wizard-0.083
Token come
Token ID1282
Feature activation-6.15e-03

Pos logit contributions
 traveler+0.025
 setup+0.024
 spies+0.024
 criminal+0.023
 troll+0.023
Neg logit contributions
ags-0.041
opl-0.037
rell-0.037
c-0.035
aces-0.035
Token Rabbit
Token ID25498
Feature activation-4.50e-02

Pos logit contributions
ags+0.103
opl+0.092
rell+0.091
icer+0.091
c+0.086
Neg logit contributions
 traveler-0.069
 portal-0.068
 spies-0.068
 university-0.067
 setup-0.067
Token took
Token ID1718
Feature activation+3.77e-02

Pos logit contributions
 spies+0.076
 university+0.075
 traveler+0.073
 detectives+0.073
 portal+0.071
Neg logit contributions
ags-0.075
c-0.072
rell-0.071
aces-0.067
aters-0.066
Token Max
Token ID5436
Feature activation-4.44e-05

Pos logit contributions
ags+0.087
rell+0.077
opl+0.076
icer+0.074
c+0.073
Neg logit contributions
 traveler-0.050
 setup-0.047
 spies-0.046
 portal-0.046
 troll-0.045
Token to
Token ID284
Feature activation-6.61e-02

Pos logit contributions
 traveler+0.000
 spies+0.000
 setup+0.000
 detectives+0.000
 troll+0.000
Neg logit contributions
ags-0.000
opl-0.000
rell-0.000
icer-0.000
aters-0.000
Token the
Token ID262
Feature activation+5.06e-02

Pos logit contributions
eware+0.117
 traveler+0.112
 portal+0.108
 diver+0.106
 reporter+0.105
Neg logit contributions
ags-0.119
opl-0.097
rell-0.093
c-0.091
ag-0.089
Token Wizard
Token ID16884
Feature activation-3.09e-01

Pos logit contributions
ags+0.094
c+0.082
opl+0.079
icer+0.078
ag+0.073
Neg logit contributions
eware-0.076
 spies-0.068
 occurred-0.067
 setup-0.067
 explorers-0.067
Token who
Token ID508
Feature activation-4.51e-02

Pos logit contributions
 traveler+0.502
 spies+0.483
 setup+0.475
 portal+0.473
 detectives+0.470
Neg logit contributions
ags-0.592
opl-0.521
rell-0.521
icer-0.512
aters-0.501
Token said
Token ID531
Feature activation+3.66e-02

Pos logit contributions
 traveler+0.067
 spies+0.067
 detectives+0.064
 setup+0.063
 portal+0.063
Neg logit contributions
ags-0.089
opl-0.082
rell-0.081
icer-0.081
c-0.080
Token,
Token ID11
Feature activation-8.43e-02

Pos logit contributions
ags+0.075
aces+0.071
rell+0.069
opl+0.066
c+0.066
Neg logit contributions
 traveler-0.049
 portal-0.048
 setup-0.046
 spies-0.046
 troll-0.045
Token "
Token ID366
Feature activation-2.09e-02

Pos logit contributions
 traveler+0.102
 spies+0.096
 portal+0.094
 setup+0.093
 troll+0.093
Neg logit contributions
ags-0.197
rell-0.179
opl-0.177
icer-0.174
aters-0.171
TokenI
Token ID40
Feature activation-1.10e-01

Pos logit contributions
 traveler+0.025
 setup+0.023
 spies+0.023
 troll+0.022
 scientist+0.022
Neg logit contributions
ags-0.051
opl-0.046
rell-0.046
icer-0.045
aters-0.044
Token
Token ID198
Feature activation+1.27e-02

Pos logit contributions
ags+0.023
opl+0.019
rell+0.019
c+0.018
icer+0.018
Neg logit contributions
 traveler-0.025
 setup-0.025
 troll-0.024
 detectives-0.024
 spies-0.024
Token"
Token ID1
Feature activation-3.12e-02

Pos logit contributions
ags+0.020
c+0.017
icer+0.016
agged+0.015
opl+0.015
Neg logit contributions
 reality-0.023
 troll-0.023
 setup-0.023
 portal-0.023
 traveler-0.023
TokenI
Token ID40
Feature activation-1.05e-01

Pos logit contributions
 setup+0.045
 portal+0.045
 traveler+0.044
 troll+0.043
 reality+0.043
Neg logit contributions
ags-0.059
opl-0.052
icer-0.051
aces-0.051
c-0.051
Token am
Token ID716
Feature activation-8.23e-02

Pos logit contributions
 portal+0.170
 setup+0.168
 traveler+0.163
 university+0.163
 scientist+0.162
Neg logit contributions
c-0.171
ags-0.170
rell-0.161
opl-0.160
icer-0.159
Token a
Token ID257
Feature activation-1.41e-01

Pos logit contributions
 portal+0.111
 traveler+0.110
 clues+0.105
Where+0.104
 setup+0.103
Neg logit contributions
ags-0.174
opl-0.156
rell-0.151
c-0.149
icer-0.145
Token traveler
Token ID40168
Feature activation-3.08e-01

Pos logit contributions
eware+0.187
 setup+0.180
ocated+0.175
 portal+0.166
 clues+0.166
Neg logit contributions
ags-0.238
opl-0.236
icer-0.227
c-0.224
ish-0.209
Token,"
Token ID553
Feature activation-5.03e-02

Pos logit contributions
 portal+0.481
eware+0.469
 setup+0.464
Who+0.458
Where+0.457
Neg logit contributions
ags-0.558
c-0.497
icer-0.484
opl-0.481
aces-0.480
Token the
Token ID262
Feature activation+6.00e-02

Pos logit contributions
eware+0.081
 portal+0.081
 traveler+0.079
fire+0.078
 criminal+0.076
Neg logit contributions
ags-0.090
c-0.075
icer-0.072
rell-0.072
agged-0.068
Token person
Token ID1048
Feature activation-1.43e-01

Pos logit contributions
ags+0.114
icer+0.090
opl+0.085
allows+0.084
ag+0.084
Neg logit contributions
eware-0.077
 reality-0.076
 interrupted-0.074
icol-0.072
 awaited-0.072
Token said
Token ID531
Feature activation+4.73e-02

Pos logit contributions
 traveler+0.195
 spies+0.186
 setup+0.182
 detectives+0.180
 portal+0.179
Neg logit contributions
ags-0.317
rell-0.283
opl-0.281
icer-0.275
c-0.271
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
ags+0.091
aces+0.078
rell+0.076
agged+0.074
c+0.073
Neg logit contributions
 traveler-0.081
 portal-0.077
 university-0.076
 troll-0.076
 setup-0.075
Token can
Token ID460
Feature activation+9.93e-03

Pos logit contributions
 traveler+0.025
 spies+0.024
 setup+0.024
 troll+0.024
 portal+0.024
Neg logit contributions
ags-0.032
rell-0.028
opl-0.028
icer-0.027
aters-0.027
Token say
Token ID910
Feature activation+2.22e-02

Pos logit contributions
ags+0.022
opl+0.019
rell+0.019
aters+0.018
icer+0.018
Neg logit contributions
 traveler-0.014
 spies-0.013
 troll-0.013
 portal-0.013
 setup-0.013
Token sorry
Token ID7926
Feature activation-6.15e-02

Pos logit contributions
ags+0.050
opl+0.044
rell+0.044
aces+0.042
aters+0.042
Neg logit contributions
 traveler-0.030
 portal-0.028
 troll-0.027
 setup-0.027
 spies-0.027
Token to
Token ID284
Feature activation-5.93e-02

Pos logit contributions
 traveler+0.085
 portal+0.080
 troll+0.078
 setup+0.078
 spies+0.077
Neg logit contributions
ags-0.136
rell-0.120
opl-0.119
aters-0.118
aces-0.116
Token each
Token ID1123
Feature activation+2.43e-02

Pos logit contributions
 traveler+0.094
 portal+0.093
 setup+0.088
fire+0.087
 troll+0.087
Neg logit contributions
ags-0.120
opl-0.104
rell-0.101
icer-0.100
c-0.097
Token other
Token ID584
Feature activation+1.39e-01

Pos logit contributions
ags+0.049
rell+0.044
opl+0.044
icer+0.043
aters+0.043
Neg logit contributions
 traveler-0.038
 spies-0.037
 setup-0.037
 scientist-0.036
 troll-0.036
Token and
Token ID290
Feature activation+1.15e-01

Pos logit contributions
ags+0.293
opl+0.260
rell+0.260
icer+0.254
aters+0.249
Neg logit contributions
 traveler-0.206
 spies-0.196
 setup-0.193
 detectives-0.190
 troll-0.190
Token give
Token ID1577
Feature activation-7.16e-02

Pos logit contributions
ags+0.233
opl+0.198
rell+0.193
c+0.191
aces+0.188
Neg logit contributions
 traveler-0.174
 troll-0.165
 portal-0.163
eware-0.159
 scientist-0.158
Token each
Token ID1123
Feature activation+2.42e-02

Pos logit contributions
 traveler+0.102
 spies+0.098
 setup+0.096
 detectives+0.094
 troll+0.094
Neg logit contributions
ags-0.154
opl-0.140
rell-0.139
icer-0.136
aters-0.134
Token other
Token ID584
Feature activation+1.39e-01

Pos logit contributions
ags+0.054
opl+0.052
rell+0.052
icer+0.051
aters+0.051
Neg logit contributions
 traveler-0.031
 spies-0.031
 clues-0.031
 troll-0.030
 scientist-0.029
Token a
Token ID257
Feature activation-1.41e-01

Pos logit contributions
ags+0.389
opl+0.372
icer+0.372
rell+0.369
aters+0.358
Neg logit contributions
 spies-0.099
 traveler-0.097
 setup-0.090
 detectives-0.088
 university-0.087
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
 traveler+0.022
 spies+0.021
 setup+0.021
 detectives+0.021
 portal+0.021
Neg logit contributions
ags-0.032
opl-0.029
rell-0.029
aces-0.028
aters-0.028
Token
Token ID220
Feature activation+2.31e-01

Pos logit contributions
 traveler+0.016
 spies+0.016
 setup+0.016
 detectives+0.015
 university+0.015
Neg logit contributions
ags-0.021
opl-0.019
rell-0.018
icer-0.018
aces-0.017
Token
Token ID198
Feature activation-3.57e-03

Pos logit contributions
ags+0.529
opl+0.480
rell+0.476
icer+0.474
aters+0.457
Neg logit contributions
 traveler-0.277
 spies-0.275
 university-0.267
 setup-0.261
 detectives-0.260
Token
Token ID198
Feature activation-3.15e-03

Pos logit contributions
 traveler+0.004
 spies+0.004
 setup+0.004
 university+0.004
 scientist+0.004
Neg logit contributions
ags-0.008
rell-0.008
opl-0.008
icer-0.007
aces-0.007
TokenThey
Token ID2990
Feature activation-8.07e-02

Pos logit contributions
 traveler+0.004
 spies+0.004
 setup+0.004
 portal+0.004
 detectives+0.004
Neg logit contributions
ags-0.007
rell-0.006
opl-0.006
icer-0.006
aters-0.006
Token both
Token ID1111
Feature activation+1.35e-01

Pos logit contributions
 traveler+0.116
 spies+0.109
 setup+0.109
 troll+0.106
 detectives+0.106
Neg logit contributions
ags-0.176
opl-0.157
rell-0.155
icer-0.152
aters-0.150
Token went
Token ID1816
Feature activation-5.29e-02

Pos logit contributions
ags+0.307
opl+0.275
icer+0.270
rell+0.270
aters+0.263
Neg logit contributions
 traveler-0.184
 setup-0.171
 spies-0.168
 troll-0.165
 portal-0.164
Token down
Token ID866
Feature activation+3.07e-02

Pos logit contributions
 traveler+0.078
 setup+0.075
 troll+0.075
 portal+0.074
 spies+0.073
Neg logit contributions
ags-0.114
rell-0.099
opl-0.096
aters-0.094
c-0.093
Token to
Token ID284
Feature activation-7.25e-02

Pos logit contributions
ags+0.060
rell+0.051
c+0.050
opl+0.047
icer+0.047
Neg logit contributions
 traveler-0.050
 portal-0.047
 troll-0.047
 spies-0.046
 setup-0.046
Token the
Token ID262
Feature activation+3.88e-02

Pos logit contributions
 traveler+0.109
 portal+0.101
 troll+0.101
 spies+0.100
eware+0.099
Neg logit contributions
ags-0.154
rell-0.132
c-0.132
opl-0.128
icer-0.128
Token river
Token ID7850
Feature activation-6.54e-02

Pos logit contributions
ags+0.079
icer+0.071
rell+0.071
opl+0.070
c+0.068
Neg logit contributions
 traveler-0.057
 portal-0.055
 spies-0.054
 detectives-0.053
 setup-0.052
Token and
Token ID290
Feature activation+9.96e-02

Pos logit contributions
 traveler+0.070
 spies+0.066
 setup+0.065
 troll+0.064
 portal+0.064
Neg logit contributions
ags-0.114
opl-0.102
rell-0.102
icer-0.099
aters-0.098
Token explore
Token ID7301
Feature activation-6.22e-02

Pos logit contributions
ags+0.207
c+0.189
rell+0.188
opl+0.183
icer+0.174
Neg logit contributions
 troll-0.128
 traveler-0.128
 setup-0.128
 spies-0.128
 criminal-0.124
Token the
Token ID262
Feature activation+4.29e-02

Pos logit contributions
 traveler+0.093
 setup+0.089
 troll+0.086
 spies+0.085
 criminal+0.084
Neg logit contributions
ags-0.132
opl-0.117
rell-0.114
icer-0.114
c-0.113
Token woods
Token ID16479
Feature activation-1.46e-01

Pos logit contributions
ags+0.076
icer+0.070
opl+0.069
rell+0.065
c+0.065
Neg logit contributions
 traveler-0.071
 setup-0.070
 spies-0.069
 criminal-0.067
 troll-0.067
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
 traveler+0.263
 setup+0.250
 troll+0.246
 scientist+0.244
 reporter+0.243
Neg logit contributions
ags-0.236
c-0.219
opl-0.216
aces-0.213
rell-0.207
Token His
Token ID2399
Feature activation+1.73e-01

Pos logit contributions
 traveler+0.015
 university+0.014
 spies+0.014
 Where+0.013
 Wizard+0.013
Neg logit contributions
ags-0.026
opl-0.023
rell-0.022
icer-0.022
aters-0.022
Token mother
Token ID2802
Feature activation-6.44e-02

Pos logit contributions
ags+0.383
opl+0.341
rell+0.340
icer+0.333
aters+0.328
Neg logit contributions
 traveler-0.244
 spies-0.232
 setup-0.228
 troll-0.225
 detectives-0.224
Token was
Token ID373
Feature activation+4.30e-03

Pos logit contributions
 traveler+0.084
 spies+0.080
 setup+0.078
 portal+0.077
 detectives+0.077
Neg logit contributions
ags-0.146
rell-0.132
opl-0.131
icer-0.129
aters-0.127
Token worried
Token ID7960
Feature activation-2.56e-02

Pos logit contributions
ags+0.008
c+0.007
rell+0.007
opl+0.007
icer+0.007
Neg logit contributions
 traveler-0.007
 troll-0.007
 clues-0.007
 portal-0.007
eware-0.006
Token so
Token ID523
Feature activation-1.40e-02

Pos logit contributions
 traveler+0.035
 spies+0.033
 setup+0.033
 troll+0.032
 detectives+0.032
Neg logit contributions
ags-0.057
opl-0.051
rell-0.051
icer-0.050
aters-0.049
Token before
Token ID878
Feature activation-4.19e-03

Pos logit contributions
 traveler+0.019
 portal+0.018
 spies+0.018
 detectives+0.018
 setup+0.018
Neg logit contributions
opl-0.028
ags-0.027
rell-0.027
ittens-0.026
c-0.025
TokenYes
Token ID5297
Feature activation-2.05e-02

Pos logit contributions
 traveler+0.023
 spies+0.022
 setup+0.022
 troll+0.021
 detectives+0.021
Neg logit contributions
ags-0.044
opl-0.039
rell-0.039
icer-0.038
aters-0.038
Token,
Token ID11
Feature activation-8.04e-02

Pos logit contributions
 traveler+0.032
 portal+0.030
 Wizard+0.028
 spies+0.028
fire+0.028
Neg logit contributions
ags-0.042
rell-0.038
icer-0.037
opl-0.036
aters-0.035
Token Sam
Token ID3409
Feature activation-2.43e-02

Pos logit contributions
 traveler+0.098
 portal+0.091
fire+0.089
 Wizard+0.089
 setup+0.089
Neg logit contributions
ags-0.183
opl-0.164
rell-0.164
aces-0.162
aters-0.159
Token.
Token ID13
Feature activation+6.71e-03

Pos logit contributions
 traveler+0.030
 spies+0.028
 setup+0.028
 portal+0.028
 university+0.027
Neg logit contributions
ags-0.057
rell-0.052
opl-0.051
aces-0.050
icer-0.050
Token It
Token ID632
Feature activation+3.74e-02

Pos logit contributions
ags+0.013
rell+0.012
opl+0.012
aters+0.012
icer+0.012
Neg logit contributions
 traveler-0.011
 setup-0.010
 spies-0.010
 troll-0.010
 criminal-0.010
Token's
Token ID338
Feature activation+1.75e-01

Pos logit contributions
ags+0.057
rell+0.048
opl+0.048
icer+0.046
aters+0.045
Neg logit contributions
 traveler-0.077
 spies-0.075
 setup-0.074
 troll-0.073
 detectives-0.073
Token very
Token ID845
Feature activation+2.43e-02

Pos logit contributions
ags+0.386
opl+0.339
rell+0.338
aters+0.330
icer+0.328
Neg logit contributions
 traveler-0.254
 criminal-0.236
 troll-0.233
 scientist-0.232
 spies-0.230
Token pretty
Token ID2495
Feature activation+6.15e-02

Pos logit contributions
ags+0.040
opl+0.034
allows+0.033
rell+0.033
ag+0.032
Neg logit contributions
 traveler-0.044
 spies-0.044
 criminal-0.044
 scientist-0.043
 setup-0.043
Token.
Token ID13
Feature activation+7.30e-03

Pos logit contributions
ags+0.124
aters+0.111
rell+0.110
opl+0.109
c+0.107
Neg logit contributions
 traveler-0.096
 setup-0.092
 criminal-0.092
 spies-0.089
 troll-0.089
Token Do
Token ID2141
Feature activation-3.53e-02

Pos logit contributions
ags+0.015
opl+0.013
rell+0.013
icer+0.013
aters+0.013
Neg logit contributions
 traveler-0.011
 spies-0.010
 setup-0.010
 troll-0.010
 detectives-0.010
Token you
Token ID345
Feature activation-1.91e-02

Pos logit contributions
 traveler+0.038
 spies+0.037
 setup+0.036
 university+0.036
 detectives+0.036
Neg logit contributions
ags-0.088
rell-0.080
opl-0.080
icer-0.078
aters-0.078
Token be
Token ID307
Feature activation-3.98e-02

Pos logit contributions
 traveler+0.072
 setup+0.067
 rescue+0.066
 university+0.064
 gn+0.061
Neg logit contributions
ags-0.160
rell-0.146
aters-0.146
aces-0.145
icer-0.145
Token wealthy
Token ID11574
Feature activation-7.87e-02

Pos logit contributions
 traveler+0.061
 portal+0.058
 spies+0.058
 troll+0.057
 setup+0.057
Neg logit contributions
ags-0.083
opl-0.071
rell-0.071
icer-0.070
c-0.069
Token to
Token ID284
Feature activation-6.26e-02

Pos logit contributions
 portal+0.118
 traveler+0.116
 troll+0.115
 setup+0.111
 criminal+0.108
Neg logit contributions
ags-0.156
aters-0.137
aces-0.137
opl-0.136
icer-0.136
Token buy
Token ID2822
Feature activation-2.67e-02

Pos logit contributions
 portal+0.088
 traveler+0.087
 spies+0.087
 troll+0.086
 scientist+0.085
Neg logit contributions
ags-0.138
opl-0.121
rell-0.117
icer-0.111
ag-0.111
Token your
Token ID534
Feature activation+6.83e-02

Pos logit contributions
 traveler+0.029
 spies+0.028
 setup+0.027
 detectives+0.027
 university+0.026
Neg logit contributions
ags-0.066
opl-0.060
rell-0.059
aters-0.058
icer-0.058
Token favorite
Token ID4004
Feature activation+1.51e-01

Pos logit contributions
ags+0.146
rell+0.137
opl+0.132
aters+0.129
agged+0.128
Neg logit contributions
 traveler-0.097
 rescue-0.096
 setup-0.094
 spies-0.092
 clues-0.090
Token ice
Token ID4771
Feature activation+6.22e-02

Pos logit contributions
ags+0.348
agged+0.308
opl+0.301
rell+0.300
aces+0.299
Neg logit contributions
 traveler-0.199
 rescue-0.187
 clues-0.186
 that-0.185
Who-0.181
Token cream
Token ID8566
Feature activation-3.20e-03

Pos logit contributions
ags+0.137
opl+0.124
rell+0.121
aters+0.118
agged+0.116
Neg logit contributions
 traveler-0.085
 ranger-0.084
 that-0.083
 worker-0.082
 spies-0.078
Token."
Token ID526
Feature activation+3.54e-02

Pos logit contributions
 traveler+0.004
 spies+0.004
 setup+0.004
 detectives+0.004
 Wizard+0.004
Neg logit contributions
ags-0.007
opl-0.007
rell-0.007
aters-0.006
agged-0.006
Token
Token ID198
Feature activation+6.62e-03

Pos logit contributions
ags+0.091
opl+0.085
icer+0.083
at+0.082
aces+0.081
Neg logit contributions
 spies-0.030
 ranger-0.030
 university-0.029
 that-0.029
 scientist-0.029
Token
Token ID198
Feature activation+6.87e-03

Pos logit contributions
ags+0.016
opl+0.014
rell+0.014
icer+0.014
aters+0.014
Neg logit contributions
 traveler-0.008
 spies-0.008
 portal-0.007
 university-0.007
 setup-0.007
Token can
Token ID460
Feature activation+5.89e-03

Pos logit contributions
 traveler+0.071
 spies+0.067
 setup+0.066
 troll+0.066
 detectives+0.065
Neg logit contributions
ags-0.094
opl-0.083
rell-0.083
icer-0.081
aters-0.080
Token squeeze
Token ID21229
Feature activation+5.91e-02

Pos logit contributions
ags+0.013
rell+0.011
opl+0.011
aters+0.011
icer+0.011
Neg logit contributions
 traveler-0.009
 setup-0.008
 troll-0.008
 spies-0.008
 portal-0.008
Token the
Token ID262
Feature activation+5.13e-02

Pos logit contributions
ags+0.155
opl+0.143
rell+0.140
agged+0.138
aters+0.136
Neg logit contributions
 spies-0.055
 traveler-0.053
 detectives-0.050
 university-0.050
 setup-0.050
Token car
Token ID1097
Feature activation+8.56e-03

Pos logit contributions
ags+0.112
opl+0.101
at+0.101
rell+0.099
agged+0.099
Neg logit contributions
 spies-0.071
 traveler-0.069
 police-0.068
 security-0.067
 clues-0.067
Token the
Token ID262
Feature activation+5.15e-02

Pos logit contributions
ags+0.020
opl+0.018
rell+0.017
icer+0.017
agged+0.017
Neg logit contributions
 traveler-0.011
 spies-0.010
 setup-0.010
 university-0.010
 detectives-0.010
Token most
Token ID749
Feature activation+2.58e-02

Pos logit contributions
ags+0.107
opl+0.095
rell+0.094
agged+0.092
aters+0.091
Neg logit contributions
 traveler-0.076
 spies-0.076
 setup-0.074
 university-0.073
 portal-0.072
Token".
Token ID1911
Feature activation-4.99e-02

Pos logit contributions
ags+0.058
opl+0.053
rell+0.052
icer+0.050
aters+0.050
Neg logit contributions
 traveler-0.035
 spies-0.032
 setup-0.032
 troll-0.032
 portal-0.032
Token Bob
Token ID5811
Feature activation+3.52e-02

Pos logit contributions
 traveler+0.051
 university+0.049
 spies+0.048
 Wizard+0.047
 rescue+0.047
Neg logit contributions
ags-0.126
opl-0.114
wy-0.112
icer-0.112
agged-0.109
Token liked
Token ID8288
Feature activation+7.27e-02

Pos logit contributions
ags+0.087
opl+0.078
rell+0.075
icer+0.075
aters+0.073
Neg logit contributions
 traveler-0.044
 setup-0.041
 Wizard-0.040
 troll-0.040
 scientist-0.040
Token the
Token ID262
Feature activation+5.16e-02

Pos logit contributions
ags+0.171
opl+0.155
rell+0.155
icer+0.151
aters+0.148
Neg logit contributions
 traveler-0.090
 portal-0.083
 spies-0.082
 troll-0.081
 setup-0.080
Token idea
Token ID2126
Feature activation-9.39e-03

Pos logit contributions
ags+0.113
icer+0.102
at+0.101
aters+0.100
akes+0.099
Neg logit contributions
 traveler-0.070
 university-0.070
 spies-0.068
 security-0.068
 setup-0.067
Token to
Token ID284
Feature activation-6.25e-02

Pos logit contributions
ags+0.038
rell+0.033
ag+0.032
c+0.032
aces+0.031
Neg logit contributions
 traveler-0.024
 portal-0.023
fire-0.020
 troll-0.020
 Wizard-0.020
Token say
Token ID910
Feature activation+2.18e-02

Pos logit contributions
 traveler+0.108
 troll+0.104
 setup+0.103
 spies+0.102
 detectives+0.100
Neg logit contributions
ags-0.121
rell-0.100
opl-0.099
ag-0.095
aters-0.093
Token sorry
Token ID7926
Feature activation-6.28e-02

Pos logit contributions
ags+0.047
rell+0.041
opl+0.041
aces+0.040
c+0.039
Neg logit contributions
 traveler-0.031
 setup-0.030
 portal-0.029
 troll-0.029
 spies-0.029
Token again
Token ID757
Feature activation-3.51e-02

Pos logit contributions
 traveler+0.085
 portal+0.079
 setup+0.078
 troll+0.076
 spies+0.076
Neg logit contributions
ags-0.143
rell-0.127
opl-0.124
aters-0.123
aces-0.122
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
 traveler+0.062
 portal+0.058
 setup+0.057
 troll+0.057
 detectives+0.057
Neg logit contributions
ags-0.061
opl-0.054
c-0.054
aters-0.053
aces-0.052
Token He
Token ID679
Feature activation-2.19e-03

Pos logit contributions
ags+0.027
rell+0.024
opl+0.024
icer+0.023
aters+0.023
Neg logit contributions
 traveler-0.024
 spies-0.023
 detectives-0.023
 setup-0.023
 troll-0.023
Token wanted
Token ID2227
Feature activation+1.75e-02

Pos logit contributions
 traveler+0.003
 university+0.003
 spies+0.003
 portal+0.003
 detectives+0.003
Neg logit contributions
ags-0.004
rell-0.004
c-0.004
icer-0.004
opl-0.004
Token to
Token ID284
Feature activation-6.22e-02

Pos logit contributions
ags+0.040
rell+0.036
c+0.034
ag+0.034
opl+0.034
Neg logit contributions
 traveler-0.023
 portal-0.022
 setup-0.020
 troll-0.020
 Wizard-0.020
Token be
Token ID307
Feature activation-3.77e-02

Pos logit contributions
 traveler+0.109
 setup+0.103
 spies+0.103
 troll+0.103
 detectives+0.101
Neg logit contributions
ags-0.119
opl-0.100
rell-0.100
ag-0.094
aters-0.092
Token friends
Token ID2460
Feature activation-1.10e-03

Pos logit contributions
 traveler+0.060
 portal+0.058
 setup+0.058
 troll+0.057
 criminal+0.055
Neg logit contributions
ags-0.080
opl-0.064
rell-0.063
c-0.063
aces-0.063
Token with
Token ID351
Feature activation+3.24e-02

Pos logit contributions
 traveler+0.001
 spies+0.001
 setup+0.001
 troll+0.001
 detectives+0.001
Neg logit contributions
ags-0.003
opl-0.002
rell-0.002
icer-0.002
aters-0.002
Token saw
Token ID2497
Feature activation-9.09e-02

Pos logit contributions
 traveler+0.089
 spies+0.086
 setup+0.084
 portal+0.084
 university+0.083
Neg logit contributions
ags-0.144
opl-0.131
rell-0.129
c-0.127
icer-0.127
Token in
Token ID287
Feature activation+2.84e-02

Pos logit contributions
 traveler+0.120
 spies+0.114
 setup+0.112
 detectives+0.110
 troll+0.110
Neg logit contributions
ags-0.205
opl-0.185
rell-0.184
icer-0.180
aters-0.177
Token person
Token ID1048
Feature activation-1.45e-01

Pos logit contributions
ags+0.062
opl+0.058
icer+0.056
rell+0.055
aters+0.054
Neg logit contributions
 traveler-0.038
 detectives-0.036
 spies-0.035
eware-0.035
 setup-0.035
Token!
Token ID0
Feature activation-7.84e-02

Pos logit contributions
 traveler+0.231
 spies+0.221
 setup+0.217
 detectives+0.215
 troll+0.214
Neg logit contributions
ags-0.291
opl-0.255
rell-0.255
icer-0.249
aters-0.243
Token It
Token ID632
Feature activation+3.77e-02

Pos logit contributions
 traveler+0.117
 spies+0.113
 scientist+0.111
 troll+0.111
 university+0.111
Neg logit contributions
ags-0.160
opl-0.145
rell-0.142
aces-0.137
icer-0.137
Token was
Token ID373
Feature activation+1.32e-02

Pos logit contributions
ags+0.073
rell+0.068
opl+0.067
c+0.065
icer+0.064
Neg logit contributions
 traveler-0.058
 spies-0.057
 detectives-0.055
 university-0.055
 setup-0.054
Token an
Token ID281
Feature activation-8.35e-02

Pos logit contributions
ags+0.029
opl+0.026
rell+0.026
icer+0.025
aters+0.025
Neg logit contributions
 traveler-0.018
 spies-0.017
 portal-0.017
 detectives-0.017
 troll-0.016
Token incredible
Token ID8082
Feature activation-1.82e-02

Pos logit contributions
 traveler+0.106
 spies+0.097
 troll+0.095
 university+0.095
 criminal+0.094
Neg logit contributions
ags-0.194
icer-0.174
aters-0.173
akes-0.172
rell-0.171
Token experience
Token ID1998
Feature activation-8.08e-02

Pos logit contributions
 traveler+0.025
 spies+0.024
 setup+0.024
 detectives+0.024
 troll+0.023
Neg logit contributions
ags-0.040
opl-0.036
rell-0.036
icer-0.035
aters-0.034
Token.
Token ID13
Feature activation+3.29e-03

Pos logit contributions
 traveler+0.113
 spies+0.109
 troll+0.108
 detectives+0.107
 setup+0.107
Neg logit contributions
ags-0.167
opl-0.154
rell-0.154
aces-0.149
aters-0.147
Token<|endoftext|>
Token ID50256
Feature activation+1.07e-01

Pos logit contributions
ags+0.007
opl+0.006
rell+0.006
icer+0.006
aces+0.006
Neg logit contributions
 traveler-0.005
 spies-0.005
 scientist-0.005
 troll-0.005
Where-0.005
Token the
Token ID262
Feature activation+5.26e-02

Pos logit contributions
 traveler+0.077
 spies+0.074
 setup+0.072
 detectives+0.071
 university+0.070
Neg logit contributions
ags-0.153
opl-0.140
rell-0.139
icer-0.134
aters-0.134
Token hot
Token ID3024
Feature activation+8.72e-02

Pos logit contributions
ags+0.122
aces+0.110
opl+0.106
agged+0.106
aters+0.105
Neg logit contributions
 traveler-0.071
 setup-0.069
 scientist-0.067
 troll-0.066
 spies-0.066
Tokendogs
Token ID22242
Feature activation+7.89e-02

Pos logit contributions
ags+0.166
aces+0.146
opl+0.139
rell+0.137
agged+0.137
Neg logit contributions
 traveler-0.145
 scientist-0.139
 spies-0.138
 troll-0.137
fire-0.137
Token.
Token ID13
Feature activation-1.64e-03

Pos logit contributions
ags+0.181
opl+0.161
rell+0.158
icer+0.157
aters+0.153
Neg logit contributions
 traveler-0.104
 spies-0.100
 setup-0.098
 scientist-0.096
 detectives-0.095
Token Sarah
Token ID10490
Feature activation-3.90e-02

Pos logit contributions
 traveler+0.002
 spies+0.002
 setup+0.002
 troll+0.002
 detectives+0.002
Neg logit contributions
ags-0.004
rell-0.003
opl-0.003
icer-0.003
aters-0.003
Token was
Token ID373
Feature activation+9.63e-03

Pos logit contributions
 traveler+0.050
 setup+0.047
 scientist+0.046
 troll+0.046
 spies+0.045
Neg logit contributions
ags-0.096
opl-0.083
rell-0.081
aters-0.079
agged-0.079
Token so
Token ID523
Feature activation-5.70e-03

Pos logit contributions
ags+0.021
opl+0.019
rell+0.019
icer+0.019
aters+0.018
Neg logit contributions
 traveler-0.013
 spies-0.013
 setup-0.013
 detectives-0.012
 troll-0.012
Token excited
Token ID6568
Feature activation+9.22e-02

Pos logit contributions
 traveler+0.008
 setup+0.007
 spies+0.007
 detectives+0.007
 university+0.007
Neg logit contributions
ags-0.013
opl-0.012
rell-0.012
aters-0.012
icer-0.012
Token,
Token ID11
Feature activation-8.27e-02

Pos logit contributions
ags+0.173
rell+0.169
opl+0.159
ash+0.154
aters+0.153
Neg logit contributions
 traveler-0.145
 portal-0.138
 spies-0.135
 detectives-0.133
 troll-0.133
Token she
Token ID673
Feature activation-6.81e-02

Pos logit contributions
 traveler+0.069
 spies+0.063
 setup+0.060
 portal+0.060
 troll+0.059
Neg logit contributions
ags-0.228
rell-0.209
opl-0.209
icer-0.205
aters-0.203
Token couldn
Token ID3521
Feature activation+8.39e-03

Pos logit contributions
 traveler+0.094
 spies+0.089
 setup+0.087
 portal+0.087
 troll+0.086
Neg logit contributions
ags-0.149
rell-0.134
opl-0.133
icer-0.130
aters-0.128
Token them
Token ID606
Feature activation+1.21e-01

Pos logit contributions
 traveler+0.129
 spies+0.122
 setup+0.120
 troll+0.119
 portal+0.118
Neg logit contributions
ags-0.188
rell-0.166
opl-0.166
icer-0.162
aters-0.160
Token up
Token ID510
Feature activation+1.04e-02

Pos logit contributions
ags+0.245
opl+0.219
icer+0.216
rell+0.214
aters+0.208
Neg logit contributions
 traveler-0.189
 spies-0.183
 setup-0.181
 portal-0.176
 detectives-0.176
Token into
Token ID656
Feature activation-4.42e-02

Pos logit contributions
ags+0.022
rell+0.019
opl+0.019
c+0.018
aters+0.018
Neg logit contributions
 traveler-0.016
 spies-0.015
 troll-0.015
 detectives-0.015
 portal-0.014
Token the
Token ID262
Feature activation+4.43e-02

Pos logit contributions
 traveler+0.072
eware+0.069
 detectives+0.066
 criminal+0.065
 Wizard+0.065
Neg logit contributions
ags-0.085
icer-0.073
opl-0.070
c-0.069
rell-0.069
Token air
Token ID1633
Feature activation+8.81e-03

Pos logit contributions
ags+0.093
opl+0.082
rell+0.082
icer+0.080
aters+0.078
Neg logit contributions
 traveler-0.066
 spies-0.063
 setup-0.062
 detectives-0.062
 troll-0.061
Token.
Token ID13
Feature activation-1.05e-03

Pos logit contributions
ags+0.019
rell+0.017
opl+0.017
icer+0.017
aters+0.016
Neg logit contributions
 traveler-0.012
 spies-0.012
 setup-0.012
 detectives-0.011
 troll-0.011
Token Pat
Token ID3208
Feature activation-6.65e-02

Pos logit contributions
 traveler+0.002
 setup+0.002
 spies+0.002
 detectives+0.002
 portal+0.002
Neg logit contributions
ags-0.002
rell-0.002
opl-0.002
icer-0.002
aters-0.002
Token said
Token ID531
Feature activation+3.86e-02

Pos logit contributions
 traveler+0.098
 spies+0.094
 setup+0.092
 detectives+0.091
 troll+0.091
Neg logit contributions
ags-0.139
rell-0.124
opl-0.124
icer-0.121
aters-0.119
Token,
Token ID11
Feature activation-8.84e-02

Pos logit contributions
ags+0.078
aces+0.072
rell+0.069
ag+0.067
agged+0.066
Neg logit contributions
 traveler-0.054
 portal-0.052
 setup-0.050
 troll-0.050
 spies-0.049
Token "
Token ID366
Feature activation-2.23e-02

Pos logit contributions
 traveler+0.103
 spies+0.097
 setup+0.095
 detectives+0.094
 portal+0.093
Neg logit contributions
ags-0.213
rell-0.192
opl-0.192
icer-0.187
aters-0.185
TokenLet
Token ID5756
Feature activation+4.56e-03

Pos logit contributions
 traveler+0.028
 spies+0.026
 setup+0.026
 portal+0.025
 detectives+0.025
Neg logit contributions
ags-0.053
rell-0.047
opl-0.047
icer-0.046
aters-0.045
Token cried
Token ID16896
Feature activation+4.35e-02

Pos logit contributions
ags+0.308
rell+0.277
opl+0.269
icer+0.266
aces+0.263
Neg logit contributions
 traveler-0.227
 spies-0.217
 detectives-0.212
 setup-0.211
 portal-0.211
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
ags+0.087
rell+0.074
aters+0.071
aces+0.070
ag+0.069
Neg logit contributions
 traveler-0.075
 portal-0.069
 troll-0.069
 setup-0.068
 scientist-0.066
Token
Token ID198
Feature activation+1.21e-02

Pos logit contributions
ags+0.021
rell+0.017
agged+0.016
icer+0.016
aces+0.016
Neg logit contributions
 traveler-0.023
 setup-0.021
 reality-0.021
 troll-0.021
 portal-0.021
Token
Token ID198
Feature activation+1.20e-02

Pos logit contributions
ags+0.021
opl+0.018
rell+0.017
icer+0.017
aters+0.017
Neg logit contributions
 traveler-0.023
 setup-0.022
 troll-0.022
 detectives-0.021
 Wizard-0.021
TokenMom
Token ID29252
Feature activation+3.73e-02

Pos logit contributions
ags+0.020
icer+0.015
agged+0.015
c+0.015
ittens+0.014
Neg logit contributions
 traveler-0.023
 troll-0.023
 reality-0.022
 Wizard-0.021
 goblin-0.021
Token came
Token ID1625
Feature activation-6.17e-02

Pos logit contributions
ags+0.098
rell+0.089
opl+0.088
icer+0.087
aters+0.086
Neg logit contributions
 traveler-0.037
 spies-0.034
 detectives-0.032
 setup-0.032
 portal-0.032
Token to
Token ID284
Feature activation-6.11e-02

Pos logit contributions
 traveler+0.100
 troll+0.095
 portal+0.094
 criminal+0.091
 sorcerer+0.090
Neg logit contributions
ags-0.127
rell-0.104
aters-0.103
icer-0.102
c-0.098
Token see
Token ID766
Feature activation-1.52e-01

Pos logit contributions
 traveler+0.091
 portal+0.090
eware+0.089
 troll+0.087
 criminal+0.084
Neg logit contributions
ags-0.130
icer-0.100
rell-0.098
ag-0.095
c-0.094
Token what
Token ID644
Feature activation-1.40e-01

Pos logit contributions
 traveler+0.221
 portal+0.204
 troll+0.200
 criminal+0.200
 Wizard+0.193
Neg logit contributions
ags-0.347
rell-0.290
icer-0.289
opl-0.286
aters-0.282
Token happened
Token ID3022
Feature activation-1.42e-01

Pos logit contributions
 traveler+0.236
 Wizard+0.211
 portal+0.208
 troll+0.207
 detectives+0.204
Neg logit contributions
ags-0.254
icer-0.223
rell-0.223
c-0.222
aces-0.218
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
 traveler+0.223
 spies+0.206
 troll+0.205
 portal+0.205
 setup+0.204
Neg logit contributions
ags-0.288
opl-0.254
rell-0.251
icer-0.246
aces-0.242
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
ags+0.024
rell+0.022
opl+0.021
icer+0.021
aters+0.021
Neg logit contributions
 traveler-0.015
 spies-0.014
 setup-0.014
 troll-0.014
 portal-0.014
TokenThey
Token ID2990
Feature activation-7.33e-02

Pos logit contributions
ags+0.025
opl+0.023
icer+0.022
rell+0.022
aters+0.022
Neg logit contributions
 traveler-0.015
 spies-0.014
 troll-0.014
 setup-0.014
 portal-0.014
Token ran
Token ID4966
Feature activation-3.71e-02

Pos logit contributions
 traveler+0.109
 spies+0.104
 troll+0.101
 setup+0.101
 detectives+0.101
Neg logit contributions
ags-0.153
rell-0.136
opl-0.135
icer-0.133
aters-0.130
Token to
Token ID284
Feature activation-6.52e-02

Pos logit contributions
 traveler+0.056
 troll+0.054
 setup+0.051
 scientist+0.050
 reality+0.050
Neg logit contributions
ags-0.080
rell-0.071
opl-0.066
aces-0.065
aters-0.065
Token the
Token ID262
Feature activation+5.00e-02

Pos logit contributions
 traveler+0.100
 portal+0.093
 troll+0.093
eware+0.090
 Wizard+0.088
Neg logit contributions
ags-0.143
rell-0.117
opl-0.116
icer-0.114
aters-0.112
Token park
Token ID3952
Feature activation-3.21e-02

Pos logit contributions
ags+0.090
opl+0.076
icer+0.075
aters+0.072
rell+0.072
Neg logit contributions
 traveler-0.086
eware-0.084
 troll-0.082
 Wizard-0.080
 portal-0.080
Token with
Token ID351
Feature activation+2.75e-02

Pos logit contributions
 traveler+0.050
 troll+0.047
 spies+0.047
 setup+0.047
 detectives+0.047
Neg logit contributions
ags-0.064
opl-0.056
rell-0.056
c-0.055
aters-0.053
Token their
Token ID511
Feature activation+6.85e-02

Pos logit contributions
ags+0.063
opl+0.056
rell+0.056
icer+0.055
aters+0.054
Neg logit contributions
 traveler-0.036
 spies-0.034
 setup-0.034
 troll-0.033
 portal-0.033
Token k
Token ID479
Feature activation+5.01e-02

Pos logit contributions
ags+0.164
rell+0.153
opl+0.151
icer+0.148
ash+0.146
Neg logit contributions
 spies-0.080
 setup-0.079
 traveler-0.079
 university-0.075
 clues-0.074
Tokenite
Token ID578
Feature activation+8.84e-04

Pos logit contributions
ags+0.075
rell+0.064
opl+0.064
aces+0.058
icer+0.057
Neg logit contributions
 traveler-0.101
 spies-0.101
 detectives-0.097
 setup-0.096
 portal-0.096
Token.
Token ID13
Feature activation+9.10e-03

Pos logit contributions
ags+0.002
icer+0.002
opl+0.002
rell+0.002
aters+0.002
Neg logit contributions
 traveler-0.001
 spies-0.001
 portal-0.001
 setup-0.001
 university-0.001
Token,
Token ID11
Feature activation-8.45e-02

Pos logit contributions
 traveler+0.076
 setup+0.072
 troll+0.070
 portal+0.069
 scientist+0.069
Neg logit contributions
ags-0.094
opl-0.084
c-0.084
rell-0.082
icer-0.080
Token a
Token ID257
Feature activation-1.44e-01

Pos logit contributions
 traveler+0.085
 setup+0.079
 portal+0.077
 spies+0.075
 troll+0.075
Neg logit contributions
ags-0.218
rell-0.195
opl-0.194
icer-0.191
c-0.190
Token cat
Token ID3797
Feature activation+7.38e-03

Pos logit contributions
 traveler+0.204
 detectives+0.201
 setup+0.200
 spies+0.197
 Wizard+0.196
Neg logit contributions
ags-0.276
opl-0.255
c-0.253
icer-0.251
rell-0.249
Token,
Token ID11
Feature activation-8.07e-02

Pos logit contributions
ags+0.016
opl+0.015
c+0.015
rell+0.014
aces+0.014
Neg logit contributions
 traveler-0.010
 setup-0.010
 reality-0.010
 clues-0.009
 detectives-0.009
Token a
Token ID257
Feature activation-1.42e-01

Pos logit contributions
 traveler+0.104
 setup+0.098
 portal+0.096
 spies+0.095
 troll+0.094
Neg logit contributions
ags-0.185
rell-0.165
opl-0.164
aters-0.160
c-0.159
Token butterfly
Token ID35113
Feature activation-7.29e-02

Pos logit contributions
 traveler+0.208
 setup+0.202
 detectives+0.199
 spies+0.198
 portal+0.196
Neg logit contributions
ags-0.283
opl-0.257
rell-0.253
c-0.252
icer-0.250
Token and
Token ID290
Feature activation+1.16e-01

Pos logit contributions
 traveler+0.093
 spies+0.087
 setup+0.087
 detectives+0.085
 troll+0.084
Neg logit contributions
ags-0.168
opl-0.152
rell-0.152
icer-0.147
c-0.146
Token a
Token ID257
Feature activation-1.43e-01

Pos logit contributions
ags+0.213
aces+0.206
c+0.203
 pads+0.186
rell+0.183
Neg logit contributions
 setup-0.168
 criminal-0.162
 portal-0.162
fire-0.158
 traveler-0.155
Token flower
Token ID15061
Feature activation+5.53e-02

Pos logit contributions
 traveler+0.210
 setup+0.204
 spies+0.201
 detectives+0.199
 portal+0.197
Neg logit contributions
ags-0.285
opl-0.255
rell-0.250
c-0.248
aters-0.244
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
ags+0.118
opl+0.105
rell+0.105
aters+0.102
aces+0.102
Neg logit contributions
 traveler-0.079
 setup-0.075
 spies-0.074
 detectives-0.073
 portal-0.073
Token They
Token ID1119
Feature activation-1.04e-02

Pos logit contributions
ags+0.021
rell+0.018
opl+0.018
icer+0.017
aters+0.017
Neg logit contributions
 traveler-0.020
 setup-0.019
 spies-0.019
 portal-0.019
 troll-0.019
Token cup
Token ID6508
Feature activation+1.55e-01

Pos logit contributions
ags+0.088
opl+0.076
rell+0.076
icer+0.075
aters+0.073
Neg logit contributions
 traveler-0.082
 spies-0.079
 setup-0.078
 detectives-0.077
 portal-0.076
Tokenboard
Token ID3526
Feature activation+1.58e-02

Pos logit contributions
ags+0.266
rell+0.227
opl+0.227
icer+0.222
aters+0.217
Neg logit contributions
 traveler-0.291
 spies-0.281
 troll-0.276
 setup-0.275
 university-0.273
Token.
Token ID13
Feature activation-8.92e-03

Pos logit contributions
ags+0.034
opl+0.031
rell+0.031
icer+0.030
aces+0.029
Neg logit contributions
 traveler-0.022
 spies-0.021
 setup-0.021
 troll-0.020
 detectives-0.020
Token The
Token ID383
Feature activation+7.54e-02

Pos logit contributions
 traveler+0.014
 spies+0.014
 setup+0.014
 portal+0.013
 detectives+0.013
Neg logit contributions
ags-0.018
rell-0.016
opl-0.016
icer-0.015
aters-0.015
Token little
Token ID1310
Feature activation-7.91e-02

Pos logit contributions
ags+0.175
icer+0.166
opl+0.162
rell+0.161
c+0.160
Neg logit contributions
 traveler-0.084
 spies-0.080
 portal-0.079
 detectives-0.077
 university-0.076
Token mouse
Token ID10211
Feature activation-1.06e-01

Pos logit contributions
 traveler+0.085
 spies+0.081
 portal+0.080
 university+0.079
 setup+0.078
Neg logit contributions
ags-0.189
icer-0.177
c-0.177
rell-0.176
aters-0.171
Token nervously
Token ID43324
Feature activation-5.27e-02

Pos logit contributions
 traveler+0.145
 spies+0.138
 setup+0.135
 detectives+0.133
 portal+0.133
Neg logit contributions
ags-0.234
rell-0.210
opl-0.209
icer-0.205
aters-0.201
Token replied
Token ID8712
Feature activation-6.04e-03

Pos logit contributions
 spies+0.088
 university+0.085
 traveler+0.084
 detectives+0.084
 portal+0.083
Neg logit contributions
c-0.089
ags-0.088
rell-0.086
at-0.080
et-0.080
Token,
Token ID11
Feature activation-9.17e-02

Pos logit contributions
 traveler+0.008
 portal+0.008
 spies+0.008
 setup+0.007
 detectives+0.007
Neg logit contributions
ags-0.013
rell-0.012
aces-0.012
opl-0.012
c-0.012
Token "
Token ID366
Feature activation-2.01e-02

Pos logit contributions
 traveler+0.106
 spies+0.099
 setup+0.097
 portal+0.096
 detectives+0.095
Neg logit contributions
ags-0.223
rell-0.201
opl-0.201
icer-0.196
aters-0.194
TokenI
Token ID40
Feature activation-1.10e-01

Pos logit contributions
Where+0.019
 traveler+0.019
Who+0.019
 setup+0.017
 troll+0.017
Neg logit contributions
ags-0.056
rell-0.050
opl-0.049
aces-0.048
aters-0.047
Token was
Token ID373
Feature activation+6.05e-03

Pos logit contributions
 traveler+0.059
 spies+0.055
 troll+0.054
 setup+0.054
 portal+0.054
Neg logit contributions
ags-0.136
opl-0.119
rell-0.119
aters-0.116
aces-0.115
Token a
Token ID257
Feature activation-1.53e-01

Pos logit contributions
ags+0.019
opl+0.018
rell+0.018
icer+0.017
aters+0.017
Neg logit contributions
 spies-0.003
 traveler-0.003
 detectives-0.002
 university-0.002
 explorers-0.002
Token girl
Token ID2576
Feature activation-1.30e-01

Pos logit contributions
 clues+0.209
eware+0.208
 portal+0.208
 setup+0.207
 spies+0.207
Neg logit contributions
ags-0.288
icer-0.280
c-0.268
opl-0.256
ittens-0.253
Token called
Token ID1444
Feature activation-8.04e-02

Pos logit contributions
 traveler+0.167
 setup+0.163
 spies+0.159
 troll+0.153
 detectives+0.153
Neg logit contributions
ags-0.291
c-0.272
opl-0.266
rell-0.258
icer-0.258
Token M
Token ID337
Feature activation+1.45e-01

Pos logit contributions
 traveler+0.134
 setup+0.130
 security+0.124
 criminal+0.124
 reality+0.123
Neg logit contributions
ags-0.146
rell-0.141
aces-0.133
c-0.132
aters-0.128
Tokenilly
Token ID6548
Feature activation-6.43e-02

Pos logit contributions
ags+0.295
rell+0.269
opl+0.266
agged+0.247
aces+0.247
Neg logit contributions
 traveler-0.206
 setup-0.206
 university-0.204
 scientist-0.204
 troll-0.202
Token.
Token ID13
Feature activation-9.11e-03

Pos logit contributions
 traveler+0.081
 spies+0.076
 setup+0.074
 troll+0.073
 scientist+0.073
Neg logit contributions
ags-0.152
opl-0.136
rell-0.136
icer-0.133
aters-0.130
Token M
Token ID337
Feature activation+1.46e-01

Pos logit contributions
 traveler+0.010
 spies+0.010
 scientist+0.010
 detectives+0.010
 university+0.010
Neg logit contributions
ags-0.022
opl-0.020
rell-0.019
aces-0.018
icer-0.018
Tokenilly
Token ID6548
Feature activation-6.30e-02

Pos logit contributions
ags+0.338
rell+0.308
opl+0.302
c+0.288
aces+0.287
Neg logit contributions
 traveler-0.172
 scientist-0.169
 setup-0.166
 troll-0.165
 spies-0.164
Token was
Token ID373
Feature activation+5.12e-03

Pos logit contributions
 traveler+0.106
 spies+0.101
 setup+0.100
 troll+0.099
 detectives+0.099
Neg logit contributions
ags-0.121
opl-0.105
rell-0.105
icer-0.102
aters-0.100
Token 3
Token ID513
Feature activation-1.46e-01

Pos logit contributions
ags+0.011
rell+0.010
opl+0.010
icer+0.010
c+0.010
Neg logit contributions
 traveler-0.007
 portal-0.006
 setup-0.006
 troll-0.006
 spies-0.006
Token and
Token ID290
Feature activation+1.11e-01

Pos logit contributions
ags+0.008
opl+0.008
rell+0.007
icer+0.007
aters+0.007
Neg logit contributions
 traveler-0.004
 spies-0.004
 portal-0.004
 setup-0.004
 university-0.004
Token all
Token ID477
Feature activation+8.19e-02

Pos logit contributions
ags+0.268
opl+0.242
rell+0.242
icer+0.236
aters+0.233
Neg logit contributions
 traveler-0.130
 spies-0.122
 setup-0.119
 portal-0.117
 troll-0.117
Token the
Token ID262
Feature activation+5.80e-02

Pos logit contributions
ags+0.229
icer+0.211
aters+0.207
ittens+0.206
agged+0.203
Neg logit contributions
 that-0.075
 traveler-0.065
 they-0.063
 setup-0.061
 university-0.061
Token other
Token ID584
Feature activation+1.38e-01

Pos logit contributions
ags+0.116
opl+0.103
rell+0.102
icer+0.100
aters+0.099
Neg logit contributions
 traveler-0.093
 spies-0.089
 setup-0.088
 university-0.087
 troll-0.087
Token things
Token ID1243
Feature activation+6.14e-03

Pos logit contributions
ags+0.296
opl+0.261
rell+0.260
icer+0.256
aters+0.250
Neg logit contributions
 traveler-0.203
 spies-0.194
 university-0.189
 setup-0.188
 troll-0.187
Token that
Token ID326
Feature activation-1.85e-01

Pos logit contributions
ags+0.016
opl+0.015
rell+0.015
icer+0.015
aters+0.014
Neg logit contributions
 traveler-0.006
 spies-0.005
 setup-0.005
 detectives-0.005
 troll-0.005
Token make
Token ID787
Feature activation+1.10e-02

Pos logit contributions
 traveler+0.220
 spies+0.206
 university+0.199
 troll+0.199
 detectives+0.197
Neg logit contributions
ags-0.460
rell-0.404
opl-0.404
aters-0.402
icer-0.398
Token up
Token ID510
Feature activation+1.61e-02

Pos logit contributions
ags+0.025
rell+0.022
opl+0.022
icer+0.022
aters+0.021
Neg logit contributions
 traveler-0.015
 spies-0.014
 setup-0.014
 detectives-0.014
 troll-0.014
Token a
Token ID257
Feature activation-1.44e-01

Pos logit contributions
ags+0.032
opl+0.030
rell+0.028
c+0.028
icer+0.028
Neg logit contributions
 traveler-0.025
 troll-0.023
 portal-0.023
 spies-0.023
 scientist-0.023
Token landscape
Token ID10747
Feature activation-2.56e-02

Pos logit contributions
 spies+0.225
 traveler+0.223
 detectives+0.219
 criminal+0.214
eware+0.214
Neg logit contributions
ags-0.250
icer-0.237
c-0.234
opl-0.234
rell-0.230
Token.
Token ID13
Feature activation+4.58e-03

Pos logit contributions
 traveler+0.039
 spies+0.037
 setup+0.036
 troll+0.036
 detectives+0.036
Neg logit contributions
ags-0.052
rell-0.048
opl-0.048
c-0.046
aters-0.046
Token do
Token ID466
Feature activation-6.46e-02

Pos logit contributions
 traveler+0.025
 spies+0.023
 setup+0.023
 university+0.022
 portal+0.022
Neg logit contributions
ags-0.048
rell-0.043
opl-0.043
icer-0.042
aters-0.042
Token not
Token ID407
Feature activation-5.47e-02

Pos logit contributions
 traveler+0.066
 spies+0.060
 troll+0.059
 detectives+0.058
 setup+0.057
Neg logit contributions
ags-0.168
opl-0.152
rell-0.151
icer-0.148
c-0.145
Token hear
Token ID3285
Feature activation-1.53e-01

Pos logit contributions
 traveler+0.076
 portal+0.073
 setup+0.073
 troll+0.071
 detectives+0.071
Neg logit contributions
ags-0.118
rell-0.104
opl-0.102
ag-0.101
aces-0.099
Token the
Token ID262
Feature activation+5.31e-02

Pos logit contributions
 traveler+0.219
 scientist+0.198
 troll+0.198
 Wizard+0.197
 detectives+0.196
Neg logit contributions
ags-0.339
opl-0.302
rell-0.291
aters-0.281
icer-0.280
Token door
Token ID3420
Feature activation-1.30e-01

Pos logit contributions
ags+0.114
opl+0.101
rell+0.100
icer+0.097
aters+0.095
Neg logit contributions
 traveler-0.077
 spies-0.072
 detectives-0.072
 setup-0.070
 troll-0.070
Token that
Token ID326
Feature activation-1.84e-01

Pos logit contributions
 traveler+0.194
 spies+0.182
 troll+0.181
 detectives+0.179
 scientist+0.178
Neg logit contributions
ags-0.271
opl-0.246
rell-0.244
c-0.234
aters-0.232
Token opens
Token ID9808
Feature activation-5.20e-02

Pos logit contributions
 traveler+0.287
 spies+0.271
 setup+0.268
 troll+0.265
 detectives+0.265
Neg logit contributions
ags-0.375
opl-0.333
rell-0.332
icer-0.321
aters-0.316
Token downstairs
Token ID34624
Feature activation-1.86e-01

Pos logit contributions
 traveler+0.091
 scientist+0.087
 troll+0.087
 criminal+0.086
 detectives+0.086
Neg logit contributions
ags-0.093
icer-0.080
c-0.078
opl-0.078
aters-0.078
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 traveler+0.269
 spies+0.251
 setup+0.248
 troll+0.247
 detectives+0.246
Neg logit contributions
ags-0.401
opl-0.356
rell-0.349
icer-0.348
aters-0.343
Token They
Token ID1119
Feature activation-2.01e-02

Pos logit contributions
ags+0.020
aters+0.016
opl+0.016
rell+0.016
c+0.015
Neg logit contributions
 traveler-0.024
 setup-0.023
 troll-0.023
 scientist-0.022
 criminal-0.022
Token do
Token ID466
Feature activation-6.40e-02

Pos logit contributions
 traveler+0.026
 spies+0.025
 setup+0.024
 troll+0.024
 detectives+0.024
Neg logit contributions
ags-0.046
opl-0.041
rell-0.041
icer-0.040
aters-0.039
Token there
Token ID612
Feature activation-4.94e-02

Pos logit contributions
ags+0.238
rell+0.211
opl+0.210
aces+0.204
icer+0.204
Neg logit contributions
 traveler-0.165
 spies-0.155
 troll-0.154
 setup-0.153
 detectives-0.151
Token.
Token ID13
Feature activation-8.34e-04

Pos logit contributions
 traveler+0.068
 spies+0.064
 setup+0.063
 troll+0.062
 detectives+0.062
Neg logit contributions
ags-0.109
rell-0.098
opl-0.097
icer-0.095
aters-0.094
Token She
Token ID1375
Feature activation-2.16e-02

Pos logit contributions
 traveler+0.001
 university+0.001
 spies+0.001
 setup+0.001
 detectives+0.001
Neg logit contributions
ags-0.002
opl-0.002
icer-0.002
rell-0.002
aters-0.002
Token was
Token ID373
Feature activation+1.14e-02

Pos logit contributions
 traveler+0.029
 spies+0.027
 setup+0.027
 troll+0.026
 detectives+0.026
Neg logit contributions
ags-0.050
opl-0.044
rell-0.044
icer-0.043
aters-0.042
Token happy
Token ID3772
Feature activation+1.34e-01

Pos logit contributions
ags+0.025
opl+0.022
rell+0.022
icer+0.022
aters+0.021
Neg logit contributions
 traveler-0.016
 spies-0.015
 setup-0.015
 troll-0.015
 detectives-0.015
Token that
Token ID326
Feature activation-1.89e-01

Pos logit contributions
ags+0.233
opl+0.212
rell+0.209
aters+0.208
aces+0.207
Neg logit contributions
 traveler-0.219
 criminal-0.203
 portal-0.201
 troll-0.201
eware-0.200
Token she
Token ID673
Feature activation-6.80e-02

Pos logit contributions
 traveler+0.206
 spies+0.189
 troll+0.186
 portal+0.185
 detectives+0.182
Neg logit contributions
ags-0.464
opl-0.425
rell-0.424
icer-0.411
c-0.408
Token could
Token ID714
Feature activation+7.46e-03

Pos logit contributions
 traveler+0.101
 spies+0.097
 troll+0.095
 university+0.095
 setup+0.094
Neg logit contributions
ags-0.137
rell-0.124
opl-0.123
c-0.121
aces-0.120
Token have
Token ID423
Feature activation-5.99e-03

Pos logit contributions
ags+0.016
opl+0.014
rell+0.014
aces+0.013
aters+0.013
Neg logit contributions
 traveler-0.011
 setup-0.011
 troll-0.011
 spies-0.010
 portal-0.010
Token some
Token ID617
Feature activation+3.00e-02

Pos logit contributions
 traveler+0.009
 portal+0.009
 troll+0.009
 setup+0.008
 scientist+0.008
Neg logit contributions
ags-0.013
rell-0.011
opl-0.011
c-0.011
aces-0.011
Token healthy
Token ID5448
Feature activation+8.99e-02

Pos logit contributions
ags+0.072
opl+0.066
icer+0.066
rell+0.066
aters+0.063
Neg logit contributions
 traveler-0.034
 spies-0.034
 university-0.033
 detectives-0.033
 setup-0.032
Token.
Token ID13
Feature activation-3.29e-03

Pos logit contributions
 traveler+0.235
 spies+0.222
 detectives+0.216
 portal+0.214
 setup+0.214
Neg logit contributions
ags-0.377
opl-0.337
rell-0.336
icer-0.326
aters-0.324
Token She
Token ID1375
Feature activation-2.40e-02

Pos logit contributions
 traveler+0.005
 spies+0.005
 setup+0.005
 detectives+0.005
 university+0.005
Neg logit contributions
ags-0.007
opl-0.006
rell-0.006
icer-0.006
aters-0.006
Token felt
Token ID2936
Feature activation-4.44e-02

Pos logit contributions
 traveler+0.035
 spies+0.034
 detectives+0.032
 troll+0.032
 portal+0.032
Neg logit contributions
ags-0.050
rell-0.045
opl-0.045
c-0.044
icer-0.044
Token so
Token ID523
Feature activation-7.57e-03

Pos logit contributions
 traveler+0.068
 setup+0.065
 spies+0.064
 troll+0.064
 portal+0.063
Neg logit contributions
ags-0.092
rell-0.080
opl-0.080
aces-0.076
c-0.075
Token relieved
Token ID22598
Feature activation-6.29e-02

Pos logit contributions
 traveler+0.011
 spies+0.011
 setup+0.010
 troll+0.010
 detectives+0.010
Neg logit contributions
ags-0.016
opl-0.014
rell-0.014
icer-0.014
aters-0.014
Token that
Token ID326
Feature activation-1.89e-01

Pos logit contributions
 traveler+0.094
 criminal+0.091
 portal+0.090
 troll+0.089
 detectives+0.089
Neg logit contributions
ags-0.124
opl-0.108
rell-0.107
aces-0.106
aters-0.103
Token she
Token ID673
Feature activation-6.93e-02

Pos logit contributions
 traveler+0.204
 spies+0.189
 troll+0.185
 setup+0.184
 portal+0.182
Neg logit contributions
ags-0.473
opl-0.432
rell-0.429
icer-0.417
aters-0.415
Token remembered
Token ID12086
Feature activation-1.70e-01

Pos logit contributions
 traveler+0.108
 spies+0.106
 university+0.105
 troll+0.103
 criminal+0.102
Neg logit contributions
ags-0.132
rell-0.119
c-0.115
opl-0.115
aces-0.115
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
 traveler+0.246
 portal+0.231
 spies+0.227
 troll+0.226
 detectives+0.223
Neg logit contributions
ags-0.355
opl-0.319
rell-0.311
c-0.307
aces-0.301
Token so
Token ID523
Feature activation-8.25e-03

Pos logit contributions
ags+0.171
c+0.154
rell+0.152
aces+0.152
opl+0.149
Neg logit contributions
eware-0.180
 traveler-0.176
 criminal-0.174
fire-0.171
 troll-0.170
Token happy
Token ID3772
Feature activation+1.33e-01

Pos logit contributions
 traveler+0.014
 spies+0.013
 troll+0.013
 detectives+0.013
 setup+0.013
Neg logit contributions
ags-0.015
opl-0.014
rell-0.013
c-0.013
aces-0.013
Token up
Token ID510
Feature activation+1.30e-02

Pos logit contributions
ags+0.085
opl+0.075
rell+0.074
icer+0.073
aters+0.073
Neg logit contributions
 traveler-0.062
 setup-0.059
 spies-0.058
 portal-0.058
 troll-0.057
Token,
Token ID11
Feature activation-8.85e-02

Pos logit contributions
ags+0.025
opl+0.021
c+0.021
rell+0.021
agged+0.020
Neg logit contributions
 traveler-0.021
 spies-0.021
 detectives-0.020
 clues-0.020
 troll-0.020
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
 traveler+0.142
 spies+0.138
 detectives+0.135
 troll+0.135
 portal+0.133
Neg logit contributions
ags-0.170
opl-0.151
rell-0.150
icer-0.145
c-0.144
Token he
Token ID339
Feature activation-3.32e-02

Pos logit contributions
ags+0.202
c+0.194
rell+0.191
opl+0.191
aters+0.178
Neg logit contributions
 spies-0.135
 criminal-0.132
 detectives-0.131
 troll-0.131
 traveler-0.126
Token understood
Token ID7247
Feature activation-1.07e-01

Pos logit contributions
 traveler+0.048
 spies+0.048
 detectives+0.046
 university+0.046
 troll+0.045
Neg logit contributions
ags-0.067
rell-0.061
opl-0.061
c-0.060
icer-0.058
Token that
Token ID326
Feature activation-1.91e-01

Pos logit contributions
 traveler+0.141
 detectives+0.135
 spies+0.134
 portal+0.130
 troll+0.129
Neg logit contributions
ags-0.240
opl-0.217
rell-0.208
c-0.207
aters-0.202
Token it
Token ID340
Feature activation+5.25e-02

Pos logit contributions
 traveler+0.218
 spies+0.205
 troll+0.202
 detectives+0.200
 portal+0.198
Neg logit contributions
ags-0.443
opl-0.419
rell-0.411
c-0.403
icer-0.402
Token was
Token ID373
Feature activation+7.12e-03

Pos logit contributions
ags+0.111
opl+0.097
rell+0.096
icer+0.096
aters+0.094
Neg logit contributions
 traveler-0.079
 setup-0.076
 spies-0.075
 portal-0.074
 troll-0.074
Token better
Token ID1365
Feature activation-2.19e-02

Pos logit contributions
ags+0.015
opl+0.013
rell+0.013
c+0.013
icer+0.013
Neg logit contributions
 traveler-0.010
 troll-0.010
 spies-0.010
 portal-0.010
 detectives-0.010
Token to
Token ID284
Feature activation-6.87e-02

Pos logit contributions
 traveler+0.031
 troll+0.030
 portal+0.030
 Wizard+0.029
 university+0.029
Neg logit contributions
ags-0.043
opl-0.038
c-0.038
aces-0.036
 pads-0.036
Token stay
Token ID2652
Feature activation+1.61e-02

Pos logit contributions
 spies+0.101
 traveler+0.099
 detectives+0.098
 troll+0.096
 portal+0.093
Neg logit contributions
ags-0.145
opl-0.129
rell-0.128
ag-0.122
c-0.121