TOP ACTIVATIONS
MAX = 9.235

go ." He unh ooks Anna from the zip
Mia and Ben felt w obb ly and gigg ly .
noise . Lily got even ang rier and threw her doll
, but Johnny kept getting ang rier and ang rier .
kept getting ang rier and ang rier . Finally , Tim
to break ." Molly nodded solemn ly , understanding the lesson
to make her mom any ang rier , so she stopped
happened , they were very apolog etic .
so the fox got even ang rier . "
spot . With a small gro an , he hopped down
. Everyone praised her cle vern ess and eventually Jan ie
B izzy nodded solemn ly . She knew she
, getting ang rier and ang rier . The two friends
. The water was so grim y and gross , but
sunrise !" Sam gro aned and pulled his blanket
â t get ang rier . â
Tom and Anna gro aned . They did not
Sara and Ben . He apolog izes again .
the creek ?" asked Peter enthusi atically . "
he felt a bit w obb ly . "

MIN ACTIVATIONS
MIN = -9.914

. They like to play at the beach . They make
Lily who loved to play at the beach . One day
. They like to play at the shore . They see
day , Amy was playing at the park with her friends
. They like to play at the park . One day
. They like to play at school . But today ,
. They like to play at the park . They see
. They like to play at the beach . They have
. They like to play at the park . They like
friends who liked to play at the park . They liked
Tim my loved to play at the park . One day
. She loved to play at the park . Her favorite
. Lily loved to play at the park . Her favorite
Lily who loved to play at the beach . One day
. Tim my loved playing at the park with his friends
friends who liked to play at the beach . They liked
Sally . She loved playing at the park near her house
. She loved to play at the sea and collect pretty
. She loved to play at the park . Her favorite
. They like to play at the park . They like

INTERVAL 4.447 - 9.235
CONTAINS 1.680%

away , but Spot was fast . He caught the cat
S oph ie got very impatient , so she called
said . They waved at the dog and walked
kind ." Mom smiled and said , " You
Ab ig ail felt deep regret and decided to take

INTERVAL -0.340 - 4.447
CONTAINS 55.864%

the forest . One day , they went on a trail
nature ." Little Tim liked the idea and
the p uddle instead ." So , they found
see the sun and felt happy and energ ized . He
quite noisy ! She frowned and decided she didn 't like

INTERVAL -5.127 - -0.340
CONTAINS 41.227%

said , " I know it hurts , but I 'm
up the hole . When she was done , Julie felt
birthday hat and was waving to all his friends . His
have a special show . The animals wanted to display their
him . " Hi Max , how are

INTERVAL -9.914 - -5.127
CONTAINS 1.230%

hand and she was ready to go . Mom
about the mosquitoes anymore . <|endoftext|> Once upon a time ,
decided to take Sus ie to a restaurant for dinner .
my . Tim my liked to explore and go on adventures
year old . He wanted to go to the church ,
Token go
Token ID467
Feature activation-1.64e+00

Pos logit contributions
 hunt+0.056
 hall+0.051
 exhibit+0.051
 library+0.050
 park+0.050
Neg logit contributions
ived-0.059
rier-0.058
lessly-0.057
ously-0.056
der-0.055
Token."
Token ID526
Feature activation+9.59e-01

Pos logit contributions
 hunt+0.038
 library+0.036
 park+0.034
 hall+0.034
 exhibit+0.034
Neg logit contributions
lessly-0.044
ously-0.041
ived-0.041
rier-0.041
der-0.039
Token
Token ID198
Feature activation+1.82e+00

Pos logit contributions
lessly+0.028
ously+0.028
rier+0.027
ived+0.027
der+0.026
Neg logit contributions
 hunt-0.021
 park-0.019
 library-0.018
 exhibit-0.018
 hall-0.018
Token
Token ID198
Feature activation+1.71e+00

Pos logit contributions
lessly+0.054
ously+0.053
rier+0.052
ived+0.051
der+0.051
Neg logit contributions
 hunt-0.038
 park-0.035
 library-0.034
 exhibit-0.033
 hall-0.033
TokenHe
Token ID1544
Feature activation+5.87e-01

Pos logit contributions
lessly+0.055
ously+0.054
rier+0.053
ived+0.052
der+0.052
Neg logit contributions
 hunt-0.032
 park-0.029
 library-0.028
 exhibit-0.027
 hall-0.027
Token unh
Token ID14274
Feature activation+8.92e+00

Pos logit contributions
ously+0.015
lessly+0.015
der+0.015
ived+0.015
rier+0.014
Neg logit contributions
 hunt-0.014
 exhibit-0.012
 park-0.012
 library-0.012
 hall-0.012
Tokenooks
Token ID31085
Feature activation+9.41e-01

Pos logit contributions
lessly+0.204
ously+0.202
der+0.192
ived+0.181
rier+0.181
Neg logit contributions
 hunt-0.220
house-0.218
 exhibit-0.204
 hide-0.202
e-0.202
Token Anna
Token ID11735
Feature activation+1.57e+00

Pos logit contributions
lessly+0.025
rier+0.025
ously+0.024
der+0.024
ived+0.023
Neg logit contributions
 hunt-0.021
 park-0.020
 exhibit-0.019
 hall-0.019
 library-0.019
Token from
Token ID422
Feature activation+1.55e+00

Pos logit contributions
ived+0.047
ously+0.047
lessly+0.046
der+0.043
rier+0.043
Neg logit contributions
 hunt-0.030
 library-0.028
 park-0.027
 hall-0.027
 museum-0.027
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
lessly+0.049
ously+0.048
rier+0.047
ived+0.046
der+0.046
Neg logit contributions
 hunt-0.029
 exhibit-0.025
 park-0.024
 library-0.024
 hall-0.024
Token zip
Token ID19974
Feature activation+7.49e-01

Pos logit contributions
 hunt+0.003
rob+0.002
 exhibit+0.002
nob+0.002
ci+0.002
Neg logit contributions
rier-0.004
lessly-0.004
ously-0.004
ived-0.004
der-0.003
Token Mia
Token ID32189
Feature activation+1.56e+00

Pos logit contributions
rier+0.019
lessly+0.019
ously+0.019
ived+0.018
iously+0.017
Neg logit contributions
 park-0.019
 hunt-0.019
 library-0.018
 hall-0.017
 exhibit-0.017
Token and
Token ID290
Feature activation+2.27e+00

Pos logit contributions
ously+0.033
lessly+0.033
ived+0.031
der+0.031
rier+0.029
Neg logit contributions
 hunt-0.045
 exhibit-0.041
 library-0.041
 park-0.040
 hall-0.040
Token Ben
Token ID3932
Feature activation+9.35e-01

Pos logit contributions
lessly+0.061
ously+0.060
rier+0.058
ived+0.058
der+0.058
Neg logit contributions
 hunt-0.054
 park-0.049
 exhibit-0.049
 library-0.048
 hall-0.047
Token felt
Token ID2936
Feature activation+4.55e+00

Pos logit contributions
lessly+0.023
ously+0.023
ived+0.022
der+0.021
rier+0.021
Neg logit contributions
 hunt-0.024
 exhibit-0.022
 library-0.022
 park-0.021
 hall-0.021
Token w
Token ID266
Feature activation+6.20e+00

Pos logit contributions
ously+0.112
lessly+0.109
rier+0.106
ived+0.106
der+0.105
Neg logit contributions
 hunt-0.115
 park-0.113
 library-0.105
 hall-0.103
 play-0.103
Tokenobb
Token ID21963
Feature activation+8.89e+00

Pos logit contributions
ously+0.119
rier+0.114
lessly+0.112
der+0.103
ived+0.103
Neg logit contributions
 hunt-0.172
 park-0.168
 hide-0.159
 theater-0.159
e-0.159
Tokenly
Token ID306
Feature activation+4.05e+00

Pos logit contributions
ously+0.112
lessly+0.097
rier+0.089
ived+0.087
der+0.083
Neg logit contributions
 hunt-0.318
 park-0.294
 slide-0.288
 exhibit-0.287
rob-0.287
Token and
Token ID290
Feature activation+3.11e+00

Pos logit contributions
lessly+0.103
ously+0.101
rier+0.099
der+0.097
ived+0.096
Neg logit contributions
 hunt-0.101
 park-0.093
 library-0.091
 exhibit-0.091
 hall-0.089
Token gigg
Token ID30442
Feature activation+6.41e+00

Pos logit contributions
lessly+0.059
ously+0.056
rier+0.055
ived+0.053
der+0.053
Neg logit contributions
 hunt-0.099
 park-0.093
 exhibit-0.092
 library-0.091
 hall-0.090
Tokenly
Token ID306
Feature activation+3.90e+00

Pos logit contributions
ously+0.138
lessly+0.129
iously+0.121
ived+0.119
rier+0.118
Neg logit contributions
 park-0.151
 hunt-0.147
 theater-0.140
 play-0.139
 slide-0.136
Token.
Token ID13
Feature activation+6.56e-01

Pos logit contributions
lessly+0.113
ously+0.111
rier+0.108
der+0.107
ived+0.107
Neg logit contributions
 hunt-0.085
 park-0.077
 library-0.076
 exhibit-0.075
 hall-0.074
Token noise
Token ID7838
Feature activation+3.52e+00

Pos logit contributions
 hunt+0.003
 park+0.002
 library+0.002
 exhibit+0.002
 hall+0.002
Neg logit contributions
lessly-0.003
ously-0.003
rier-0.003
der-0.003
ived-0.003
Token.
Token ID13
Feature activation+9.74e-01

Pos logit contributions
ously+0.107
lessly+0.105
rier+0.102
ived+0.102
der+0.099
Neg logit contributions
 hunt-0.068
 park-0.061
 hall-0.059
 library-0.058
 exhibit-0.058
Token Lily
Token ID20037
Feature activation+1.43e+00

Pos logit contributions
ously+0.026
lessly+0.025
rier+0.025
ived+0.025
der+0.023
Neg logit contributions
 hunt-0.022
 park-0.021
 hall-0.021
 library-0.021
 exhibit-0.020
Token got
Token ID1392
Feature activation+3.87e+00

Pos logit contributions
lessly+0.040
ously+0.039
rier+0.038
ived+0.037
der+0.037
Neg logit contributions
 hunt-0.033
 park-0.030
 exhibit-0.029
 library-0.029
 hall-0.029
Token even
Token ID772
Feature activation+2.61e+00

Pos logit contributions
lessly+0.095
ously+0.094
rier+0.091
der+0.090
ived+0.088
Neg logit contributions
 hunt-0.100
 park-0.093
 exhibit-0.090
 library-0.090
 hall-0.090
Token ang
Token ID3550
Feature activation+8.76e+00

Pos logit contributions
lessly+0.051
ously+0.049
rier+0.048
ived+0.047
der+0.046
Neg logit contributions
 hunt-0.081
 park-0.075
 exhibit-0.074
 library-0.074
 hall-0.073
Tokenrier
Token ID5277
Feature activation+6.00e+00

Pos logit contributions
ously+0.076
ived+0.062
 pleased+0.060
lessly+0.059
atted+0.049
Neg logit contributions
 hunt-0.307
 hide-0.294
 store-0.293
 test-0.293
house-0.288
Token and
Token ID290
Feature activation+2.83e+00

Pos logit contributions
lessly+0.160
rier+0.157
ived+0.153
ously+0.146
uster+0.140
Neg logit contributions
 hunt-0.135
 park-0.121
 exhibit-0.118
 hall-0.116
 library-0.114
Token threw
Token ID9617
Feature activation-6.10e-02

Pos logit contributions
lessly+0.066
ously+0.064
rier+0.063
der+0.061
ived+0.061
Neg logit contributions
 hunt-0.078
 exhibit-0.072
 park-0.072
 library-0.071
 hall-0.070
Token her
Token ID607
Feature activation+1.13e+00

Pos logit contributions
 hunt+0.001
 exhibit+0.001
 library+0.001
 park+0.001
 hall+0.001
Neg logit contributions
lessly-0.002
ously-0.002
rier-0.002
der-0.002
ived-0.002
Token doll
Token ID3654
Feature activation-1.00e+00

Pos logit contributions
lessly+0.033
ously+0.032
rier+0.032
ived+0.032
der+0.031
Neg logit contributions
 hunt-0.024
 library-0.021
 park-0.021
 exhibit-0.021
 museum-0.019
Token,
Token ID11
Feature activation+2.70e+00

Pos logit contributions
lessly+0.054
rier+0.053
ously+0.052
der+0.051
ived+0.051
Neg logit contributions
 hunt-0.039
 park-0.035
 exhibit-0.032
 play-0.032
 library-0.032
Token but
Token ID475
Feature activation+1.14e+00

Pos logit contributions
rier+0.080
ived+0.080
ously+0.077
lessly+0.076
der+0.072
Neg logit contributions
 hunt-0.055
 park-0.054
 slide-0.050
 hall-0.048
 store-0.048
Token Johnny
Token ID15470
Feature activation+3.99e-01

Pos logit contributions
rier+0.036
lessly+0.035
ously+0.034
ived+0.034
der+0.033
Neg logit contributions
 hunt-0.022
 park-0.021
 play-0.020
 hall-0.020
 library-0.019
Token kept
Token ID4030
Feature activation+1.98e+00

Pos logit contributions
ously+0.010
lessly+0.009
der+0.009
rier+0.009
ived+0.009
Neg logit contributions
 hunt-0.010
 library-0.010
 exhibit-0.010
 park-0.009
 hall-0.009
Token getting
Token ID1972
Feature activation+2.97e+00

Pos logit contributions
lessly+0.051
ously+0.051
rier+0.049
ived+0.048
der+0.047
Neg logit contributions
 hunt-0.048
 library-0.044
 exhibit-0.044
 park-0.044
 hall-0.043
Token ang
Token ID3550
Feature activation+8.74e+00

Pos logit contributions
ously+0.069
lessly+0.068
der+0.065
rier+0.063
ived+0.062
Neg logit contributions
 hunt-0.081
 park-0.077
 library-0.072
 hall-0.072
 play-0.072
Tokenrier
Token ID5277
Feature activation+5.74e+00

Pos logit contributions
ously+0.089
lessly+0.071
ived+0.060
 pleased+0.054
atted+0.053
Neg logit contributions
 hunt-0.315
 test-0.299
 store-0.294
 hide-0.294
 slide-0.284
Token and
Token ID290
Feature activation+2.41e+00

Pos logit contributions
lessly+0.173
ived+0.168
rier+0.167
ously+0.160
uster+0.154
Neg logit contributions
 hunt-0.110
 park-0.095
 exhibit-0.094
 hall-0.087
 library-0.087
Token ang
Token ID3550
Feature activation+8.72e+00

Pos logit contributions
lessly+0.054
ously+0.052
rier+0.051
der+0.050
ived+0.050
Neg logit contributions
 hunt-0.068
 park-0.063
 exhibit-0.062
 library-0.062
 hall-0.062
Tokenrier
Token ID5277
Feature activation+5.26e+00

Pos logit contributions
ously+0.095
lessly+0.071
ived+0.062
 pleased+0.051
lee+0.049
Neg logit contributions
 hunt-0.307
 test-0.297
 exhibit-0.285
 hide-0.285
 store-0.283
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
lessly+0.171
rier+0.167
ously+0.164
ived+0.162
der+0.157
Neg logit contributions
 hunt-0.090
 park-0.076
 exhibit-0.076
 hall-0.072
 library-0.070
Token kept
Token ID4030
Feature activation+1.98e+00

Pos logit contributions
ously+0.010
lessly+0.009
der+0.009
rier+0.009
ived+0.009
Neg logit contributions
 hunt-0.010
 library-0.010
 exhibit-0.010
 park-0.009
 hall-0.009
Token getting
Token ID1972
Feature activation+2.97e+00

Pos logit contributions
lessly+0.051
ously+0.051
rier+0.049
ived+0.048
der+0.047
Neg logit contributions
 hunt-0.048
 library-0.044
 exhibit-0.044
 park-0.044
 hall-0.043
Token ang
Token ID3550
Feature activation+8.74e+00

Pos logit contributions
ously+0.069
lessly+0.068
der+0.065
rier+0.063
ived+0.062
Neg logit contributions
 hunt-0.081
 park-0.077
 library-0.072
 hall-0.072
 play-0.072
Tokenrier
Token ID5277
Feature activation+5.74e+00

Pos logit contributions
ously+0.089
lessly+0.071
ived+0.060
 pleased+0.054
atted+0.053
Neg logit contributions
 hunt-0.315
 test-0.299
 store-0.294
 hide-0.294
 slide-0.284
Token and
Token ID290
Feature activation+2.41e+00

Pos logit contributions
lessly+0.173
ived+0.168
rier+0.167
ously+0.160
uster+0.154
Neg logit contributions
 hunt-0.110
 park-0.095
 exhibit-0.094
 hall-0.087
 library-0.087
Token ang
Token ID3550
Feature activation+8.72e+00

Pos logit contributions
lessly+0.054
ously+0.052
rier+0.051
der+0.050
ived+0.050
Neg logit contributions
 hunt-0.068
 park-0.063
 exhibit-0.062
 library-0.062
 hall-0.062
Tokenrier
Token ID5277
Feature activation+5.26e+00

Pos logit contributions
ously+0.095
lessly+0.071
ived+0.062
 pleased+0.051
lee+0.049
Neg logit contributions
 hunt-0.307
 test-0.297
 exhibit-0.285
 hide-0.285
 store-0.283
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
lessly+0.171
rier+0.167
ously+0.164
ived+0.162
der+0.157
Neg logit contributions
 hunt-0.090
 park-0.076
 exhibit-0.076
 hall-0.072
 library-0.070
Token Finally
Token ID9461
Feature activation+1.11e+00

Pos logit contributions
rier+0.003
ously+0.003
lessly+0.003
ived+0.003
der+0.003
Neg logit contributions
 hunt-0.003
 park-0.003
 hall-0.003
 library-0.003
 exhibit-0.003
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
lessly+0.034
rier+0.033
ously+0.032
der+0.032
ived+0.031
Neg logit contributions
 hunt-0.023
 park-0.021
 exhibit-0.020
 hall-0.020
 play-0.020
Token Tim
Token ID5045
Feature activation-1.67e-01

Pos logit contributions
lessly+0.006
ously+0.006
der+0.006
rier+0.006
ived+0.006
Neg logit contributions
 hunt-0.006
 exhibit-0.005
 library-0.005
 park-0.005
 hall-0.005
Token to
Token ID284
Feature activation-2.19e+00

Pos logit contributions
 hunt+0.005
 library+0.005
 exhibit+0.005
 park+0.004
 hall+0.004
Neg logit contributions
lessly-0.006
rier-0.005
der-0.005
ived-0.005
ously-0.005
Token break
Token ID2270
Feature activation+6.71e-01

Pos logit contributions
 exhibit+0.049
 hunt+0.048
 hall+0.046
 library+0.045
 theater+0.043
Neg logit contributions
lessly-0.057
der-0.056
rier-0.054
ously-0.053
 understanding-0.048
Token."
Token ID526
Feature activation+1.25e+00

Pos logit contributions
lessly+0.019
rier+0.018
ously+0.018
der+0.018
ived+0.017
Neg logit contributions
 hunt-0.015
 park-0.014
 library-0.014
 exhibit-0.014
 hall-0.013
Token Molly
Token ID30236
Feature activation+3.13e+00

Pos logit contributions
lessly+0.035
rier+0.034
ously+0.033
ived+0.032
der+0.031
Neg logit contributions
 hunt-0.028
 park-0.028
 library-0.026
 hall-0.026
 exhibit-0.026
Token nodded
Token ID14464
Feature activation+4.69e+00

Pos logit contributions
lessly+0.078
rier+0.075
ously+0.075
ived+0.072
der+0.071
Neg logit contributions
 hunt-0.081
 park-0.076
 library-0.073
 exhibit-0.073
 hall-0.072
Token solemn
Token ID26322
Feature activation+8.66e+00

Pos logit contributions
lessly+0.156
ived+0.155
rier+0.149
der+0.144
uster+0.142
Neg logit contributions
 play-0.076
 visit-0.068
 hunt-0.068
 park-0.065
 read-0.064
Tokenly
Token ID306
Feature activation+4.73e+00

Pos logit contributions
lessly+0.163
ived+0.155
 importantly+0.155
ously+0.145
add+0.141
Neg logit contributions
 hunt-0.210
 play-0.208
 show-0.190
 visit-0.189
 exhibit-0.187
Token,
Token ID11
Feature activation+4.81e+00

Pos logit contributions
lessly+0.134
rier+0.131
ously+0.124
der+0.122
ived+0.122
Neg logit contributions
 hunt-0.099
 park-0.089
 play-0.089
 exhibit-0.086
 store-0.085
Token understanding
Token ID4547
Feature activation+3.11e+00

Pos logit contributions
rier+0.107
ived+0.105
lessly+0.104
ously+0.101
der+0.097
Neg logit contributions
 hunt-0.132
 park-0.128
 play-0.120
 library-0.119
 hall-0.119
Token the
Token ID262
Feature activation+2.21e+00

Pos logit contributions
lessly+0.088
ously+0.086
rier+0.086
ived+0.085
der+0.082
Neg logit contributions
 hunt-0.068
 park-0.063
 library-0.061
 exhibit-0.060
 hall-0.059
Token lesson
Token ID11483
Feature activation+3.12e+00

Pos logit contributions
lessly+0.062
ously+0.060
rier+0.060
ived+0.059
der+0.058
Neg logit contributions
 hunt-0.051
 park-0.049
 library-0.046
 hall-0.045
 exhibit-0.044
Token to
Token ID284
Feature activation-2.66e+00

Pos logit contributions
 hunt+0.046
 exhibit+0.042
 hall+0.041
 library+0.040
 park+0.040
Neg logit contributions
lessly-0.041
ously-0.040
der-0.038
ived-0.037
rier-0.037
Token make
Token ID787
Feature activation-7.65e-01

Pos logit contributions
 hunt+0.054
 exhibit+0.052
 library+0.051
 hall+0.049
 park+0.048
Neg logit contributions
lessly-0.074
ously-0.074
rier-0.071
der-0.071
ived-0.069
Token her
Token ID607
Feature activation+9.53e-01

Pos logit contributions
 hunt+0.018
 exhibit+0.016
 library+0.016
 hall+0.016
 park+0.016
Neg logit contributions
lessly-0.021
rier-0.020
ously-0.020
der-0.020
ived-0.020
Token mom
Token ID1995
Feature activation-1.01e+00

Pos logit contributions
lessly+0.020
ously+0.020
rier+0.020
ived+0.019
der+0.019
Neg logit contributions
 hunt-0.027
 exhibit-0.025
 park-0.025
 library-0.025
 hall-0.024
Token any
Token ID597
Feature activation+1.12e+00

Pos logit contributions
 hunt+0.031
 library+0.029
 park+0.029
 exhibit+0.029
 hall+0.028
Neg logit contributions
lessly-0.019
ously-0.019
rier-0.019
ived-0.018
der-0.017
Token ang
Token ID3550
Feature activation+8.60e+00

Pos logit contributions
lessly+0.027
ously+0.027
rier+0.026
der+0.026
ived+0.025
Neg logit contributions
 hunt-0.030
 exhibit-0.027
 park-0.027
 library-0.027
 hall-0.026
Tokenrier
Token ID5277
Feature activation+4.65e+00

Pos logit contributions
ived+0.066
ously+0.065
lessly+0.057
 pleased+0.045
lee+0.042
Neg logit contributions
 hunt-0.319
 store-0.317
 test-0.312
 park-0.305
 slide-0.304
Token,
Token ID11
Feature activation+3.96e+00

Pos logit contributions
lessly+0.140
rier+0.135
ously+0.135
ived+0.133
der+0.130
Neg logit contributions
 hunt-0.093
 park-0.084
 exhibit-0.080
 library-0.080
 hall-0.079
Token so
Token ID523
Feature activation+4.17e+00

Pos logit contributions
rier+0.094
ived+0.094
ously+0.093
lessly+0.092
der+0.087
Neg logit contributions
 hunt-0.098
 park-0.096
 hall-0.090
 library-0.089
 slide-0.088
Token she
Token ID673
Feature activation+4.30e-01

Pos logit contributions
lessly+0.114
ously+0.113
rier+0.111
ived+0.110
der+0.108
Neg logit contributions
 hunt-0.092
 park-0.087
 library-0.083
 hall-0.082
 exhibit-0.081
Token stopped
Token ID5025
Feature activation+2.55e+00

Pos logit contributions
lessly+0.011
ously+0.011
rier+0.011
ived+0.011
der+0.011
Neg logit contributions
 hunt-0.010
 park-0.010
 library-0.009
 exhibit-0.009
 hall-0.009
Token happened
Token ID3022
Feature activation+3.19e+00

Pos logit contributions
 hunt+0.013
 hall+0.012
 exhibit+0.012
 library+0.012
 zoo+0.012
Neg logit contributions
ously-0.015
lessly-0.014
ived-0.014
der-0.013
rier-0.012
Token,
Token ID11
Feature activation+2.76e+00

Pos logit contributions
lessly+0.075
ously+0.074
rier+0.071
der+0.070
ived+0.069
Neg logit contributions
 hunt-0.086
 park-0.079
 library-0.078
 exhibit-0.078
 hall-0.077
Token they
Token ID484
Feature activation+3.62e-01

Pos logit contributions
lessly+0.075
ously+0.073
rier+0.071
der+0.070
ived+0.067
Neg logit contributions
 hunt-0.067
 exhibit-0.060
 park-0.058
 library-0.058
 hall-0.057
Token were
Token ID547
Feature activation+3.30e+00

Pos logit contributions
lessly+0.010
ously+0.010
rier+0.009
der+0.009
ived+0.009
Neg logit contributions
 hunt-0.009
 park-0.008
 library-0.008
 exhibit-0.008
 hall-0.008
Token very
Token ID845
Feature activation+5.09e+00

Pos logit contributions
lessly+0.075
ously+0.074
rier+0.072
ived+0.070
der+0.070
Neg logit contributions
 hunt-0.093
 park-0.086
 library-0.084
 exhibit-0.084
 hall-0.083
Token apolog
Token ID8453
Feature activation+8.58e+00

Pos logit contributions
ously+0.106
lessly+0.101
der+0.097
ived+0.095
rier+0.094
Neg logit contributions
 hunt-0.145
 park-0.141
 hall-0.135
 theater-0.134
 store-0.134
Tokenetic
Token ID5139
Feature activation+6.46e+00

Pos logit contributions
ously+0.203
ived+0.202
lessly+0.194
der+0.192
iously+0.169
Neg logit contributions
 hunt-0.201
 park-0.188
 test-0.186
 store-0.183
 exhibit-0.178
Token.
Token ID13
Feature activation+9.10e-02

Pos logit contributions
ived+0.181
lessly+0.181
rier+0.180
ously+0.174
der+0.168
Neg logit contributions
 hunt-0.122
 park-0.117
 exhibit-0.109
 library-0.105
 hall-0.105
Token
Token ID220
Feature activation+2.89e+00

Pos logit contributions
lessly+0.002
ously+0.002
rier+0.002
ived+0.002
der+0.002
Neg logit contributions
 hunt-0.002
 park-0.002
 library-0.002
 exhibit-0.002
 hall-0.002
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
lessly+0.077
ously+0.075
rier+0.072
ived+0.072
der+0.071
Neg logit contributions
 hunt-0.071
 park-0.063
 library-0.062
 exhibit-0.062
 hall-0.061
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
rier+0.037
lessly+0.037
ously+0.037
der+0.036
ived+0.035
Neg logit contributions
 hunt-0.023
 park-0.022
 hall-0.020
 library-0.020
 exhibit-0.020
Token so
Token ID523
Feature activation+3.93e+00

Pos logit contributions
lessly+0.067
ously+0.065
rier+0.064
ived+0.063
der+0.060
Neg logit contributions
 hunt-0.073
 park-0.068
 library-0.065
 hall-0.064
 exhibit-0.064
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
lessly+0.121
ously+0.119
rier+0.116
ived+0.115
der+0.114
Neg logit contributions
 hunt-0.079
 park-0.071
 library-0.068
 exhibit-0.068
 hall-0.067
Token fox
Token ID21831
Feature activation+4.55e-01

Pos logit contributions
lessly+0.042
ously+0.041
rier+0.041
der+0.040
ived+0.039
Neg logit contributions
 hunt-0.037
 park-0.034
 exhibit-0.033
 library-0.033
 hall-0.033
Token got
Token ID1392
Feature activation+3.20e+00

Pos logit contributions
ously+0.011
lessly+0.011
der+0.011
ived+0.011
rier+0.010
Neg logit contributions
 hunt-0.011
 library-0.011
 hall-0.011
 park-0.011
 exhibit-0.010
Token even
Token ID772
Feature activation+2.15e+00

Pos logit contributions
lessly+0.083
ously+0.082
rier+0.079
der+0.078
ived+0.077
Neg logit contributions
 hunt-0.079
 park-0.073
 exhibit-0.070
 library-0.070
 hall-0.070
Token ang
Token ID3550
Feature activation+8.56e+00

Pos logit contributions
lessly+0.050
ously+0.048
rier+0.048
ived+0.046
der+0.046
Neg logit contributions
 hunt-0.059
 park-0.054
 exhibit-0.053
 library-0.053
 hall-0.053
Tokenrier
Token ID5277
Feature activation+5.69e+00

Pos logit contributions
ously+0.080
lessly+0.072
ived+0.057
itted+0.043
 pleased+0.042
Neg logit contributions
 hunt-0.335
 exhibit-0.302
 hide-0.300
 park-0.300
 store-0.300
Token.
Token ID13
Feature activation+4.59e-01

Pos logit contributions
lessly+0.160
rier+0.156
ived+0.152
ously+0.145
der+0.141
Neg logit contributions
 hunt-0.123
 park-0.107
 exhibit-0.105
 hall-0.098
 library-0.098
Token
Token ID198
Feature activation+8.11e-01

Pos logit contributions
lessly+0.011
ously+0.011
rier+0.011
ived+0.010
der+0.010
Neg logit contributions
 hunt-0.012
 park-0.011
 library-0.011
 exhibit-0.011
 hall-0.011
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
lessly+0.024
ously+0.024
rier+0.023
der+0.023
ived+0.023
Neg logit contributions
 hunt-0.017
 park-0.016
 exhibit-0.015
 library-0.015
 hall-0.015
Token"
Token ID1
Feature activation-3.22e+00

Pos logit contributions
lessly+0.027
ously+0.027
rier+0.026
ived+0.025
der+0.025
Neg logit contributions
 hunt-0.038
 park-0.036
 exhibit-0.035
 library-0.035
 hall-0.034
Token spot
Token ID4136
Feature activation+7.81e-02

Pos logit contributions
ously+0.013
lessly+0.012
rier+0.012
der+0.012
ived+0.011
Neg logit contributions
 hunt-0.006
 exhibit-0.006
 library-0.006
 hall-0.005
 bake-0.005
Token.
Token ID13
Feature activation+4.73e-01

Pos logit contributions
lessly+0.002
ously+0.002
rier+0.002
ived+0.002
der+0.002
Neg logit contributions
 hunt-0.002
 library-0.002
 park-0.002
 exhibit-0.002
 hall-0.002
Token With
Token ID2080
Feature activation+1.67e+00

Pos logit contributions
lessly+0.011
ously+0.010
rier+0.010
ived+0.010
der+0.009
Neg logit contributions
 hunt-0.013
 park-0.013
 exhibit-0.012
 library-0.012
 museum-0.012
Token a
Token ID257
Feature activation+2.73e+00

Pos logit contributions
lessly+0.051
ously+0.050
rier+0.050
ived+0.049
der+0.048
Neg logit contributions
 hunt-0.033
 park-0.030
 library-0.029
 exhibit-0.028
 hall-0.027
Token small
Token ID1402
Feature activation+3.02e+00

Pos logit contributions
der+0.075
ously+0.073
rier+0.073
lessly+0.072
ived+0.068
Neg logit contributions
 hunt-0.065
 park-0.061
 slide-0.057
 play-0.055
 hall-0.055
Token gro
Token ID7128
Feature activation+8.55e+00

Pos logit contributions
lessly+0.083
ously+0.080
rier+0.080
der+0.079
ived+0.077
Neg logit contributions
 hunt-0.072
 park-0.066
 library-0.063
 hall-0.062
 exhibit-0.062
Tokenan
Token ID272
Feature activation+2.23e+00

Pos logit contributions
ously+0.194
ived+0.183
der+0.178
lessly+0.173
rier+0.161
Neg logit contributions
 hunt-0.221
 exhibit-0.190
 test-0.188
 park-0.184
 slide-0.180
Token,
Token ID11
Feature activation+3.06e+00

Pos logit contributions
lessly+0.054
ously+0.054
rier+0.051
ived+0.051
der+0.049
Neg logit contributions
 hunt-0.058
 library-0.055
 park-0.054
 hall-0.053
 exhibit-0.053
Token he
Token ID339
Feature activation+1.03e+00

Pos logit contributions
lessly+0.078
ously+0.076
rier+0.074
ived+0.073
der+0.072
Neg logit contributions
 hunt-0.077
 park-0.071
 library-0.069
 exhibit-0.069
 hall-0.068
Token hopped
Token ID45230
Feature activation+2.11e+00

Pos logit contributions
lessly+0.026
ously+0.026
ived+0.025
rier+0.025
der+0.025
Neg logit contributions
 hunt-0.026
 park-0.024
 library-0.024
 exhibit-0.023
 hall-0.023
Token down
Token ID866
Feature activation+1.04e+00

Pos logit contributions
der+0.062
ously+0.061
rier+0.061
lessly+0.059
ived+0.056
Neg logit contributions
 hunt-0.043
 park-0.041
 play-0.037
 slide-0.036
 hall-0.036
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
lessly+0.008
ously+0.008
rier+0.007
der+0.007
ived+0.007
Neg logit contributions
 hunt-0.007
 library-0.006
 park-0.006
 exhibit-0.006
 hall-0.006
Token Everyone
Token ID11075
Feature activation+1.08e-01

Pos logit contributions
lessly+0.006
ously+0.006
rier+0.006
ived+0.006
der+0.006
Neg logit contributions
 hunt-0.006
 park-0.005
 library-0.005
 exhibit-0.005
 hall-0.005
Token praised
Token ID15342
Feature activation+4.34e+00

Pos logit contributions
lessly+0.003
ously+0.003
der+0.003
ived+0.003
rier+0.003
Neg logit contributions
 hunt-0.002
 library-0.002
 exhibit-0.002
 park-0.002
 hall-0.002
Token her
Token ID607
Feature activation+3.69e+00

Pos logit contributions
rier+0.128
lessly+0.127
ously+0.125
der+0.125
ived+0.122
Neg logit contributions
 hunt-0.086
 park-0.079
 exhibit-0.075
 hall-0.075
 library-0.074
Token cle
Token ID1190
Feature activation+7.34e+00

Pos logit contributions
lessly+0.088
ously+0.087
rier+0.086
ived+0.083
der+0.082
Neg logit contributions
 hunt-0.097
 park-0.091
 library-0.089
 exhibit-0.088
 hall-0.087
Tokenvern
Token ID933
Feature activation+8.54e+00

Pos logit contributions
der+0.148
ously+0.144
lessly+0.141
rier+0.137
ived+0.135
Neg logit contributions
 hunt-0.214
 exhibit-0.197
 park-0.195
 test-0.188
 library-0.187
Tokeness
Token ID408
Feature activation+4.08e+00

Pos logit contributions
ously+0.228
ived+0.227
rier+0.225
 gust+0.221
lessly+0.218
Neg logit contributions
 hunt-0.154
 exhibit-0.143
house-0.130
 library-0.130
ci-0.130
Token and
Token ID290
Feature activation+2.84e+00

Pos logit contributions
lessly+0.091
ously+0.088
rier+0.085
der+0.083
ived+0.083
Neg logit contributions
 hunt-0.117
 park-0.109
 library-0.107
 hall-0.105
 exhibit-0.105
Token eventually
Token ID4191
Feature activation+1.46e+00

Pos logit contributions
lessly+0.075
ously+0.074
rier+0.070
ived+0.070
der+0.070
Neg logit contributions
 hunt-0.069
 park-0.063
 exhibit-0.062
 library-0.062
 hall-0.060
Token Jan
Token ID2365
Feature activation+2.89e+00

Pos logit contributions
lessly+0.039
ously+0.038
ived+0.037
der+0.036
rier+0.035
Neg logit contributions
 hunt-0.035
 exhibit-0.032
 library-0.031
 park-0.031
 hall-0.031
Tokenie
Token ID494
Feature activation+8.70e-01

Pos logit contributions
lessly+0.066
ously+0.065
ived+0.060
rier+0.060
der+0.060
Neg logit contributions
 hunt-0.082
 park-0.075
 library-0.074
 exhibit-0.073
 hall-0.073
Token
Token ID198
Feature activation+1.67e+00

Pos logit contributions
rier+0.046
lessly+0.045
ously+0.044
ived+0.044
der+0.043
Neg logit contributions
 hunt-0.023
 park-0.022
 hall-0.020
 library-0.020
 exhibit-0.019
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
rier+0.054
lessly+0.053
ously+0.052
der+0.051
ived+0.051
Neg logit contributions
 hunt-0.031
 park-0.029
 hall-0.027
 exhibit-0.026
 library-0.026
TokenB
Token ID33
Feature activation+4.55e+00

Pos logit contributions
rier+0.040
lessly+0.039
ously+0.038
der+0.037
ived+0.037
Neg logit contributions
 hunt-0.031
 park-0.031
 hall-0.029
 slide-0.028
 library-0.028
Tokenizzy
Token ID40593
Feature activation+3.00e+00

Pos logit contributions
rier+0.122
lessly+0.114
ously+0.113
der+0.112
ived+0.103
Neg logit contributions
 hunt-0.098
 exhibit-0.095
 hall-0.095
 park-0.090
 library-0.088
Token nodded
Token ID14464
Feature activation+4.77e+00

Pos logit contributions
lessly+0.074
ously+0.071
rier+0.071
ived+0.069
der+0.068
Neg logit contributions
 hunt-0.078
 park-0.073
 library-0.070
 exhibit-0.070
 hall-0.070
Token solemn
Token ID26322
Feature activation+8.44e+00

Pos logit contributions
lessly+0.163
ived+0.158
rier+0.156
der+0.151
uster+0.150
Neg logit contributions
 play-0.067
 hunt-0.062
 park-0.058
 visit-0.056
 read-0.055
Tokenly
Token ID306
Feature activation+4.78e+00

Pos logit contributions
lessly+0.157
 importantly+0.152
ived+0.150
ously+0.143
Days+0.140
Neg logit contributions
 play-0.193
 hunt-0.190
 show-0.175
 cover-0.172
 outside-0.171
Token.
Token ID13
Feature activation+6.36e-01

Pos logit contributions
lessly+0.157
rier+0.153
ously+0.145
ived+0.143
der+0.142
Neg logit contributions
 hunt-0.073
 play-0.063
 park-0.062
 store-0.060
 hide-0.058
Token She
Token ID1375
Feature activation+1.29e+00

Pos logit contributions
rier+0.018
ously+0.017
ived+0.017
lessly+0.017
iously+0.016
Neg logit contributions
 park-0.013
 hall-0.012
 hunt-0.012
 library-0.012
 store-0.011
Token knew
Token ID2993
Feature activation+3.10e+00

Pos logit contributions
lessly+0.036
rier+0.036
ously+0.035
ived+0.034
der+0.034
Neg logit contributions
 hunt-0.029
 park-0.027
 library-0.026
 exhibit-0.025
 hall-0.025
Token she
Token ID673
Feature activation-1.68e-01

Pos logit contributions
lessly+0.087
ously+0.085
rier+0.083
der+0.082
ived+0.080
Neg logit contributions
 hunt-0.072
 park-0.065
 exhibit-0.063
 library-0.063
 hall-0.062
Token,
Token ID11
Feature activation+3.07e+00

Pos logit contributions
 hunt+0.003
 library+0.003
 park+0.003
 exhibit+0.003
 hall+0.003
Neg logit contributions
lessly-0.004
ously-0.004
der-0.004
rier-0.004
ived-0.004
Token getting
Token ID1972
Feature activation+3.27e+00

Pos logit contributions
lessly+0.081
ously+0.079
rier+0.077
der+0.075
ived+0.075
Neg logit contributions
 hunt-0.075
 park-0.068
 exhibit-0.067
 library-0.067
 hall-0.066
Token ang
Token ID3550
Feature activation+8.04e+00

Pos logit contributions
ously+0.069
lessly+0.067
der+0.064
rier+0.064
ived+0.061
Neg logit contributions
 hunt-0.095
 park-0.091
 library-0.086
 hall-0.086
 exhibit-0.085
Tokenrier
Token ID5277
Feature activation+5.41e+00

Pos logit contributions
ously+0.067
lessly+0.061
ived+0.046
 pleased+0.036
uster+0.036
Neg logit contributions
 hunt-0.311
 park-0.296
 exhibit-0.289
 slide-0.288
 land-0.288
Token and
Token ID290
Feature activation+2.58e+00

Pos logit contributions
lessly+0.150
rier+0.145
ously+0.142
ived+0.141
der+0.135
Neg logit contributions
 hunt-0.119
 park-0.107
 exhibit-0.105
 hall-0.102
 library-0.101
Token ang
Token ID3550
Feature activation+8.43e+00

Pos logit contributions
lessly+0.045
rier+0.042
ously+0.042
ived+0.040
der+0.040
Neg logit contributions
 hunt-0.086
 park-0.080
 exhibit-0.080
 library-0.080
 hall-0.079
Tokenrier
Token ID5277
Feature activation+5.63e+00

Pos logit contributions
ously+0.070
lessly+0.059
ived+0.053
uster+0.038
 pleased+0.035
Neg logit contributions
 hunt-0.326
 park-0.310
 exhibit-0.305
 test-0.304
 slide-0.303
Token.
Token ID13
Feature activation+7.01e-01

Pos logit contributions
lessly+0.172
rier+0.168
ously+0.165
ived+0.163
der+0.158
Neg logit contributions
 hunt-0.106
 park-0.093
 exhibit-0.092
 hall-0.089
 library-0.087
Token The
Token ID383
Feature activation+4.85e-01

Pos logit contributions
lessly+0.017
ously+0.017
rier+0.016
ived+0.016
der+0.016
Neg logit contributions
 hunt-0.018
 park-0.017
 library-0.017
 exhibit-0.017
 hall-0.016
Token two
Token ID734
Feature activation+7.84e-01

Pos logit contributions
rier+0.012
lessly+0.012
ously+0.011
der+0.011
ived+0.011
Neg logit contributions
 hunt-0.013
 exhibit-0.012
 library-0.011
 hall-0.011
 park-0.011
Token friends
Token ID2460
Feature activation-4.23e-02

Pos logit contributions
lessly+0.021
ously+0.019
rier+0.019
der+0.019
ived+0.017
Neg logit contributions
 hunt-0.019
 park-0.018
 exhibit-0.017
 hall-0.017
 library-0.017
Token.
Token ID13
Feature activation-4.26e-01

Pos logit contributions
 hunt+0.063
 library+0.060
 exhibit+0.058
 park+0.056
 museum+0.055
Neg logit contributions
ously-0.051
lessly-0.050
der-0.047
rier-0.046
ived-0.045
Token The
Token ID383
Feature activation-2.62e-02

Pos logit contributions
 hunt+0.012
 park+0.011
 exhibit+0.011
 library+0.011
 hall+0.011
Neg logit contributions
lessly-0.009
ously-0.009
rier-0.009
der-0.009
ived-0.009
Token water
Token ID1660
Feature activation-8.42e-01

Pos logit contributions
 hunt+0.001
 exhibit+0.001
 library+0.001
 park+0.000
 hall+0.000
Neg logit contributions
lessly-0.001
rier-0.001
der-0.001
ously-0.001
ived-0.001
Token was
Token ID373
Feature activation+2.83e+00

Pos logit contributions
 hunt+0.018
 library+0.017
 exhibit+0.017
 park+0.016
 museum+0.016
Neg logit contributions
lessly-0.024
ously-0.024
ived-0.023
der-0.023
rier-0.023
Token so
Token ID523
Feature activation+3.49e+00

Pos logit contributions
lessly+0.076
ously+0.075
rier+0.073
ived+0.071
der+0.071
Neg logit contributions
 hunt-0.067
 park-0.061
 library-0.059
 exhibit-0.059
 hall-0.059
Token grim
Token ID18288
Feature activation+8.41e+00

Pos logit contributions
ously+0.097
rier+0.091
lessly+0.091
der+0.089
ived+0.088
Neg logit contributions
 hunt-0.079
 park-0.076
 hall-0.071
 theater-0.070
 library-0.069
Tokeny
Token ID88
Feature activation+3.61e+00

Pos logit contributions
 pleased+0.136
 importantly+0.130
ously+0.129
lessly+0.124
Of+0.120
Neg logit contributions
e-0.200
 hunt-0.195
 hide-0.192
 land-0.184
house-0.180
Token and
Token ID290
Feature activation+3.05e+00

Pos logit contributions
lessly+0.098
ously+0.096
rier+0.093
ived+0.092
der+0.090
Neg logit contributions
 hunt-0.083
 park-0.076
 exhibit-0.073
 library-0.072
 hall-0.071
Token gross
Token ID10319
Feature activation+3.81e+00

Pos logit contributions
lessly+0.079
ously+0.076
rier+0.076
ived+0.073
der+0.073
Neg logit contributions
 hunt-0.076
 park-0.070
 exhibit-0.069
 library-0.069
 hall-0.067
Token,
Token ID11
Feature activation+3.79e+00

Pos logit contributions
lessly+0.114
ously+0.112
rier+0.111
ived+0.109
der+0.103
Neg logit contributions
 hunt-0.071
 park-0.064
 hall-0.059
 exhibit-0.058
 slide-0.058
Token but
Token ID475
Feature activation+1.77e+00

Pos logit contributions
ously+0.096
lessly+0.096
ived+0.094
rier+0.093
der+0.090
Neg logit contributions
 hunt-0.091
 park-0.086
 hall-0.081
 library-0.080
 slide-0.079
Token sunrise
Token ID40048
Feature activation-2.17e+00

Pos logit contributions
rier+0.013
lessly+0.012
der+0.012
ously+0.011
ived+0.011
Neg logit contributions
 hunt-0.011
 exhibit-0.010
 hall-0.009
 library-0.009
|-0.009
Token!"
Token ID2474
Feature activation+6.58e-01

Pos logit contributions
 hunt+0.054
 library+0.051
 park+0.050
 hall+0.049
 exhibit+0.049
Neg logit contributions
rier-0.052
lessly-0.052
der-0.049
ived-0.048
ously-0.046
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
lessly+0.018
ously+0.017
rier+0.017
ived+0.016
der+0.016
Neg logit contributions
 hunt-0.016
 park-0.014
 library-0.014
 exhibit-0.014
 hall-0.014
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
lessly+0.038
ously+0.038
rier+0.037
ived+0.036
der+0.036
Neg logit contributions
 hunt-0.030
 park-0.027
 library-0.026
 exhibit-0.026
 hall-0.026
TokenSam
Token ID16305
Feature activation+1.42e+00

Pos logit contributions
lessly+0.033
rier+0.033
ously+0.033
der+0.032
ived+0.031
Neg logit contributions
 hunt-0.030
 park-0.028
 library-0.027
 exhibit-0.027
 hall-0.026
Token gro
Token ID7128
Feature activation+8.40e+00

Pos logit contributions
ously+0.036
lessly+0.036
ived+0.035
der+0.034
rier+0.033
Neg logit contributions
 hunt-0.035
 park-0.032
 library-0.032
 exhibit-0.031
 hall-0.031
Tokenaned
Token ID22739
Feature activation+4.69e+00

Pos logit contributions
ously+0.137
lessly+0.129
der+0.125
ived+0.119
rier+0.119
Neg logit contributions
 hunt-0.254
e-0.238
 exhibit-0.235
nob-0.231
house-0.224
Token and
Token ID290
Feature activation+2.44e+00

Pos logit contributions
rier+0.130
lessly+0.127
ived+0.124
ously+0.122
der+0.117
Neg logit contributions
 hunt-0.099
 park-0.095
 play-0.090
 hall-0.088
 hide-0.086
Token pulled
Token ID5954
Feature activation+2.00e+00

Pos logit contributions
lessly+0.064
ously+0.063
rier+0.061
der+0.060
ived+0.060
Neg logit contributions
 hunt-0.060
 park-0.055
 exhibit-0.054
 library-0.054
 hall-0.053
Token his
Token ID465
Feature activation+1.97e-01

Pos logit contributions
der+0.062
rier+0.062
ously+0.062
lessly+0.061
ived+0.059
Neg logit contributions
 hunt-0.036
 park-0.034
 hide-0.032
 play-0.032
 exhibit-0.031
Token blanket
Token ID18447
Feature activation-1.85e-01

Pos logit contributions
lessly+0.005
rier+0.005
ously+0.005
ived+0.004
der+0.004
Neg logit contributions
 hunt-0.005
 park-0.005
 library-0.005
 exhibit-0.005
 hall-0.005
Tokenâ
Token ID22940
Feature activation+1.71e+00

Pos logit contributions
 hunt+0.005
 library+0.004
 exhibit+0.004
 park+0.004
 museum+0.004
Neg logit contributions
lessly-0.005
der-0.005
rier-0.005
ously-0.005
ived-0.005
Token
Token ID26391
Feature activation+9.28e-01

Pos logit contributions
lessly+0.050
ously+0.047
rier+0.046
ived+0.046
der+0.045
Neg logit contributions
 hunt-0.036
 exhibit-0.033
 park-0.032
 library-0.032
 hall-0.032
Token
Token ID8151
Feature activation-6.73e-01

Pos logit contributions
lessly+0.017
ived+0.016
ously+0.016
der+0.015
rier+0.015
Neg logit contributions
 hunt-0.029
 park-0.027
 library-0.027
 exhibit-0.026
 hall-0.026
Tokent
Token ID83
Feature activation+2.27e-01

Pos logit contributions
 hunt+0.019
 park+0.018
 exhibit+0.018
 library+0.018
 hall+0.018
Neg logit contributions
lessly-0.014
ously-0.014
rier-0.014
der-0.013
ived-0.013
Token get
Token ID651
Feature activation+1.85e+00

Pos logit contributions
lessly+0.007
ously+0.007
rier+0.007
der+0.007
ived+0.006
Neg logit contributions
 hunt-0.005
 park-0.004
 library-0.004
 exhibit-0.004
 hall-0.004
Token ang
Token ID3550
Feature activation+8.38e+00

Pos logit contributions
lessly+0.050
ously+0.048
rier+0.048
ived+0.047
der+0.046
Neg logit contributions
 hunt-0.044
 park-0.040
 library-0.040
 exhibit-0.040
 hall-0.039
Tokenrier
Token ID5277
Feature activation+5.39e+00

Pos logit contributions
ously+0.082
lessly+0.062
ived+0.060
itted+0.047
atted+0.044
Neg logit contributions
 hunt-0.313
 test-0.303
 store-0.301
 slide-0.298
 park-0.298
Token.
Token ID13
Feature activation-4.57e-01

Pos logit contributions
lessly+0.160
ously+0.155
ived+0.153
rier+0.151
der+0.146
Neg logit contributions
 hunt-0.107
 park-0.096
 exhibit-0.092
 library-0.091
 hall-0.090
Tokenâ
Token ID22940
Feature activation+3.50e+00

Pos logit contributions
 hunt+0.012
 park+0.011
 library+0.011
 exhibit+0.011
 hall+0.010
Neg logit contributions
lessly-0.011
ously-0.011
rier-0.011
ived-0.011
der-0.011
Token
Token ID26391
Feature activation+2.37e+00

Pos logit contributions
lessly+0.100
ously+0.095
rier+0.093
der+0.090
ived+0.089
Neg logit contributions
 hunt-0.080
 exhibit-0.071
 park-0.071
 library-0.070
 hall-0.069
Token
Token ID198
Feature activation+5.78e-02

Pos logit contributions
lessly+0.062
ously+0.061
ived+0.058
rier+0.058
der+0.057
Neg logit contributions
 hunt-0.057
 park-0.051
 library-0.050
 exhibit-0.050
 hall-0.049
Token
Token ID198
Feature activation+2.24e+00

Pos logit contributions
rier+0.046
lessly+0.046
ously+0.045
ived+0.045
der+0.043
Neg logit contributions
 hunt-0.022
 park-0.022
 library-0.019
 hall-0.019
 store-0.018
Token
Token ID198
Feature activation+1.94e+00

Pos logit contributions
lessly+0.069
ously+0.068
rier+0.068
ived+0.066
der+0.066
Neg logit contributions
 hunt-0.044
 park-0.041
 library-0.038
 hall-0.038
 exhibit-0.038
TokenTom
Token ID13787
Feature activation+2.28e+00

Pos logit contributions
lessly+0.052
ously+0.051
rier+0.051
ived+0.050
der+0.049
Neg logit contributions
 hunt-0.046
 park-0.044
 library-0.042
 hall-0.041
 exhibit-0.040
Token and
Token ID290
Feature activation+2.64e+00

Pos logit contributions
lessly+0.056
rier+0.056
ously+0.054
der+0.053
ived+0.052
Neg logit contributions
 hunt-0.059
 park-0.056
 exhibit-0.053
 hall-0.053
 library-0.053
Token Anna
Token ID11735
Feature activation+2.12e+00

Pos logit contributions
lessly+0.083
ously+0.081
rier+0.080
ived+0.078
der+0.078
Neg logit contributions
 hunt-0.052
 park-0.047
 library-0.045
 hall-0.044
 exhibit-0.044
Token gro
Token ID7128
Feature activation+8.36e+00

Pos logit contributions
rier+0.061
lessly+0.061
der+0.058
ously+0.057
ived+0.055
Neg logit contributions
 hunt-0.047
 park-0.043
 hall-0.041
 exhibit-0.040
 slide-0.040
Tokenaned
Token ID22739
Feature activation+5.09e+00

Pos logit contributions
ously+0.131
lessly+0.126
der+0.119
rier+0.116
ived+0.114
Neg logit contributions
 hunt-0.268
 exhibit-0.248
e-0.243
nob-0.238
 hall-0.236
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
rier+0.170
lessly+0.166
ously+0.164
ived+0.163
der+0.156
Neg logit contributions
 hunt-0.081
 park-0.078
 play-0.072
 hall-0.070
 hide-0.067
Token They
Token ID1119
Feature activation+3.33e-01

Pos logit contributions
lessly+0.011
ously+0.011
rier+0.011
ived+0.011
der+0.010
Neg logit contributions
 hunt-0.011
 park-0.011
 library-0.011
 hall-0.011
 exhibit-0.010
Token did
Token ID750
Feature activation+2.49e+00

Pos logit contributions
lessly+0.009
rier+0.009
ously+0.009
ived+0.009
der+0.009
Neg logit contributions
 hunt-0.008
 park-0.007
 library-0.007
 hall-0.007
 exhibit-0.007
Token not
Token ID407
Feature activation+1.26e+00

Pos logit contributions
lessly+0.057
ously+0.055
der+0.053
rier+0.053
ived+0.052
Neg logit contributions
 hunt-0.068
 park-0.063
 exhibit-0.062
 library-0.062
 hall-0.061
Token Sara
Token ID24799
Feature activation+5.66e-01

Pos logit contributions
 hunt+0.034
 exhibit+0.031
 library+0.030
 hall+0.030
 park+0.028
Neg logit contributions
rier-0.041
lessly-0.040
der-0.039
ously-0.039
ived-0.037
Token and
Token ID290
Feature activation+2.95e+00

Pos logit contributions
lessly+0.015
ously+0.015
rier+0.014
der+0.014
ived+0.014
Neg logit contributions
 hunt-0.014
 park-0.013
 library-0.012
 hall-0.012
 exhibit-0.012
Token Ben
Token ID3932
Feature activation+9.97e-01

Pos logit contributions
der+0.076
ously+0.075
lessly+0.073
rier+0.071
ived+0.070
Neg logit contributions
 hunt-0.069
 exhibit-0.064
 hall-0.064
 park-0.062
 library-0.062
Token.
Token ID13
Feature activation+9.55e-01

Pos logit contributions
ously+0.025
lessly+0.025
ived+0.024
rier+0.023
der+0.023
Neg logit contributions
 hunt-0.025
 library-0.024
 park-0.023
 hall-0.023
 exhibit-0.022
Token He
Token ID679
Feature activation+2.14e-01

Pos logit contributions
lessly+0.024
ously+0.024
rier+0.023
ived+0.023
iously+0.023
Neg logit contributions
 park-0.023
 hunt-0.022
 library-0.021
 store-0.021
 hall-0.020
Token apolog
Token ID8453
Feature activation+8.32e+00

Pos logit contributions
der+0.006
lessly+0.006
ously+0.005
ived+0.005
rier+0.005
Neg logit contributions
 hunt-0.005
 hall-0.004
 exhibit-0.004
 park-0.004
 library-0.004
Tokenizes
Token ID4340
Feature activation+3.32e+00

Pos logit contributions
ived+0.214
ously+0.213
lessly+0.212
der+0.199
rier+0.197
Neg logit contributions
 hunt-0.191
 park-0.180
 exhibit-0.168
 slide-0.165
 theater-0.163
Token again
Token ID757
Feature activation+1.60e+00

Pos logit contributions
lessly+0.099
rier+0.098
ously+0.096
ived+0.095
der+0.092
Neg logit contributions
 hunt-0.068
 park-0.062
 library-0.059
 exhibit-0.058
 hall-0.058
Token.
Token ID13
Feature activation+7.89e-01

Pos logit contributions
lessly+0.041
ously+0.040
der+0.039
rier+0.038
ived+0.038
Neg logit contributions
 hunt-0.040
 park-0.037
 library-0.036
 exhibit-0.036
 hall-0.036
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
ously+0.023
rier+0.023
lessly+0.023
ived+0.022
iously+0.022
Neg logit contributions
 park-0.016
 hunt-0.016
 library-0.015
 hall-0.014
 store-0.014
Token
Token ID198
Feature activation+1.57e+00

Pos logit contributions
lessly+0.045
ously+0.045
rier+0.045
der+0.044
ived+0.043
Neg logit contributions
 hunt-0.028
 park-0.027
 library-0.025
 exhibit-0.025
 hall-0.024
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
 hunt+0.093
 library+0.090
 exhibit+0.086
 hall+0.085
 park+0.083
Neg logit contributions
lessly-0.118
rier-0.113
der-0.110
ously-0.105
ived-0.099
Token creek
Token ID39459
Feature activation-5.25e+00

Pos logit contributions
 hunt+0.029
 exhibit+0.027
 library+0.027
c+0.025
 hall+0.024
Neg logit contributions
lessly-0.044
rier-0.043
der-0.042
ived-0.039
ously-0.039
Token?"
Token ID1701
Feature activation+5.79e-01

Pos logit contributions
 hunt+0.097
 library+0.095
 exhibit+0.092
 zoo+0.089
 park+0.087
Neg logit contributions
lessly-0.152
rier-0.147
der-0.141
ously-0.132
ived-0.131
Token asked
Token ID1965
Feature activation+1.56e+00

Pos logit contributions
ously+0.014
lessly+0.014
rier+0.014
der+0.013
ived+0.013
Neg logit contributions
 hunt-0.015
 park-0.013
 library-0.013
 exhibit-0.013
 hall-0.013
Token Peter
Token ID5613
Feature activation+1.04e+00

Pos logit contributions
lessly+0.043
ously+0.042
rier+0.042
ived+0.041
der+0.041
Neg logit contributions
 hunt-0.035
 park-0.032
 exhibit-0.031
 library-0.031
 hall-0.031
Token enthusi
Token ID11273
Feature activation+8.30e+00

Pos logit contributions
lessly+0.032
ously+0.031
rier+0.031
ived+0.031
der+0.031
Neg logit contributions
 hunt-0.020
 park-0.018
 exhibit-0.017
 library-0.017
 hall-0.017
Tokenatically
Token ID4142
Feature activation+1.91e+00

Pos logit contributions
lessly+0.222
rier+0.207
der+0.203
ously+0.174
ived+0.170
Neg logit contributions
 hunt-0.203
 exhibit-0.194
e-0.189
house-0.174
c-0.171
Token.
Token ID13
Feature activation+5.28e-02

Pos logit contributions
lessly+0.056
ously+0.055
rier+0.054
ived+0.053
der+0.053
Neg logit contributions
 hunt-0.041
 park-0.037
 library-0.036
 exhibit-0.036
 hall-0.035
Token
Token ID198
Feature activation-1.99e-01

Pos logit contributions
lessly+0.002
rier+0.002
ously+0.002
der+0.002
ived+0.002
Neg logit contributions
 hunt-0.001
 park-0.001
 hall-0.001
 exhibit-0.001
 library-0.001
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
 hunt+0.004
 park+0.003
 exhibit+0.003
 hall+0.003
 library+0.003
Neg logit contributions
lessly-0.006
rier-0.006
ously-0.006
der-0.006
ived-0.006
Token"
Token ID1
Feature activation-3.28e+00

Pos logit contributions
lessly+0.005
rier+0.005
der+0.005
ously+0.005
ived+0.005
Neg logit contributions
 hunt-0.006
 park-0.006
 exhibit-0.006
 hall-0.006
 play-0.005
Token he
Token ID339
Feature activation+3.62e-01

Pos logit contributions
lessly+0.033
ously+0.032
rier+0.031
der+0.031
ived+0.030
Neg logit contributions
 hunt-0.038
 park-0.035
 library-0.035
 exhibit-0.034
 hall-0.034
Token felt
Token ID2936
Feature activation+4.52e+00

Pos logit contributions
lessly+0.009
ously+0.009
rier+0.009
ived+0.009
der+0.009
Neg logit contributions
 hunt-0.009
 park-0.008
 library-0.008
 exhibit-0.008
 hall-0.008
Token a
Token ID257
Feature activation+3.43e+00

Pos logit contributions
ously+0.107
ived+0.102
lessly+0.102
rier+0.101
der+0.096
Neg logit contributions
 hunt-0.120
 park-0.116
 library-0.107
 play-0.106
 store-0.105
Token bit
Token ID1643
Feature activation+6.12e+00

Pos logit contributions
ously+0.099
ived+0.097
lessly+0.094
rier+0.093
atted+0.092
Neg logit contributions
 hunt-0.069
 test-0.063
 rest-0.062
 park-0.061
 play-0.059
Token w
Token ID266
Feature activation+5.53e+00

Pos logit contributions
ously+0.167
iously+0.153
edly+0.150
ived+0.148
rier+0.145
Neg logit contributions
 play-0.119
 park-0.119
 hunt-0.118
 store-0.116
 hide-0.113
Tokenobb
Token ID21963
Feature activation+8.24e+00

Pos logit contributions
ously+0.110
rier+0.108
lessly+0.105
ived+0.102
der+0.095
Neg logit contributions
 hunt-0.147
 park-0.143
 slide-0.133
 test-0.132
 exhibit-0.132
Tokenly
Token ID306
Feature activation+2.73e+00

Pos logit contributions
ously+0.101
ived+0.087
lessly+0.084
rier+0.079
der+0.070
Neg logit contributions
 hunt-0.302
 park-0.271
 exhibit-0.265
 slide-0.265
 play-0.264
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
lessly+0.074
ously+0.073
rier+0.070
der+0.070
ived+0.068
Neg logit contributions
 hunt-0.064
 park-0.059
 exhibit-0.058
 library-0.058
 hall-0.057
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
lessly+0.007
ously+0.007
rier+0.007
ived+0.007
der+0.006
Neg logit contributions
 hunt-0.010
 park-0.010
 library-0.009
 exhibit-0.009
 hall-0.009
Token
Token ID198
Feature activation+6.89e-01

Pos logit contributions
lessly+0.005
rier+0.005
ously+0.005
der+0.005
ived+0.005
Neg logit contributions
 hunt-0.003
 park-0.003
 exhibit-0.003
 library-0.002
 hall-0.002
Token"
Token ID1
Feature activation-3.18e+00

Pos logit contributions
rier+0.015
ously+0.015
lessly+0.015
der+0.014
ived+0.013
Neg logit contributions
 hunt-0.020
 park-0.019
 exhibit-0.019
 hall-0.018
 library-0.018
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
lessly+0.026
ously+0.026
rier+0.026
der+0.026
ived+0.024
Neg logit contributions
 hunt-0.027
 exhibit-0.024
 library-0.024
 hall-0.024
 park-0.024
Token They
Token ID1119
Feature activation-2.74e+00

Pos logit contributions
 hunt+0.041
 park+0.039
 library+0.038
 exhibit+0.038
 hall+0.037
Neg logit contributions
lessly-0.037
ously-0.037
rier-0.036
ived-0.034
der-0.034
Token like
Token ID588
Feature activation-5.12e+00

Pos logit contributions
 hunt+0.065
 exhibit+0.061
 library+0.058
 hall+0.058
 park+0.058
Neg logit contributions
lessly-0.073
ously-0.070
ived-0.066
rier-0.066
der-0.065
Token to
Token ID284
Feature activation-6.64e+00

Pos logit contributions
 hunt+0.142
 exhibit+0.135
 hall+0.130
 park+0.128
 library+0.126
Neg logit contributions
ously-0.103
lessly-0.102
ived-0.100
der-0.098
rier-0.091
Token play
Token ID711
Feature activation-3.15e+00

Pos logit contributions
 exhibit+0.077
 hunt+0.074
 library+0.071
rob+0.070
 hall+0.065
Neg logit contributions
ously-0.219
ived-0.217
lessly-0.209
um-0.204
der-0.200
Token at
Token ID379
Feature activation-8.71e+00

Pos logit contributions
 library+0.082
 exhibit+0.082
 hunt+0.081
|+0.076
 hall+0.076
Neg logit contributions
der-0.059
lessly-0.054
ously-0.054
rier-0.054
ived-0.052
Token the
Token ID262
Feature activation-4.37e+00

Pos logit contributions
 hunt+0.198
 exhibit+0.186
house+0.182
nob+0.182
asis+0.182
Neg logit contributions
ived-0.126
lessly-0.125
 grateful-0.122
 pleased-0.120
 knowing-0.119
Token beach
Token ID10481
Feature activation-3.11e+00

Pos logit contributions
nob+0.060
e+0.058
|+0.055
 hunt+0.052
c+0.048
Neg logit contributions
um-0.119
der-0.114
rier-0.113
 louder-0.107
ived-0.105
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
 hunt+0.077
 library+0.073
|+0.072
 park+0.071
 exhibit+0.070
Neg logit contributions
der-0.064
rier-0.063
lessly-0.062
ously-0.059
ived-0.058
Token They
Token ID1119
Feature activation-2.27e+00

Pos logit contributions
 hunt+0.045
 park+0.041
 exhibit+0.040
 library+0.040
 hall+0.040
Neg logit contributions
lessly-0.039
ously-0.038
rier-0.037
der-0.037
ived-0.037
Token make
Token ID787
Feature activation-2.26e+00

Pos logit contributions
 hunt+0.042
 exhibit+0.041
 library+0.039
 hall+0.038
 park+0.038
Neg logit contributions
lessly-0.067
ously-0.064
ived-0.062
rier-0.059
der-0.059
Token Lily
Token ID20037
Feature activation-9.12e-01

Pos logit contributions
 hunt+0.067
 exhibit+0.060
 park+0.058
rob+0.057
 theater+0.056
Neg logit contributions
lessly-0.055
ived-0.052
ously-0.050
der-0.049
rier-0.048
Token who
Token ID508
Feature activation-3.42e+00

Pos logit contributions
 hunt+0.023
 exhibit+0.022
 library+0.021
 theater+0.021
 park+0.021
Neg logit contributions
ously-0.022
lessly-0.021
der-0.020
ived-0.018
rier-0.017
Token loved
Token ID6151
Feature activation-2.93e+00

Pos logit contributions
 hunt+0.099
 exhibit+0.096
 theater+0.088
 park+0.088
 zoo+0.087
Neg logit contributions
lessly-0.067
ously-0.061
der-0.052
 pleased-0.052
 widely-0.050
Token to
Token ID284
Feature activation-6.25e+00

Pos logit contributions
 hunt+0.080
 exhibit+0.075
house+0.070
 hall+0.070
 park+0.069
Neg logit contributions
der-0.058
ously-0.058
lessly-0.058
ived-0.057
rier-0.054
Token play
Token ID711
Feature activation-3.75e+00

Pos logit contributions
 exhibit+0.080
rob+0.074
 hunt+0.073
 library+0.067
 basement+0.066
Neg logit contributions
ously-0.192
ived-0.190
lessly-0.187
der-0.185
 expression-0.179
Token at
Token ID379
Feature activation-8.69e+00

Pos logit contributions
 library+0.079
 exhibit+0.078
 hunt+0.075
 museum+0.070
 hall+0.070
Neg logit contributions
der-0.086
rier-0.083
ived-0.080
lessly-0.079
ously-0.077
Token the
Token ID262
Feature activation-4.34e+00

Pos logit contributions
 hunt+0.182
 exhibit+0.166
house+0.166
illas+0.163
nob+0.156
Neg logit contributions
lessly-0.140
ived-0.139
der-0.139
rier-0.133
 grateful-0.128
Token beach
Token ID10481
Feature activation-3.72e+00

Pos logit contributions
 hunt+0.056
nob+0.055
e+0.053
|+0.052
c+0.051
Neg logit contributions
der-0.120
um-0.118
rier-0.118
ived-0.113
lessly-0.107
Token.
Token ID13
Feature activation-9.78e-01

Pos logit contributions
 hunt+0.088
 library+0.086
 park+0.083
 exhibit+0.081
 basement+0.079
Neg logit contributions
lessly-0.081
rier-0.080
der-0.079
ived-0.077
ously-0.077
Token One
Token ID1881
Feature activation-3.82e+00

Pos logit contributions
 hunt+0.029
 exhibit+0.026
 park+0.026
 hall+0.025
 library+0.025
Neg logit contributions
lessly-0.021
ously-0.020
der-0.019
rier-0.019
ived-0.019
Token day
Token ID1110
Feature activation+4.07e-01

Pos logit contributions
 hunt+0.062
 park+0.055
 library+0.053
 exhibit+0.052
 hall+0.051
Neg logit contributions
lessly-0.131
ously-0.130
rier-0.128
der-0.125
ived-0.125
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
lessly+0.003
ously+0.003
rier+0.003
der+0.003
ived+0.003
Neg logit contributions
 hunt-0.003
 library-0.002
 park-0.002
 exhibit-0.002
 hall-0.002
Token They
Token ID1119
Feature activation-2.41e+00

Pos logit contributions
 hunt+0.053
 exhibit+0.048
 park+0.048
 library+0.048
 hall+0.047
Neg logit contributions
lessly-0.033
ously-0.031
der-0.030
ived-0.030
rier-0.029
Token like
Token ID588
Feature activation-5.08e+00

Pos logit contributions
 hunt+0.058
 exhibit+0.056
 hall+0.053
 library+0.052
 park+0.052
Neg logit contributions
lessly-0.061
ously-0.058
ived-0.056
der-0.055
rier-0.054
Token to
Token ID284
Feature activation-6.27e+00

Pos logit contributions
 hunt+0.141
 exhibit+0.138
 hall+0.134
house+0.129
 library+0.129
Neg logit contributions
ously-0.096
ived-0.095
lessly-0.094
der-0.093
rier-0.085
Token play
Token ID711
Feature activation-2.93e+00

Pos logit contributions
 exhibit+0.079
rob+0.074
 library+0.072
 basement+0.071
 hunt+0.070
Neg logit contributions
ously-0.187
ived-0.185
um-0.175
lessly-0.173
der-0.172
Token at
Token ID379
Feature activation-8.64e+00

Pos logit contributions
 library+0.076
 exhibit+0.076
 hunt+0.075
 hall+0.072
|+0.071
Neg logit contributions
der-0.057
rier-0.051
lessly-0.051
ived-0.049
ously-0.049
Token the
Token ID262
Feature activation-4.16e+00

Pos logit contributions
 hunt+0.183
house+0.172
 exhibit+0.171
nob+0.169
asis+0.163
Neg logit contributions
lessly-0.142
ived-0.140
der-0.138
lee-0.129
 pleased-0.128
Token shore
Token ID15191
Feature activation-2.32e+00

Pos logit contributions
nob+0.059
e+0.057
|+0.053
 hunt+0.049
house+0.047
Neg logit contributions
um-0.114
der-0.110
rier-0.105
 int-0.099
 louder-0.099
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
 hunt+0.059
 library+0.054
 hall+0.052
 exhibit+0.052
 park+0.052
Neg logit contributions
der-0.054
lessly-0.053
rier-0.052
ously-0.050
ived-0.050
Token They
Token ID1119
Feature activation-1.93e+00

Pos logit contributions
 hunt+0.050
 library+0.044
 exhibit+0.044
 hall+0.044
 park+0.043
Neg logit contributions
lessly-0.034
ously-0.031
der-0.031
ived-0.031
rier-0.028
Token see
Token ID766
Feature activation-2.37e+00

Pos logit contributions
 exhibit+0.041
 hunt+0.040
 library+0.038
 hall+0.037
house+0.036
Neg logit contributions
lessly-0.051
ously-0.048
ived-0.046
der-0.044
rier-0.043
Token day
Token ID1110
Feature activation+1.92e-01

Pos logit contributions
 hunt+0.104
rob+0.088
 library+0.088
 exhibit+0.087
 hall+0.085
Neg logit contributions
lessly-0.115
ived-0.108
ously-0.107
 pleased-0.102
um-0.097
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
lessly+0.005
rier+0.005
ously+0.005
der+0.005
ived+0.005
Neg logit contributions
 hunt-0.004
 park-0.004
 exhibit-0.004
 hall-0.004
 theater-0.004
Token Amy
Token ID14235
Feature activation-1.71e+00

Pos logit contributions
 hunt+0.036
 park+0.032
 exhibit+0.030
 library+0.030
 hall+0.030
Neg logit contributions
lessly-0.070
ously-0.069
rier-0.068
der-0.067
ived-0.066
Token was
Token ID373
Feature activation+5.97e-01

Pos logit contributions
 hunt+0.050
 exhibit+0.047
 theater+0.047
 museum+0.046
 library+0.046
Neg logit contributions
ously-0.031
ived-0.026
lessly-0.026
ly-0.024
der-0.024
Token playing
Token ID2712
Feature activation-3.36e+00

Pos logit contributions
lessly+0.018
ously+0.018
ived+0.017
rier+0.017
der+0.017
Neg logit contributions
 hunt-0.012
 exhibit-0.010
 park-0.010
 library-0.010
 hall-0.010
Token at
Token ID379
Feature activation-8.63e+00

Pos logit contributions
 exhibit+0.081
 library+0.081
 hunt+0.079
 museum+0.073
asis+0.073
Neg logit contributions
ously-0.067
der-0.066
lessly-0.066
ived-0.064
 louder-0.063
Token the
Token ID262
Feature activation-4.16e+00

Pos logit contributions
 hunt+0.215
 exhibit+0.206
asis+0.200
house+0.183
uch+0.180
Neg logit contributions
lessly-0.128
ived-0.127
ously-0.123
 knowing-0.118
der-0.116
Token park
Token ID3952
Feature activation-2.84e+00

Pos logit contributions
 hunt+0.044
e+0.042
illas+0.042
nob+0.040
|+0.040
Neg logit contributions
der-0.117
um-0.112
lessly-0.111
ived-0.111
rier-0.105
Token with
Token ID351
Feature activation-3.35e+00

Pos logit contributions
 hunt+0.069
 library+0.062
 exhibit+0.061
 park+0.060
 theater+0.059
Neg logit contributions
lessly-0.070
ously-0.069
ived-0.067
der-0.066
rier-0.063
Token her
Token ID607
Feature activation-2.02e+00

Pos logit contributions
 hunt+0.086
 exhibit+0.081
 library+0.077
 theater+0.075
 hall+0.075
Neg logit contributions
ously-0.079
lessly-0.079
ived-0.077
der-0.072
rier-0.070
Token friends
Token ID2460
Feature activation-1.30e+00

Pos logit contributions
 hunt+0.053
 park+0.049
 library+0.048
 exhibit+0.048
 hall+0.047
Neg logit contributions
lessly-0.050
ously-0.049
rier-0.047
der-0.046
ived-0.046
Token.
Token ID13
Feature activation-1.52e+00

Pos logit contributions
lessly+0.002
ously+0.001
rier+0.001
ived+0.001
der+0.001
Neg logit contributions
 hunt-0.001
 park-0.001
 library-0.001
 exhibit-0.001
 hall-0.001
Token They
Token ID1119
Feature activation-2.62e+00

Pos logit contributions
 hunt+0.042
 park+0.040
 library+0.039
 exhibit+0.039
 hall+0.038
Neg logit contributions
lessly-0.035
ously-0.034
rier-0.034
ived-0.032
der-0.032
Token like
Token ID588
Feature activation-4.92e+00

Pos logit contributions
 hunt+0.063
 exhibit+0.059
 library+0.057
 hall+0.056
 park+0.056
Neg logit contributions
lessly-0.068
ously-0.066
ived-0.062
der-0.062
rier-0.061
Token to
Token ID284
Feature activation-6.26e+00

Pos logit contributions
 hunt+0.138
 exhibit+0.134
 hall+0.130
house+0.126
nob+0.125
Neg logit contributions
ously-0.092
ived-0.090
lessly-0.090
der-0.087
rier-0.078
Token play
Token ID711
Feature activation-2.96e+00

Pos logit contributions
 exhibit+0.071
rob+0.070
 hunt+0.068
 library+0.066
 basement+0.064
Neg logit contributions
ously-0.198
ived-0.195
lessly-0.185
um-0.183
der-0.180
Token at
Token ID379
Feature activation-8.61e+00

Pos logit contributions
 library+0.078
 exhibit+0.076
 hunt+0.075
 hall+0.074
|+0.070
Neg logit contributions
der-0.057
ously-0.050
lessly-0.050
ived-0.050
rier-0.048
Token the
Token ID262
Feature activation-4.37e+00

Pos logit contributions
 hunt+0.186
nob+0.177
house+0.174
 exhibit+0.170
asis+0.170
Neg logit contributions
ived-0.129
lessly-0.125
der-0.123
lee-0.123
 knowing-0.118
Token park
Token ID3952
Feature activation-2.95e+00

Pos logit contributions
e+0.062
nob+0.062
|+0.057
 hunt+0.054
c+0.049
Neg logit contributions
um-0.117
der-0.115
rier-0.110
 widely-0.103
 �-0.103
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
 hunt+0.078
 library+0.071
 hall+0.070
 exhibit+0.069
 test+0.069
Neg logit contributions
der-0.061
lessly-0.057
ously-0.055
ived-0.055
rier-0.052
Token One
Token ID1881
Feature activation-4.07e+00

Pos logit contributions
 hunt+0.043
 exhibit+0.039
 park+0.038
 hall+0.038
 library+0.038
Neg logit contributions
lessly-0.033
ously-0.031
der-0.031
ived-0.031
rier-0.029
Token day
Token ID1110
Feature activation+2.11e-01

Pos logit contributions
 hunt+0.082
 exhibit+0.070
 library+0.068
 hall+0.066
 park+0.065
Neg logit contributions
lessly-0.122
ously-0.117
ived-0.117
der-0.112
rier-0.111
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
 hunt+0.007
 park+0.006
 library+0.006
 exhibit+0.006
 hall+0.006
Neg logit contributions
lessly-0.011
ously-0.011
rier-0.011
ived-0.011
der-0.010
Token They
Token ID1119
Feature activation-2.93e+00

Pos logit contributions
 hunt+0.050
 park+0.046
 exhibit+0.045
 library+0.045
 hall+0.045
Neg logit contributions
lessly-0.035
ously-0.034
rier-0.032
ived-0.032
der-0.032
Token like
Token ID588
Feature activation-5.01e+00

Pos logit contributions
 hunt+0.069
 exhibit+0.066
 hall+0.062
 library+0.061
 park+0.060
Neg logit contributions
lessly-0.077
ously-0.073
ived-0.070
der-0.068
rier-0.067
Token to
Token ID284
Feature activation-6.41e+00

Pos logit contributions
 hunt+0.138
 exhibit+0.134
 hall+0.129
house+0.126
 park+0.124
Neg logit contributions
ously-0.098
ived-0.096
lessly-0.096
der-0.094
rier-0.087
Token play
Token ID711
Feature activation-3.23e+00

Pos logit contributions
 exhibit+0.079
rob+0.075
 library+0.072
 hunt+0.072
 basement+0.071
Neg logit contributions
ived-0.191
ously-0.187
um-0.178
lessly-0.173
 widely-0.172
Token at
Token ID379
Feature activation-8.60e+00

Pos logit contributions
 library+0.084
 exhibit+0.083
 hunt+0.082
 hall+0.081
|+0.075
Neg logit contributions
der-0.064
rier-0.057
ously-0.056
lessly-0.056
ived-0.055
Token school
Token ID1524
Feature activation-3.27e+00

Pos logit contributions
 hunt+0.189
house+0.181
nob+0.181
 exhibit+0.175
asis+0.172
Neg logit contributions
ived-0.136
der-0.127
lessly-0.126
lee-0.122
 grateful-0.121
Token.
Token ID13
Feature activation-1.80e+00

Pos logit contributions
 hunt+0.086
 exhibit+0.076
 gra+0.076
 park+0.075
|+0.072
Neg logit contributions
der-0.073
lessly-0.072
ously-0.066
ived-0.064
rier-0.063
Token But
Token ID887
Feature activation-1.82e-01

Pos logit contributions
 hunt+0.053
 exhibit+0.047
 park+0.046
 hall+0.046
 library+0.045
Neg logit contributions
lessly-0.035
der-0.035
ously-0.034
ived-0.033
rier-0.032
Token today
Token ID1909
Feature activation-2.88e+00

Pos logit contributions
 hunt+0.005
 exhibit+0.004
 park+0.004
 hall+0.004
 library+0.004
Neg logit contributions
lessly-0.004
der-0.004
ously-0.004
rier-0.004
ived-0.004
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
 hunt+0.067
nob+0.063
rob+0.058
 exhibit+0.058
 gra+0.057
Neg logit contributions
ived-0.064
der-0.059
ously-0.059
um-0.059
rier-0.059
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
lessly+0.004
ously+0.004
rier+0.004
ived+0.004
der+0.004
Neg logit contributions
 hunt-0.003
 park-0.002
 library-0.002
 exhibit-0.002
 hall-0.002
Token They
Token ID1119
Feature activation-2.50e+00

Pos logit contributions
 hunt+0.043
 park+0.040
 library+0.039
 exhibit+0.039
 hall+0.038
Neg logit contributions
lessly-0.036
ously-0.035
rier-0.034
ived-0.033
der-0.033
Token like
Token ID588
Feature activation-4.85e+00

Pos logit contributions
 hunt+0.062
 exhibit+0.059
 library+0.056
 hall+0.056
 park+0.055
Neg logit contributions
lessly-0.063
ously-0.060
ived-0.057
der-0.056
rier-0.056
Token to
Token ID284
Feature activation-6.15e+00

Pos logit contributions
 hunt+0.136
 exhibit+0.132
 hall+0.126
 library+0.123
 park+0.122
Neg logit contributions
ously-0.093
ived-0.091
lessly-0.091
der-0.088
rier-0.080
Token play
Token ID711
Feature activation-2.93e+00

Pos logit contributions
 exhibit+0.076
rob+0.072
 library+0.068
 hunt+0.068
 basement+0.063
Neg logit contributions
ously-0.191
ived-0.185
um-0.176
lessly-0.176
der-0.174
Token at
Token ID379
Feature activation-8.60e+00

Pos logit contributions
 library+0.078
 exhibit+0.077
 hunt+0.076
 hall+0.073
|+0.070
Neg logit contributions
der-0.057
lessly-0.051
ously-0.051
rier-0.051
ived-0.049
Token the
Token ID262
Feature activation-4.24e+00

Pos logit contributions
 hunt+0.190
nob+0.180
house+0.178
 exhibit+0.178
asis+0.172
Neg logit contributions
lessly-0.126
der-0.124
ived-0.123
 knowing-0.121
 grateful-0.120
Token park
Token ID3952
Feature activation-2.83e+00

Pos logit contributions
e+0.062
nob+0.061
|+0.054
 hunt+0.051
c+0.048
Neg logit contributions
um-0.113
der-0.110
rier-0.105
ummy-0.099
 louder-0.099
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
 hunt+0.074
 library+0.068
 hall+0.067
 exhibit+0.066
 park+0.066
Neg logit contributions
der-0.059
lessly-0.056
ously-0.054
ived-0.053
rier-0.051
Token They
Token ID1119
Feature activation-2.13e+00

Pos logit contributions
 hunt+0.046
 park+0.041
 exhibit+0.041
 hall+0.041
 library+0.040
Neg logit contributions
lessly-0.036
ously-0.035
der-0.035
ived-0.034
rier-0.033
Token see
Token ID766
Feature activation-2.89e+00

Pos logit contributions
 exhibit+0.043
 hunt+0.040
 theater+0.037
 hall+0.037
 library+0.037
Neg logit contributions
lessly-0.059
ously-0.055
ived-0.054
der-0.051
rier-0.050
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
lessly+0.006
ously+0.006
rier+0.006
ived+0.006
der+0.006
Neg logit contributions
 hunt-0.004
 park-0.004
 library-0.004
 exhibit-0.004
 hall-0.003
Token They
Token ID1119
Feature activation-2.58e+00

Pos logit contributions
 hunt+0.049
 exhibit+0.044
 park+0.043
 library+0.043
 hall+0.043
Neg logit contributions
lessly-0.030
ously-0.030
ived-0.028
der-0.028
rier-0.026
Token like
Token ID588
Feature activation-4.91e+00

Pos logit contributions
 hunt+0.059
 exhibit+0.058
 hall+0.053
 library+0.053
 park+0.052
Neg logit contributions
lessly-0.068
ously-0.064
ived-0.062
der-0.060
rier-0.059
Token to
Token ID284
Feature activation-6.16e+00

Pos logit contributions
 hunt+0.135
 exhibit+0.132
 hall+0.126
 library+0.123
 theater+0.123
Neg logit contributions
ived-0.094
ously-0.094
lessly-0.093
der-0.089
rier-0.082
Token play
Token ID711
Feature activation-2.82e+00

Pos logit contributions
 exhibit+0.080
 library+0.079
rob+0.076
 basement+0.074
 hunt+0.071
Neg logit contributions
ived-0.182
ously-0.177
um-0.166
lessly-0.163
der-0.158
Token at
Token ID379
Feature activation-8.58e+00

Pos logit contributions
 library+0.073
 exhibit+0.071
 hall+0.070
 hunt+0.069
|+0.066
Neg logit contributions
der-0.057
rier-0.051
ived-0.050
ously-0.049
lessly-0.049
Token the
Token ID262
Feature activation-4.13e+00

Pos logit contributions
 hunt+0.184
house+0.175
nob+0.174
 exhibit+0.171
 library+0.164
Neg logit contributions
ived-0.139
der-0.136
lessly-0.135
rier-0.134
 grateful-0.129
Token beach
Token ID10481
Feature activation-3.05e+00

Pos logit contributions
nob+0.071
e+0.070
|+0.066
c+0.062
house+0.061
Neg logit contributions
um-0.099
rier-0.098
der-0.098
ived-0.088
 louder-0.087
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
 hunt+0.075
 library+0.072
|+0.071
nob+0.070
 park+0.070
Neg logit contributions
der-0.062
rier-0.060
lessly-0.057
ived-0.055
ously-0.054
Token They
Token ID1119
Feature activation-2.18e+00

Pos logit contributions
 hunt+0.052
 exhibit+0.045
 library+0.045
 hall+0.044
 park+0.044
Neg logit contributions
lessly-0.032
ived-0.030
ously-0.030
der-0.029
rier-0.026
Token have
Token ID423
Feature activation-3.47e+00

Pos logit contributions
 exhibit+0.040
 hunt+0.039
 library+0.037
 hall+0.036
 theater+0.035
Neg logit contributions
lessly-0.064
ously-0.059
ived-0.058
rier-0.056
der-0.055
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
lessly+0.015
ously+0.015
rier+0.014
ived+0.014
der+0.014
Neg logit contributions
 hunt-0.009
 park-0.008
 exhibit-0.008
 library-0.008
 hall-0.008
Token They
Token ID1119
Feature activation-2.43e+00

Pos logit contributions
 hunt+0.040
 park+0.037
 library+0.037
 exhibit+0.036
 hall+0.036
Neg logit contributions
lessly-0.032
ously-0.031
rier-0.031
ived-0.030
der-0.030
Token like
Token ID588
Feature activation-4.95e+00

Pos logit contributions
 hunt+0.061
 exhibit+0.060
 hall+0.056
 library+0.055
 park+0.054
Neg logit contributions
lessly-0.059
ously-0.055
ived-0.052
der-0.052
rier-0.051
Token to
Token ID284
Feature activation-6.18e+00

Pos logit contributions
 hunt+0.141
 exhibit+0.139
 hall+0.133
 theater+0.130
house+0.129
Neg logit contributions
ously-0.088
ived-0.086
lessly-0.085
der-0.083
rier-0.073
Token play
Token ID711
Feature activation-2.88e+00

Pos logit contributions
 exhibit+0.078
rob+0.078
 library+0.071
 basement+0.068
 hunt+0.068
Neg logit contributions
ously-0.182
ived-0.180
um-0.171
der-0.165
 expression-0.165
Token at
Token ID379
Feature activation-8.58e+00

Pos logit contributions
 library+0.074
 exhibit+0.074
 hunt+0.071
 hall+0.070
|+0.068
Neg logit contributions
der-0.056
ously-0.050
rier-0.049
ived-0.049
lessly-0.049
Token the
Token ID262
Feature activation-4.36e+00

Pos logit contributions
 hunt+0.187
house+0.175
nob+0.174
 exhibit+0.173
asis+0.169
Neg logit contributions
ived-0.128
lessly-0.124
der-0.122
lee-0.120
ously-0.118
Token park
Token ID3952
Feature activation-2.78e+00

Pos logit contributions
e+0.067
nob+0.061
|+0.060
 hunt+0.053
house+0.050
Neg logit contributions
der-0.116
um-0.113
rier-0.110
 louder-0.102
 �-0.102
Token.
Token ID13
Feature activation-1.61e+00

Pos logit contributions
 hunt+0.073
 library+0.067
 exhibit+0.066
 hall+0.066
 test+0.066
Neg logit contributions
der-0.056
lessly-0.052
ously-0.050
ived-0.049
rier-0.048
Token They
Token ID1119
Feature activation-2.05e+00

Pos logit contributions
 hunt+0.045
 exhibit+0.041
 park+0.040
 hall+0.040
 library+0.040
Neg logit contributions
lessly-0.034
ously-0.033
der-0.033
ived-0.032
rier-0.031
Token like
Token ID588
Feature activation-3.91e+00

Pos logit contributions
 exhibit+0.042
 hunt+0.038
 theater+0.036
nob+0.036
 hall+0.036
Neg logit contributions
lessly-0.055
ously-0.051
ived-0.050
der-0.046
rier-0.046
Token friends
Token ID2460
Feature activation-3.07e-01

Pos logit contributions
 hunt+0.018
 park+0.016
 exhibit+0.016
 library+0.016
 hall+0.015
Neg logit contributions
lessly-0.030
ously-0.029
rier-0.029
ived-0.028
der-0.028
Token who
Token ID508
Feature activation-3.68e+00

Pos logit contributions
 hunt+0.007
 park+0.006
 library+0.006
 exhibit+0.006
 hall+0.006
Neg logit contributions
lessly-0.009
ously-0.009
rier-0.008
der-0.008
ived-0.008
Token liked
Token ID8288
Feature activation-3.58e+00

Pos logit contributions
 hunt+0.108
 exhibit+0.105
 hall+0.098
 basement+0.097
 library+0.094
Neg logit contributions
lessly-0.072
ously-0.065
der-0.062
ived-0.053
 widely-0.052
Token to
Token ID284
Feature activation-6.43e+00

Pos logit contributions
 hunt+0.098
 exhibit+0.091
 hall+0.090
 library+0.088
 park+0.086
Neg logit contributions
der-0.073
ously-0.073
ived-0.072
lessly-0.071
rier-0.068
Token play
Token ID711
Feature activation-3.16e+00

Pos logit contributions
rob+0.073
 library+0.071
 exhibit+0.071
 hunt+0.071
 basement+0.066
Neg logit contributions
ously-0.205
ived-0.203
lessly-0.190
der-0.190
um-0.188
Token at
Token ID379
Feature activation-8.56e+00

Pos logit contributions
 library+0.078
 exhibit+0.076
 hunt+0.075
 hall+0.073
|+0.069
Neg logit contributions
der-0.068
lessly-0.061
rier-0.061
ously-0.061
ived-0.058
Token the
Token ID262
Feature activation-4.58e+00

Pos logit contributions
 hunt+0.178
nob+0.171
house+0.168
c+0.163
uch+0.158
Neg logit contributions
um-0.124
der-0.123
 widely-0.120
ived-0.120
lee-0.119
Token park
Token ID3952
Feature activation-2.59e+00

Pos logit contributions
e+0.066
nob+0.063
 hunt+0.060
|+0.058
c+0.055
Neg logit contributions
um-0.123
der-0.119
rier-0.113
ummy-0.108
 widely-0.107
Token.
Token ID13
Feature activation-8.70e-01

Pos logit contributions
 hunt+0.066
 library+0.060
 park+0.060
 exhibit+0.059
 hall+0.059
Neg logit contributions
der-0.057
lessly-0.055
ously-0.054
rier-0.052
ived-0.051
Token They
Token ID1119
Feature activation-2.24e+00

Pos logit contributions
 hunt+0.024
 exhibit+0.022
 park+0.022
 library+0.021
 hall+0.021
Neg logit contributions
lessly-0.019
ously-0.018
der-0.018
ived-0.018
rier-0.017
Token liked
Token ID8288
Feature activation-2.38e+00

Pos logit contributions
 hunt+0.053
 exhibit+0.050
 library+0.047
 hall+0.047
 park+0.047
Neg logit contributions
lessly-0.056
ously-0.056
ived-0.053
der-0.051
rier-0.048
Token Tim
Token ID5045
Feature activation-1.73e+00

Pos logit contributions
 hunt+0.027
 park+0.025
 library+0.024
 exhibit+0.024
 hall+0.024
Neg logit contributions
lessly-0.030
ously-0.030
rier-0.029
ived-0.028
der-0.028
Tokenmy
Token ID1820
Feature activation-9.42e-01

Pos logit contributions
 hunt+0.038
 exhibit+0.033
 library+0.033
 park+0.032
 hall+0.032
Neg logit contributions
lessly-0.049
ously-0.048
der-0.045
rier-0.044
ived-0.044
Token loved
Token ID6151
Feature activation-2.50e+00

Pos logit contributions
 hunt+0.024
 exhibit+0.022
 park+0.021
 library+0.021
 hall+0.021
Neg logit contributions
lessly-0.023
ously-0.023
der-0.021
ived-0.021
rier-0.020
Token to
Token ID284
Feature activation-5.93e+00

Pos logit contributions
 hunt+0.065
 exhibit+0.061
 hall+0.058
house+0.058
 library+0.057
Neg logit contributions
ously-0.051
lessly-0.050
der-0.050
ived-0.049
rier-0.046
Token play
Token ID711
Feature activation-3.39e+00

Pos logit contributions
 exhibit+0.072
 hunt+0.063
 library+0.060
rob+0.059
 zoo+0.059
Neg logit contributions
ously-0.190
ived-0.185
lessly-0.185
der-0.180
 widely-0.174
Token at
Token ID379
Feature activation-8.56e+00

Pos logit contributions
 library+0.067
 exhibit+0.066
 hunt+0.064
c+0.059
 hall+0.058
Neg logit contributions
der-0.081
lessly-0.080
rier-0.079
ived-0.078
ously-0.077
Token the
Token ID262
Feature activation-4.35e+00

Pos logit contributions
 hunt+0.172
 exhibit+0.158
house+0.155
illas+0.150
c+0.148
Neg logit contributions
lessly-0.165
ived-0.158
der-0.156
rier-0.154
ously-0.150
Token park
Token ID3952
Feature activation-2.89e+00

Pos logit contributions
 hunt+0.052
e+0.048
|+0.047
nob+0.047
house+0.046
Neg logit contributions
der-0.128
rier-0.126
um-0.125
ived-0.124
lessly-0.120
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
 hunt+0.068
 library+0.063
 park+0.062
 exhibit+0.061
 hall+0.060
Neg logit contributions
lessly-0.068
der-0.065
ously-0.064
ived-0.063
rier-0.061
Token One
Token ID1881
Feature activation-3.50e+00

Pos logit contributions
 hunt+0.036
 park+0.033
 exhibit+0.033
 library+0.033
 hall+0.032
Neg logit contributions
lessly-0.023
ously-0.023
rier-0.022
der-0.021
ived-0.021
Token day
Token ID1110
Feature activation+5.36e-01

Pos logit contributions
 hunt+0.061
 park+0.053
 exhibit+0.051
 library+0.051
 hall+0.050
Neg logit contributions
lessly-0.116
ously-0.114
rier-0.111
ived-0.111
der-0.110
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
 hunt+0.044
 exhibit+0.041
 library+0.040
 hall+0.040
 theater+0.040
Neg logit contributions
ously-0.037
lessly-0.035
der-0.033
ived-0.031
rier-0.030
Token She
Token ID1375
Feature activation-1.81e+00

Pos logit contributions
 hunt+0.036
 exhibit+0.032
 park+0.032
 library+0.031
 hall+0.031
Neg logit contributions
lessly-0.039
ously-0.038
ived-0.036
der-0.036
rier-0.035
Token loved
Token ID6151
Feature activation-2.52e+00

Pos logit contributions
 hunt+0.050
 exhibit+0.049
 library+0.045
 park+0.045
 zoo+0.043
Neg logit contributions
lessly-0.038
ously-0.034
der-0.032
ived-0.032
edly-0.030
Token to
Token ID284
Feature activation-6.21e+00

Pos logit contributions
 hunt+0.067
 exhibit+0.063
 hall+0.059
 park+0.059
 library+0.058
Neg logit contributions
ived-0.052
lessly-0.052
ously-0.052
der-0.051
rier-0.048
Token play
Token ID711
Feature activation-3.19e+00

Pos logit contributions
 exhibit+0.090
rob+0.073
 hunt+0.072
 library+0.070
 basement+0.068
Neg logit contributions
ived-0.182
ously-0.180
lessly-0.178
der-0.173
 widely-0.171
Token at
Token ID379
Feature activation-8.56e+00

Pos logit contributions
 exhibit+0.073
 library+0.071
 hunt+0.070
 museum+0.065
 hall+0.063
Neg logit contributions
der-0.069
rier-0.068
lessly-0.066
ived-0.066
ously-0.064
Token the
Token ID262
Feature activation-4.11e+00

Pos logit contributions
 hunt+0.178
 exhibit+0.166
house+0.159
illas+0.153
nob+0.152
Neg logit contributions
ived-0.154
lessly-0.153
der-0.149
rier-0.147
 grateful-0.144
Token park
Token ID3952
Feature activation-3.13e+00

Pos logit contributions
 hunt+0.048
nob+0.048
e+0.044
|+0.044
c+0.043
Neg logit contributions
um-0.121
rier-0.121
der-0.120
ived-0.119
lessly-0.110
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
 hunt+0.077
 library+0.071
 exhibit+0.070
 park+0.070
rob+0.069
Neg logit contributions
lessly-0.068
der-0.065
ived-0.064
ously-0.063
rier-0.060
Token Her
Token ID2332
Feature activation-8.06e-01

Pos logit contributions
 hunt+0.043
 exhibit+0.040
 hall+0.038
 theater+0.038
 park+0.038
Neg logit contributions
lessly-0.027
ously-0.027
der-0.025
ived-0.025
um-0.021
Token favorite
Token ID4004
Feature activation-3.08e+00

Pos logit contributions
 hunt+0.022
 exhibit+0.020
 library+0.019
 park+0.019
 hall+0.019
Neg logit contributions
lessly-0.018
ously-0.017
ived-0.017
rier-0.017
der-0.017
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 hunt+0.021
 exhibit+0.019
 library+0.019
 theater+0.018
 park+0.018
Neg logit contributions
ously-0.020
lessly-0.020
der-0.018
ived-0.017
rier-0.017
Token Lily
Token ID20037
Feature activation-8.92e-01

Pos logit contributions
 hunt+0.038
 park+0.035
 exhibit+0.034
 library+0.034
 hall+0.034
Neg logit contributions
lessly-0.031
ously-0.030
rier-0.029
ived-0.028
der-0.028
Token loved
Token ID6151
Feature activation-2.60e+00

Pos logit contributions
 hunt+0.025
 exhibit+0.023
 theater+0.022
 library+0.021
 park+0.021
Neg logit contributions
lessly-0.018
ously-0.018
der-0.016
ived-0.015
ly-0.014
Token to
Token ID284
Feature activation-6.05e+00

Pos logit contributions
 hunt+0.068
 exhibit+0.065
 hall+0.060
 park+0.059
 library+0.059
Neg logit contributions
ously-0.054
ived-0.054
der-0.053
lessly-0.053
rier-0.050
Token play
Token ID711
Feature activation-3.33e+00

Pos logit contributions
 exhibit+0.079
 hunt+0.069
rob+0.068
 library+0.064
 zoo+0.064
Neg logit contributions
ously-0.186
ived-0.184
lessly-0.181
der-0.177
 widely-0.171
Token at
Token ID379
Feature activation-8.54e+00

Pos logit contributions
 exhibit+0.069
 library+0.067
 hunt+0.066
 museum+0.061
 hall+0.059
Neg logit contributions
der-0.078
rier-0.077
ived-0.075
lessly-0.075
ously-0.075
Token the
Token ID262
Feature activation-4.24e+00

Pos logit contributions
 hunt+0.175
 exhibit+0.164
house+0.153
illas+0.151
c+0.146
Neg logit contributions
lessly-0.158
der-0.155
ived-0.153
rier-0.151
 knowing-0.148
Token park
Token ID3952
Feature activation-3.07e+00

Pos logit contributions
 hunt+0.047
e+0.046
nob+0.045
|+0.043
illas+0.042
Neg logit contributions
der-0.125
um-0.124
rier-0.122
ived-0.120
lessly-0.112
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
 hunt+0.074
 library+0.067
 exhibit+0.067
 park+0.067
 theater+0.065
Neg logit contributions
lessly-0.069
der-0.066
ously-0.066
ived-0.065
rier-0.062
Token Her
Token ID2332
Feature activation-7.62e-01

Pos logit contributions
 hunt+0.041
 exhibit+0.037
 park+0.036
 hall+0.036
 library+0.035
Neg logit contributions
lessly-0.026
ously-0.025
der-0.024
ived-0.023
rier-0.023
Token favorite
Token ID4004
Feature activation-3.02e+00

Pos logit contributions
 hunt+0.021
 exhibit+0.020
 library+0.019
 park+0.019
 hall+0.019
Neg logit contributions
lessly-0.016
ously-0.016
rier-0.016
ived-0.016
der-0.015
Token Lily
Token ID20037
Feature activation-4.90e-01

Pos logit contributions
 hunt+0.054
 exhibit+0.048
 park+0.048
 library+0.047
 theater+0.046
Neg logit contributions
lessly-0.058
ously-0.055
ived-0.055
rier-0.053
der-0.053
Token who
Token ID508
Feature activation-2.98e+00

Pos logit contributions
 hunt+0.011
 exhibit+0.010
 library+0.010
 park+0.010
 theater+0.010
Neg logit contributions
ously-0.013
lessly-0.013
der-0.012
rier-0.012
ived-0.012
Token loved
Token ID6151
Feature activation-2.70e+00

Pos logit contributions
 hunt+0.083
 exhibit+0.079
 park+0.075
 theater+0.074
 hall+0.073
Neg logit contributions
lessly-0.064
ously-0.060
der-0.053
ived-0.053
rier-0.051
Token to
Token ID284
Feature activation-5.88e+00

Pos logit contributions
 hunt+0.071
 exhibit+0.066
 park+0.062
 hall+0.062
 library+0.062
Neg logit contributions
lessly-0.058
ously-0.058
der-0.058
ived-0.057
rier-0.055
Token play
Token ID711
Feature activation-3.58e+00

Pos logit contributions
 exhibit+0.075
rob+0.068
 hunt+0.067
 library+0.062
 zoo+0.062
Neg logit contributions
ously-0.182
ived-0.179
lessly-0.177
der-0.176
 expression-0.169
Token at
Token ID379
Feature activation-8.54e+00

Pos logit contributions
 exhibit+0.070
 library+0.069
 hunt+0.066
 museum+0.061
 hall+0.059
Neg logit contributions
der-0.090
rier-0.089
lessly-0.086
ously-0.086
ived-0.085
Token the
Token ID262
Feature activation-4.14e+00

Pos logit contributions
 hunt+0.172
 exhibit+0.160
house+0.151
illas+0.149
c+0.146
Neg logit contributions
der-0.156
lessly-0.156
rier-0.151
ived-0.150
 knowing-0.148
Token beach
Token ID10481
Feature activation-3.37e+00

Pos logit contributions
 hunt+0.054
e+0.053
nob+0.052
|+0.051
c+0.050
Neg logit contributions
der-0.116
rier-0.114
um-0.113
ived-0.109
lessly-0.104
Token.
Token ID13
Feature activation-6.50e-01

Pos logit contributions
 hunt+0.077
 library+0.075
 park+0.073
 exhibit+0.072
 hall+0.069
Neg logit contributions
lessly-0.077
rier-0.076
der-0.074
ived-0.073
ously-0.072
Token One
Token ID1881
Feature activation-3.51e+00

Pos logit contributions
 hunt+0.018
 park+0.017
 exhibit+0.017
 library+0.017
 hall+0.016
Neg logit contributions
lessly-0.014
ously-0.014
rier-0.014
der-0.013
ived-0.013
Token day
Token ID1110
Feature activation+3.25e-01

Pos logit contributions
 hunt+0.051
 park+0.047
 library+0.042
 exhibit+0.042
 hall+0.041
Neg logit contributions
lessly-0.128
rier-0.128
ously-0.127
der-0.124
ived-0.121
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 hunt+0.012
 park+0.011
 library+0.011
 exhibit+0.010
 hall+0.010
Neg logit contributions
lessly-0.017
ously-0.016
rier-0.016
der-0.016
ived-0.016
Token Tim
Token ID5045
Feature activation-1.73e+00

Pos logit contributions
 hunt+0.028
 park+0.025
 library+0.025
 exhibit+0.025
 hall+0.024
Neg logit contributions
lessly-0.033
ously-0.032
rier-0.031
ived-0.031
der-0.031
Tokenmy
Token ID1820
Feature activation-9.49e-01

Pos logit contributions
 hunt+0.039
 exhibit+0.034
 library+0.034
 theater+0.033
 hall+0.032
Neg logit contributions
lessly-0.047
ously-0.047
der-0.043
ived-0.042
rier-0.041
Token loved
Token ID6151
Feature activation-2.41e+00

Pos logit contributions
 hunt+0.024
 exhibit+0.021
 park+0.021
 library+0.021
 hall+0.021
Neg logit contributions
lessly-0.024
ously-0.023
der-0.022
ived-0.022
rier-0.021
Token playing
Token ID2712
Feature activation-3.26e+00

Pos logit contributions
 hunt+0.062
 exhibit+0.059
 hall+0.056
 library+0.055
house+0.055
Neg logit contributions
ously-0.050
ived-0.049
lessly-0.049
der-0.049
rier-0.046
Token at
Token ID379
Feature activation-8.54e+00

Pos logit contributions
 library+0.073
 exhibit+0.072
 hunt+0.068
 hall+0.064
c+0.064
Neg logit contributions
ived-0.067
rier-0.066
lessly-0.066
der-0.066
ously-0.065
Token the
Token ID262
Feature activation-3.86e+00

Pos logit contributions
 hunt+0.179
 exhibit+0.162
house+0.155
 library+0.151
nob+0.149
Neg logit contributions
ived-0.163
lessly-0.160
rier-0.156
der-0.154
ously-0.149
Token park
Token ID3952
Feature activation-2.80e+00

Pos logit contributions
 hunt+0.049
e+0.046
nob+0.046
|+0.045
house+0.043
Neg logit contributions
der-0.110
rier-0.109
ived-0.108
um-0.108
lessly-0.103
Token with
Token ID351
Feature activation-3.30e+00

Pos logit contributions
 hunt+0.064
 library+0.059
 park+0.058
 exhibit+0.057
 hall+0.056
Neg logit contributions
lessly-0.070
ously-0.067
der-0.066
ived-0.065
rier-0.063
Token his
Token ID465
Feature activation-1.67e+00

Pos logit contributions
 hunt+0.087
 exhibit+0.079
 library+0.079
 park+0.078
 hall+0.077
Neg logit contributions
lessly-0.079
ously-0.077
rier-0.074
ived-0.074
der-0.073
Token friends
Token ID2460
Feature activation-1.36e+00

Pos logit contributions
 hunt+0.047
 exhibit+0.043
 library+0.042
 park+0.042
 hall+0.041
Neg logit contributions
lessly-0.037
ously-0.036
rier-0.035
ived-0.035
der-0.034
Token friends
Token ID2460
Feature activation-7.93e-02

Pos logit contributions
 hunt+0.030
 park+0.028
 library+0.027
 exhibit+0.027
 hall+0.027
Neg logit contributions
lessly-0.030
ously-0.029
rier-0.029
der-0.028
ived-0.028
Token who
Token ID508
Feature activation-3.63e+00

Pos logit contributions
 hunt+0.002
 park+0.001
 library+0.001
 exhibit+0.001
 hall+0.001
Neg logit contributions
lessly-0.002
ously-0.002
rier-0.002
der-0.002
ived-0.002
Token liked
Token ID8288
Feature activation-3.59e+00

Pos logit contributions
 hunt+0.102
 exhibit+0.098
 hall+0.092
 basement+0.090
 park+0.089
Neg logit contributions
lessly-0.077
ously-0.073
der-0.069
ived-0.061
rier-0.058
Token to
Token ID284
Feature activation-6.41e+00

Pos logit contributions
 hunt+0.102
 exhibit+0.095
 hall+0.092
 library+0.091
nob+0.090
Neg logit contributions
ously-0.068
ived-0.067
der-0.067
lessly-0.065
rier-0.061
Token play
Token ID711
Feature activation-3.26e+00

Pos logit contributions
rob+0.073
 exhibit+0.072
 library+0.070
 hunt+0.070
 basement+0.065
Neg logit contributions
ously-0.208
ived-0.199
lessly-0.191
der-0.187
um-0.186
Token at
Token ID379
Feature activation-8.54e+00

Pos logit contributions
 library+0.087
 exhibit+0.086
 hunt+0.085
 hall+0.081
asis+0.079
Neg logit contributions
der-0.061
lessly-0.057
ously-0.055
rier-0.055
ived-0.052
Token the
Token ID262
Feature activation-4.41e+00

Pos logit contributions
 hunt+0.175
nob+0.166
house+0.166
asis+0.161
 exhibit+0.160
Neg logit contributions
ived-0.126
lee-0.124
lessly-0.124
 widely-0.121
 knowing-0.121
Token beach
Token ID10481
Feature activation-3.17e+00

Pos logit contributions
e+0.064
nob+0.062
|+0.056
 hunt+0.055
c+0.052
Neg logit contributions
um-0.116
der-0.116
rier-0.109
 widely-0.104
 int-0.104
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 hunt+0.077
 library+0.073
 park+0.072
 basement+0.071
 hall+0.070
Neg logit contributions
der-0.068
rier-0.067
lessly-0.066
ously-0.063
ived-0.061
Token They
Token ID1119
Feature activation-2.22e+00

Pos logit contributions
 hunt+0.027
 park+0.025
 library+0.024
 exhibit+0.024
 hall+0.024
Neg logit contributions
lessly-0.027
ously-0.026
rier-0.026
ived-0.025
der-0.025
Token liked
Token ID8288
Feature activation-2.44e+00

Pos logit contributions
 hunt+0.050
 exhibit+0.046
 library+0.045
 park+0.045
 hall+0.044
Neg logit contributions
lessly-0.060
ously-0.059
ived-0.056
der-0.055
rier-0.053
Token Sally
Token ID25737
Feature activation-1.58e+00

Pos logit contributions
 hunt+0.060
 exhibit+0.054
 park+0.052
 theater+0.051
 library+0.051
Neg logit contributions
lessly-0.055
ously-0.051
ived-0.051
der-0.050
rier-0.049
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
 hunt+0.041
 exhibit+0.038
 theater+0.037
 library+0.037
 park+0.036
Neg logit contributions
lessly-0.038
ously-0.038
der-0.034
ived-0.032
rier-0.032
Token She
Token ID1375
Feature activation-1.62e+00

Pos logit contributions
 hunt+0.027
 park+0.027
 library+0.025
 hall+0.025
 exhibit+0.024
Neg logit contributions
rier-0.029
lessly-0.029
ously-0.028
ived-0.027
der-0.027
Token loved
Token ID6151
Feature activation-2.32e+00

Pos logit contributions
 hunt+0.046
 exhibit+0.044
 library+0.040
 park+0.040
 hall+0.039
Neg logit contributions
lessly-0.034
ously-0.032
der-0.031
ived-0.029
edly-0.027
Token playing
Token ID2712
Feature activation-3.32e+00

Pos logit contributions
 hunt+0.063
 exhibit+0.060
house+0.055
 hall+0.055
 theater+0.055
Neg logit contributions
ously-0.046
lessly-0.046
der-0.046
ived-0.045
rier-0.041
Token at
Token ID379
Feature activation-8.54e+00

Pos logit contributions
 exhibit+0.079
 hunt+0.073
 library+0.073
 museum+0.066
 zoo+0.066
Neg logit contributions
lessly-0.069
der-0.067
ously-0.066
rier-0.065
ived-0.064
Token the
Token ID262
Feature activation-3.92e+00

Pos logit contributions
 hunt+0.191
 exhibit+0.173
house+0.158
 library+0.154
asis+0.154
Neg logit contributions
lessly-0.158
ived-0.155
der-0.150
ously-0.145
rier-0.144
Token park
Token ID3952
Feature activation-2.83e+00

Pos logit contributions
 hunt+0.046
nob+0.044
|+0.044
e+0.043
 exhibit+0.043
Neg logit contributions
der-0.115
rier-0.114
um-0.111
ived-0.107
lessly-0.105
Token near
Token ID1474
Feature activation-4.85e+00

Pos logit contributions
 hunt+0.070
 exhibit+0.064
 library+0.062
 park+0.061
rob+0.061
Neg logit contributions
lessly-0.064
der-0.060
ously-0.060
ived-0.059
rier-0.055
Token her
Token ID607
Feature activation-2.01e+00

Pos logit contributions
 hunt+0.137
 exhibit+0.129
 theater+0.119
 hall+0.118
 library+0.118
Neg logit contributions
ously-0.093
lessly-0.088
ived-0.087
der-0.083
 understanding-0.082
Token house
Token ID2156
Feature activation-3.47e+00

Pos logit contributions
 hunt+0.036
 exhibit+0.030
rob+0.029
nob+0.028
 library+0.028
Neg logit contributions
ously-0.060
der-0.060
ived-0.059
lessly-0.059
rier-0.058
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 hunt+0.016
 exhibit+0.015
 park+0.015
 library+0.015
 hall+0.014
Neg logit contributions
lessly-0.020
ously-0.020
der-0.018
rier-0.018
ived-0.018
Token She
Token ID1375
Feature activation-1.37e+00

Pos logit contributions
 hunt+0.030
 park+0.028
 library+0.028
 hall+0.028
 exhibit+0.027
Neg logit contributions
lessly-0.025
ously-0.025
rier-0.024
ived-0.024
der-0.023
Token loved
Token ID6151
Feature activation-2.42e+00

Pos logit contributions
 hunt+0.034
 park+0.031
 exhibit+0.031
 library+0.030
 hall+0.029
Neg logit contributions
lessly-0.036
ously-0.035
der-0.033
rier-0.033
ived-0.033
Token to
Token ID284
Feature activation-6.16e+00

Pos logit contributions
 hunt+0.063
 exhibit+0.059
 park+0.055
 hall+0.055
 library+0.055
Neg logit contributions
lessly-0.053
ously-0.052
ived-0.052
der-0.051
rier-0.049
Token play
Token ID711
Feature activation-3.12e+00

Pos logit contributions
 exhibit+0.083
 hunt+0.071
rob+0.066
 library+0.066
 zoo+0.065
Neg logit contributions
ously-0.189
ived-0.187
lessly-0.186
der-0.181
 widely-0.175
Token at
Token ID379
Feature activation-8.53e+00

Pos logit contributions
 exhibit+0.062
 library+0.060
 hunt+0.059
 museum+0.054
 hall+0.053
Neg logit contributions
der-0.076
lessly-0.076
rier-0.075
ously-0.075
ived-0.074
Token the
Token ID262
Feature activation-4.36e+00

Pos logit contributions
 hunt+0.171
 exhibit+0.155
house+0.146
illas+0.140
c+0.135
Neg logit contributions
lessly-0.176
der-0.176
rier-0.174
ived-0.170
ously-0.164
Token sea
Token ID5417
Feature activation-2.81e+00

Pos logit contributions
 hunt+0.046
nob+0.043
|+0.043
e+0.041
house+0.040
Neg logit contributions
der-0.134
rier-0.133
um-0.131
ived-0.129
lessly-0.121
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
 hunt+0.064
 library+0.062
 park+0.060
 exhibit+0.059
nob+0.058
Neg logit contributions
lessly-0.065
rier-0.063
ived-0.062
der-0.062
ously-0.061
Token collect
Token ID2824
Feature activation-2.71e+00

Pos logit contributions
 exhibit+0.023
 hunt+0.023
 library+0.022
 park+0.020
 theater+0.020
Neg logit contributions
lessly-0.054
ously-0.054
der-0.052
ived-0.051
rier-0.050
Token pretty
Token ID2495
Feature activation-1.07e+00

Pos logit contributions
 hunt+0.059
 park+0.054
 library+0.053
 exhibit+0.053
 hall+0.052
Neg logit contributions
lessly-0.077
ously-0.075
rier-0.075
ived-0.073
der-0.073
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
 hunt+0.023
 exhibit+0.021
 library+0.021
 theater+0.021
 park+0.021
Neg logit contributions
ously-0.021
lessly-0.020
der-0.019
ived-0.017
rier-0.017
Token She
Token ID1375
Feature activation-1.36e+00

Pos logit contributions
 hunt+0.035
 park+0.032
 exhibit+0.032
 library+0.032
 hall+0.031
Neg logit contributions
lessly-0.026
ously-0.025
rier-0.024
der-0.023
ived-0.023
Token loved
Token ID6151
Feature activation-2.64e+00

Pos logit contributions
 hunt+0.036
 exhibit+0.034
 park+0.033
 library+0.032
 hall+0.032
Neg logit contributions
lessly-0.031
ously-0.030
der-0.028
ived-0.028
rier-0.027
Token to
Token ID284
Feature activation-6.05e+00

Pos logit contributions
 hunt+0.070
 exhibit+0.066
 park+0.062
 hall+0.062
 library+0.061
Neg logit contributions
lessly-0.056
ously-0.056
ived-0.055
der-0.054
rier-0.051
Token play
Token ID711
Feature activation-3.14e+00

Pos logit contributions
 exhibit+0.083
 hunt+0.070
 library+0.064
 hall+0.062
 park+0.061
Neg logit contributions
lessly-0.188
ived-0.187
ously-0.187
der-0.181
 widely-0.175
Token at
Token ID379
Feature activation-8.53e+00

Pos logit contributions
 exhibit+0.069
 hunt+0.066
 library+0.066
 museum+0.060
 hall+0.060
Neg logit contributions
der-0.071
rier-0.071
lessly-0.069
ived-0.069
ously-0.068
Token the
Token ID262
Feature activation-4.23e+00

Pos logit contributions
 hunt+0.174
 exhibit+0.158
house+0.152
c+0.143
nob+0.141
Neg logit contributions
lessly-0.175
rier-0.171
der-0.170
ived-0.169
ously-0.160
Token park
Token ID3952
Feature activation-2.91e+00

Pos logit contributions
 hunt+0.048
nob+0.044
house+0.041
e+0.041
 exhibit+0.041
Neg logit contributions
rier-0.129
um-0.129
der-0.129
ived-0.127
lessly-0.120
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
 hunt+0.071
 exhibit+0.065
 library+0.064
 park+0.064
 theater+0.062
Neg logit contributions
lessly-0.066
der-0.063
ously-0.062
ived-0.062
rier-0.059
Token Her
Token ID2332
Feature activation-6.48e-01

Pos logit contributions
 hunt+0.039
 exhibit+0.036
 park+0.034
 hall+0.034
 theater+0.034
Neg logit contributions
lessly-0.028
ously-0.027
der-0.026
ived-0.026
rier-0.025
Token favorite
Token ID4004
Feature activation-3.05e+00

Pos logit contributions
 hunt+0.018
 exhibit+0.017
 library+0.016
 hall+0.016
 park+0.016
Neg logit contributions
lessly-0.013
ously-0.013
ived-0.013
rier-0.013
der-0.013
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
lessly+0.008
ously+0.008
rier+0.008
der+0.008
ived+0.007
Neg logit contributions
 hunt-0.006
 park-0.005
 library-0.005
 exhibit-0.005
 hall-0.005
Token They
Token ID1119
Feature activation-2.69e+00

Pos logit contributions
 hunt+0.040
 exhibit+0.036
 park+0.036
 library+0.035
 hall+0.035
Neg logit contributions
ously-0.025
lessly-0.025
ived-0.024
der-0.023
rier-0.021
Token like
Token ID588
Feature activation-4.92e+00

Pos logit contributions
 hunt+0.064
 exhibit+0.063
 hall+0.058
 library+0.057
 theater+0.055
Neg logit contributions
lessly-0.068
ously-0.064
ived-0.062
der-0.059
rier-0.058
Token to
Token ID284
Feature activation-6.44e+00

Pos logit contributions
 hunt+0.137
 exhibit+0.135
 hall+0.129
house+0.125
 library+0.125
Neg logit contributions
ived-0.093
ously-0.092
lessly-0.090
der-0.086
rier-0.078
Token play
Token ID711
Feature activation-2.89e+00

Pos logit contributions
 exhibit+0.082
rob+0.078
 library+0.076
 hunt+0.074
 basement+0.072
Neg logit contributions
ived-0.192
ously-0.189
um-0.179
 widely-0.171
lessly-0.170
Token at
Token ID379
Feature activation-8.51e+00

Pos logit contributions
 library+0.075
 exhibit+0.075
 hunt+0.072
 hall+0.071
 museum+0.068
Neg logit contributions
der-0.056
ously-0.050
ived-0.050
lessly-0.048
rier-0.048
Token the
Token ID262
Feature activation-4.28e+00

Pos logit contributions
 hunt+0.189
nob+0.183
house+0.180
 exhibit+0.179
asis+0.170
Neg logit contributions
ived-0.122
 grateful-0.114
lee-0.112
lessly-0.111
 pleased-0.109
Token park
Token ID3952
Feature activation-2.73e+00

Pos logit contributions
e+0.062
nob+0.058
|+0.054
 hunt+0.049
house+0.047
Neg logit contributions
um-0.114
der-0.114
rier-0.108
 louder-0.102
lessly-0.101
Token.
Token ID13
Feature activation-1.56e+00

Pos logit contributions
 hunt+0.074
 library+0.068
 exhibit+0.068
 hall+0.067
 test+0.067
Neg logit contributions
der-0.053
lessly-0.049
ously-0.047
ived-0.047
edly-0.046
Token They
Token ID1119
Feature activation-2.30e+00

Pos logit contributions
 hunt+0.046
 exhibit+0.040
 hall+0.040
 library+0.040
 theater+0.040
Neg logit contributions
lessly-0.029
ived-0.028
ously-0.028
der-0.027
um-0.023
Token like
Token ID588
Feature activation-3.96e+00

Pos logit contributions
 exhibit+0.046
 hunt+0.042
 theater+0.040
nob+0.040
house+0.039
Neg logit contributions
lessly-0.062
ived-0.058
ously-0.057
 pleased-0.053
rier-0.052
Token away
Token ID1497
Feature activation+1.83e+00

Pos logit contributions
lessly+0.017
ously+0.017
rier+0.016
der+0.016
ived+0.016
Neg logit contributions
 hunt-0.010
 park-0.009
 exhibit-0.008
 hall-0.008
 library-0.008
Token,
Token ID11
Feature activation+2.65e+00

Pos logit contributions
lessly+0.053
ously+0.052
ived+0.051
rier+0.051
der+0.049
Neg logit contributions
 hunt-0.040
 park-0.035
 exhibit-0.034
 hall-0.034
 library-0.034
Token but
Token ID475
Feature activation+1.30e+00

Pos logit contributions
rier+0.066
lessly+0.065
ously+0.064
ived+0.064
der+0.061
Neg logit contributions
 hunt-0.068
 park-0.063
 hall-0.060
 library-0.059
 slide-0.059
Token Spot
Token ID15899
Feature activation+9.67e-01

Pos logit contributions
lessly+0.041
ously+0.040
rier+0.039
ived+0.039
der+0.038
Neg logit contributions
 hunt-0.025
 park-0.023
 library-0.022
 exhibit-0.022
 hall-0.022
Token was
Token ID373
Feature activation+3.71e+00

Pos logit contributions
lessly+0.028
ously+0.027
rier+0.027
ived+0.026
der+0.026
Neg logit contributions
 hunt-0.021
 park-0.019
 library-0.019
 exhibit-0.019
 hall-0.018
Token fast
Token ID3049
Feature activation+4.52e+00

Pos logit contributions
lessly+0.083
ously+0.082
rier+0.080
der+0.077
ived+0.077
Neg logit contributions
 hunt-0.105
 park-0.098
 library-0.094
 exhibit-0.094
 hall-0.093
Token.
Token ID13
Feature activation+4.88e-01

Pos logit contributions
lessly+0.135
rier+0.132
ously+0.132
ived+0.129
der+0.127
Neg logit contributions
 hunt-0.092
 park-0.083
 exhibit-0.078
 library-0.078
 hall-0.077
Token He
Token ID679
Feature activation-1.92e-01

Pos logit contributions
ously+0.012
lessly+0.012
rier+0.011
ived+0.011
der+0.011
Neg logit contributions
 hunt-0.013
 park-0.012
 library-0.012
 exhibit-0.012
 hall-0.011
Token caught
Token ID4978
Feature activation+1.58e+00

Pos logit contributions
 hunt+0.005
 park+0.004
 library+0.004
 exhibit+0.004
 hall+0.004
Neg logit contributions
lessly-0.005
ously-0.005
rier-0.005
ived-0.005
der-0.005
Token the
Token ID262
Feature activation+3.97e-01

Pos logit contributions
lessly+0.049
ously+0.047
rier+0.047
der+0.046
ived+0.046
Neg logit contributions
 hunt-0.032
 library-0.028
 park-0.028
 exhibit-0.028
 hall-0.028
Token cat
Token ID3797
Feature activation-1.36e-01

Pos logit contributions
lessly+0.012
ously+0.011
rier+0.011
ived+0.011
der+0.011
Neg logit contributions
 hunt-0.008
 exhibit-0.007
 theater-0.007
 library-0.007
 park-0.007
Token
Token ID198
Feature activation+4.88e-01

Pos logit contributions
 hunt+0.006
 park+0.006
 library+0.005
 exhibit+0.005
 hall+0.005
Neg logit contributions
rier-0.010
lessly-0.010
ously-0.010
der-0.010
ived-0.010
TokenS
Token ID50
Feature activation+3.04e+00

Pos logit contributions
rier+0.012
ously+0.012
lessly+0.011
der+0.011
ived+0.011
Neg logit contributions
 park-0.013
 hunt-0.012
 library-0.012
 store-0.012
 hall-0.011
Tokenoph
Token ID2522
Feature activation+3.46e+00

Pos logit contributions
lessly+0.059
ously+0.058
rier+0.056
ived+0.054
der+0.054
Neg logit contributions
 hunt-0.095
 park-0.089
 library-0.087
 exhibit-0.087
 hall-0.086
Tokenie
Token ID494
Feature activation+6.30e-01

Pos logit contributions
lessly+0.102
rier+0.100
ously+0.096
ived+0.096
der+0.093
Neg logit contributions
 hunt-0.068
 park-0.067
 exhibit-0.064
 hall-0.061
 library-0.061
Token got
Token ID1392
Feature activation+3.24e+00

Pos logit contributions
ously+0.018
lessly+0.018
ived+0.017
der+0.017
rier+0.017
Neg logit contributions
 hunt-0.014
 park-0.012
 library-0.012
 exhibit-0.012
 hall-0.012
Token very
Token ID845
Feature activation+4.71e+00

Pos logit contributions
ously+0.079
lessly+0.079
rier+0.076
der+0.076
ived+0.072
Neg logit contributions
 hunt-0.082
 park-0.079
 hall-0.075
 library-0.075
 exhibit-0.074
Token impatient
Token ID33440
Feature activation+3.47e+00

Pos logit contributions
ously+0.131
lessly+0.123
der+0.121
rier+0.115
iously+0.114
Neg logit contributions
 play-0.100
 store-0.099
 park-0.096
 test-0.096
 slide-0.094
Token,
Token ID11
Feature activation+2.13e+00

Pos logit contributions
lessly+0.092
rier+0.091
ously+0.090
ived+0.088
der+0.084
Neg logit contributions
 hunt-0.081
 park-0.076
 library-0.073
 hall-0.072
 exhibit-0.072
Token so
Token ID523
Feature activation+3.87e+00

Pos logit contributions
rier+0.052
ously+0.051
ived+0.050
lessly+0.050
der+0.048
Neg logit contributions
 hunt-0.055
 park-0.053
 library-0.050
 hall-0.049
 store-0.049
Token she
Token ID673
Feature activation+4.13e-01

Pos logit contributions
lessly+0.121
rier+0.120
ously+0.118
ived+0.115
der+0.113
Neg logit contributions
 park-0.071
 hunt-0.068
 library-0.065
 store-0.063
 hall-0.063
Token called
Token ID1444
Feature activation-2.14e-02

Pos logit contributions
lessly+0.012
rier+0.012
ously+0.012
ived+0.011
der+0.011
Neg logit contributions
 hunt-0.009
 park-0.008
 library-0.008
 hall-0.008
 exhibit-0.008
Token said
Token ID531
Feature activation+5.48e+00

Pos logit contributions
lessly+0.075
ously+0.073
rier+0.073
ived+0.070
der+0.070
Neg logit contributions
 hunt-0.057
 park-0.052
 library-0.050
 exhibit-0.049
 hall-0.049
Token.
Token ID13
Feature activation+7.17e-01

Pos logit contributions
lessly+0.195
ously+0.188
der+0.188
ived+0.188
rier+0.186
Neg logit contributions
 hunt-0.060
 play-0.057
 hide-0.051
 park-0.048
 store-0.047
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
rier+0.023
ously+0.023
lessly+0.023
ived+0.022
iously+0.022
Neg logit contributions
 hunt-0.012
 park-0.012
 library-0.011
 play-0.011
 hall-0.011
Token
Token ID198
Feature activation+1.72e+00

Pos logit contributions
rier+0.051
lessly+0.050
ously+0.050
der+0.049
ived+0.049
Neg logit contributions
 hunt-0.027
 park-0.026
 library-0.023
 exhibit-0.023
 hall-0.023
TokenThey
Token ID2990
Feature activation+2.97e-01

Pos logit contributions
rier+0.049
ously+0.046
lessly+0.046
der+0.046
ived+0.046
Neg logit contributions
 hunt-0.039
 park-0.038
 library-0.035
 hall-0.035
 play-0.034
Token waved
Token ID26834
Feature activation+5.19e+00

Pos logit contributions
lessly+0.007
ously+0.007
ived+0.007
der+0.007
rier+0.007
Neg logit contributions
 hunt-0.008
 exhibit-0.007
 library-0.007
 park-0.007
 hall-0.007
Token at
Token ID379
Feature activation-9.37e-01

Pos logit contributions
lessly+0.145
rier+0.142
ously+0.136
der+0.132
iously+0.132
Neg logit contributions
 play-0.107
 park-0.103
 hunt-0.103
 hide-0.095
 visit-0.091
Token the
Token ID262
Feature activation-2.43e-03

Pos logit contributions
 hunt+0.025
 exhibit+0.023
 library+0.022
 hall+0.022
 park+0.021
Neg logit contributions
lessly-0.022
ously-0.021
der-0.021
rier-0.020
ived-0.020
Token dog
Token ID3290
Feature activation+3.99e-01

Pos logit contributions
 hunt+0.000
 exhibit+0.000
 hall+0.000
 library+0.000
house+0.000
Neg logit contributions
lessly-0.000
rier-0.000
der-0.000
ously-0.000
ived-0.000
Token and
Token ID290
Feature activation+2.25e+00

Pos logit contributions
ously+0.008
lessly+0.008
der+0.008
ived+0.008
rier+0.007
Neg logit contributions
 hunt-0.012
 park-0.011
 library-0.011
 hall-0.011
 museum-0.010
Token walked
Token ID6807
Feature activation+2.11e+00

Pos logit contributions
ously+0.054
lessly+0.053
der+0.051
ived+0.050
rier+0.047
Neg logit contributions
 hunt-0.057
 exhibit-0.055
 library-0.054
 park-0.053
 hall-0.053
Token kind
Token ID1611
Feature activation+3.51e+00

Pos logit contributions
lessly+0.089
rier+0.085
ously+0.081
ived+0.080
der+0.079
Neg logit contributions
 hunt-0.134
 park-0.124
 exhibit-0.123
 library-0.122
 hall-0.121
Token."
Token ID526
Feature activation+1.39e+00

Pos logit contributions
lessly+0.107
ously+0.105
rier+0.102
ived+0.101
der+0.098
Neg logit contributions
 hunt-0.069
 park-0.064
 library-0.061
 exhibit-0.060
 hall-0.059
Token
Token ID198
Feature activation+2.10e+00

Pos logit contributions
rier+0.049
lessly+0.049
ously+0.047
ived+0.046
der+0.045
Neg logit contributions
 park-0.023
 hunt-0.022
 library-0.021
 hall-0.020
 play-0.020
Token
Token ID198
Feature activation+1.90e+00

Pos logit contributions
lessly+0.067
rier+0.067
ously+0.066
der+0.064
ived+0.064
Neg logit contributions
 hunt-0.039
 park-0.037
 library-0.034
 hall-0.034
 exhibit-0.033
TokenMom
Token ID29252
Feature activation+1.68e-01

Pos logit contributions
lessly+0.046
rier+0.046
ously+0.045
der+0.043
ived+0.043
Neg logit contributions
 park-0.050
 hunt-0.050
 library-0.047
 hall-0.047
 slide-0.046
Token smiled
Token ID13541
Feature activation+6.27e+00

Pos logit contributions
lessly+0.004
rier+0.004
ously+0.004
der+0.004
ived+0.004
Neg logit contributions
 hunt-0.004
 park-0.004
 exhibit-0.004
 hall-0.004
 library-0.004
Token and
Token ID290
Feature activation+3.04e+00

Pos logit contributions
iously+0.222
ived+0.220
lessly+0.218
ously+0.215
rier+0.215
Neg logit contributions
 park-0.079
 play-0.071
 outside-0.060
 store-0.057
 read-0.055
Token said
Token ID531
Feature activation+4.83e+00

Pos logit contributions
lessly+0.082
ously+0.080
rier+0.078
der+0.077
ived+0.077
Neg logit contributions
 hunt-0.072
 park-0.066
 exhibit-0.064
 library-0.064
 hall-0.064
Token,
Token ID11
Feature activation+3.48e+00

Pos logit contributions
lessly+0.161
ously+0.150
rier+0.147
ived+0.142
uster+0.141
Neg logit contributions
 hunt-0.075
 park-0.071
 hide-0.070
 slide-0.069
 library-0.067
Token "
Token ID366
Feature activation-1.11e+00

Pos logit contributions
lessly+0.084
ously+0.084
ived+0.082
rier+0.081
iously+0.074
Neg logit contributions
 hunt-0.087
 park-0.084
 library-0.080
 hall-0.080
 store-0.079
TokenYou
Token ID1639
Feature activation-1.06e+00

Pos logit contributions
 hunt+0.023
 park+0.021
 library+0.020
 hall+0.020
 exhibit+0.019
Neg logit contributions
lessly-0.033
ously-0.033
rier-0.033
ived-0.032
der-0.031
Token
Token ID198
Feature activation+5.76e-01

Pos logit contributions
rier+0.010
ously+0.009
der+0.009
lessly+0.009
ived+0.009
Neg logit contributions
 park-0.005
 hunt-0.005
 library-0.004
 hall-0.004
 exhibit-0.004
TokenAb
Token ID4826
Feature activation+2.81e+00

Pos logit contributions
rier+0.014
ously+0.013
der+0.013
lessly+0.013
ived+0.013
Neg logit contributions
 park-0.015
 hunt-0.014
 hall-0.014
 library-0.013
 store-0.013
Tokenig
Token ID328
Feature activation+1.11e+00

Pos logit contributions
lessly+0.063
ously+0.062
rier+0.060
ived+0.059
der+0.058
Neg logit contributions
 hunt-0.079
 park-0.073
 library-0.072
 exhibit-0.071
 hall-0.071
Tokenail
Token ID603
Feature activation+5.56e-01

Pos logit contributions
lessly+0.026
ously+0.025
rier+0.025
der+0.024
ived+0.023
Neg logit contributions
 hunt-0.030
 park-0.028
 exhibit-0.027
 hall-0.027
 library-0.027
Token felt
Token ID2936
Feature activation+4.43e+00

Pos logit contributions
lessly+0.015
ously+0.015
ived+0.014
der+0.014
rier+0.014
Neg logit contributions
 hunt-0.013
 library-0.011
 park-0.011
 exhibit-0.011
 hall-0.011
Token deep
Token ID2769
Feature activation+4.79e+00

Pos logit contributions
ously+0.104
lessly+0.102
rier+0.100
ived+0.099
der+0.097
Neg logit contributions
 hunt-0.118
 park-0.112
 library-0.106
 hall-0.105
 exhibit-0.104
Token regret
Token ID13721
Feature activation+4.39e+00

Pos logit contributions
lessly+0.126
ously+0.124
ived+0.121
rier+0.119
der+0.116
Neg logit contributions
 hunt-0.115
 park-0.106
 hall-0.101
 exhibit-0.100
 library-0.100
Token and
Token ID290
Feature activation+2.70e+00

Pos logit contributions
lessly+0.117
ously+0.115
rier+0.114
ived+0.113
der+0.108
Neg logit contributions
 hunt-0.099
 park-0.090
 hall-0.088
 library-0.086
 exhibit-0.086
Token decided
Token ID3066
Feature activation+1.97e+00

Pos logit contributions
lessly+0.066
ously+0.065
rier+0.063
der+0.061
ived+0.061
Neg logit contributions
 hunt-0.071
 park-0.065
 exhibit-0.064
 library-0.064
 hall-0.063
Token to
Token ID284
Feature activation-3.43e+00

Pos logit contributions
lessly+0.054
ously+0.053
rier+0.052
der+0.051
ived+0.051
Neg logit contributions
 hunt-0.046
 park-0.042
 exhibit-0.041
 library-0.041
 hall-0.040
Token take
Token ID1011
Feature activation-1.01e+00

Pos logit contributions
 library+0.073
rob+0.071
 hunt+0.069
 exhibit+0.069
 hall+0.067
Neg logit contributions
ously-0.091
lessly-0.086
rier-0.084
ived-0.081
der-0.081
Token the
Token ID262
Feature activation-3.48e+00

Pos logit contributions
 hunt+0.083
 exhibit+0.077
 hall+0.071
 library+0.071
 park+0.069
Neg logit contributions
lessly-0.105
ously-0.103
rier-0.096
ived-0.093
der-0.091
Token forest
Token ID8222
Feature activation-2.93e+00

Pos logit contributions
 hunt+0.044
 exhibit+0.040
e+0.035
nob+0.032
 bake+0.032
Neg logit contributions
lessly-0.108
rier-0.108
ously-0.105
der-0.103
ived-0.102
Token.
Token ID13
Feature activation-8.26e-01

Pos logit contributions
 hunt+0.072
 library+0.070
rob+0.069
 park+0.068
 exhibit+0.068
Neg logit contributions
ously-0.066
lessly-0.064
rier-0.060
der-0.060
ived-0.057
Token One
Token ID1881
Feature activation-3.31e+00

Pos logit contributions
 hunt+0.024
 park+0.022
 library+0.022
 exhibit+0.022
 hall+0.022
Neg logit contributions
lessly-0.016
ously-0.016
der-0.015
ived-0.015
rier-0.014
Token day
Token ID1110
Feature activation+3.85e-01

Pos logit contributions
 hunt+0.062
 park+0.053
 library+0.053
 exhibit+0.053
 hall+0.051
Neg logit contributions
lessly-0.104
ously-0.101
ived-0.098
rier-0.096
der-0.096
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
lessly+0.011
rier+0.011
ously+0.010
der+0.010
ived+0.010
Neg logit contributions
 hunt-0.009
 exhibit-0.008
 park-0.008
 hall-0.008
 theater-0.007
Token they
Token ID484
Feature activation-1.71e+00

Pos logit contributions
lessly+0.005
ously+0.005
ived+0.005
rier+0.005
der+0.005
Neg logit contributions
 hunt-0.005
 park-0.004
 library-0.004
 exhibit-0.004
 hall-0.004
Token went
Token ID1816
Feature activation-2.27e-01

Pos logit contributions
 hunt+0.035
 library+0.032
 exhibit+0.031
 park+0.031
 hall+0.030
Neg logit contributions
ously-0.049
lessly-0.049
ived-0.048
rier-0.046
der-0.045
Token on
Token ID319
Feature activation-2.51e+00

Pos logit contributions
 hunt+0.006
 library+0.005
 park+0.005
 exhibit+0.005
 hall+0.005
Neg logit contributions
lessly-0.006
ously-0.005
ived-0.005
rier-0.005
der-0.005
Token a
Token ID257
Feature activation-3.16e+00

Pos logit contributions
 hunt+0.058
 bake+0.052
 hall+0.052
 exhibit+0.052
 library+0.051
Neg logit contributions
ously-0.055
lessly-0.053
der-0.053
ived-0.049
iously-0.049
Token trail
Token ID8025
Feature activation-4.86e-01

Pos logit contributions
 hunt+0.044
 library+0.040
 exhibit+0.040
 park+0.039
 theater+0.039
Neg logit contributions
lessly-0.097
ously-0.096
ived-0.094
rier-0.091
der-0.091
Token nature
Token ID3450
Feature activation-2.17e+00

Pos logit contributions
 hall+0.079
rob+0.077
 library+0.076
 exhibit+0.075
 theater+0.075
Neg logit contributions
rier-0.086
der-0.085
lessly-0.081
iously-0.080
ived-0.079
Token."
Token ID526
Feature activation+7.81e-01

Pos logit contributions
 hunt+0.055
 library+0.055
 museum+0.052
 hall+0.052
 park+0.052
Neg logit contributions
der-0.047
rier-0.047
lessly-0.045
ived-0.044
iously-0.040
Token
Token ID220
Feature activation+3.57e+00

Pos logit contributions
lessly+0.022
ously+0.022
rier+0.021
der+0.021
ived+0.020
Neg logit contributions
 hunt-0.018
 park-0.016
 library-0.016
 exhibit-0.015
 hall-0.015
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
lessly+0.103
ously+0.101
rier+0.099
ived+0.097
der+0.097
Neg logit contributions
 hunt-0.077
 park-0.070
 library-0.068
 exhibit-0.068
 hall-0.067
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
lessly+0.032
rier+0.032
ously+0.032
der+0.031
ived+0.031
Neg logit contributions
 hunt-0.023
 park-0.021
 exhibit-0.020
 library-0.020
 hall-0.020
TokenLittle
Token ID22253
Feature activation+2.69e+00

Pos logit contributions
rier+0.030
lessly+0.030
ously+0.029
ived+0.028
der+0.028
Neg logit contributions
 hunt-0.031
 park-0.030
 hall-0.028
 exhibit-0.028
 library-0.028
Token Tim
Token ID5045
Feature activation+8.23e-01

Pos logit contributions
ously+0.074
lessly+0.074
ived+0.071
rier+0.070
der+0.069
Neg logit contributions
 hunt-0.060
 library-0.054
 park-0.053
 exhibit-0.052
 hall-0.052
Token liked
Token ID8288
Feature activation-5.89e-01

Pos logit contributions
lessly+0.029
rier+0.028
ously+0.028
der+0.027
ived+0.027
Neg logit contributions
 hunt-0.013
 park-0.012
 exhibit-0.011
 library-0.011
 hall-0.011
Token the
Token ID262
Feature activation+9.20e-01

Pos logit contributions
 hunt+0.012
 library+0.011
 hall+0.011
 exhibit+0.011
 park+0.011
Neg logit contributions
der-0.016
ously-0.016
lessly-0.015
ived-0.015
rier-0.015
Token idea
Token ID2126
Feature activation+2.01e+00

Pos logit contributions
lessly+0.018
ously+0.018
rier+0.017
der+0.016
ived+0.016
Neg logit contributions
 hunt-0.027
 exhibit-0.026
 library-0.025
 theater-0.025
 hall-0.025
Token and
Token ID290
Feature activation+1.91e+00

Pos logit contributions
lessly+0.047
rier+0.045
ously+0.045
ived+0.043
der+0.043
Neg logit contributions
 hunt-0.055
 park-0.051
 exhibit-0.050
 library-0.050
 hall-0.049
Token the
Token ID262
Feature activation-9.53e-01

Pos logit contributions
 hunt+0.075
 library+0.072
 exhibit+0.071
 hall+0.064
 museum+0.063
Neg logit contributions
lessly-0.085
der-0.082
rier-0.081
ously-0.077
ived-0.076
Token p
Token ID279
Feature activation+1.27e+00

Pos logit contributions
 hunt+0.021
 exhibit+0.019
 library+0.018
 theater+0.017
 museum+0.017
Neg logit contributions
lessly-0.027
rier-0.026
der-0.025
ived-0.025
ously-0.025
Tokenuddle
Token ID24500
Feature activation-3.04e-01

Pos logit contributions
lessly+0.030
ously+0.030
rier+0.029
ived+0.028
der+0.028
Neg logit contributions
 hunt-0.034
 park-0.032
 library-0.031
 exhibit-0.031
 hall-0.031
Token instead
Token ID2427
Feature activation+5.71e-01

Pos logit contributions
 hunt+0.007
 park+0.007
 library+0.007
 exhibit+0.006
 hall+0.006
Neg logit contributions
lessly-0.008
ously-0.008
rier-0.008
der-0.008
ived-0.008
Token."
Token ID526
Feature activation+8.44e-01

Pos logit contributions
lessly+0.016
ously+0.015
rier+0.015
ived+0.015
der+0.015
Neg logit contributions
 hunt-0.013
 library-0.012
 park-0.012
 exhibit-0.012
 hall-0.011
Token
Token ID198
Feature activation+1.50e+00

Pos logit contributions
lessly+0.024
ously+0.023
rier+0.023
ived+0.022
der+0.022
Neg logit contributions
 hunt-0.019
 park-0.018
 library-0.017
 exhibit-0.017
 hall-0.017
Token
Token ID198
Feature activation+1.63e+00

Pos logit contributions
lessly+0.046
rier+0.045
ously+0.045
der+0.043
ived+0.043
Neg logit contributions
 hunt-0.030
 park-0.028
 exhibit-0.026
 library-0.026
 hall-0.026
TokenSo
Token ID2396
Feature activation+2.74e+00

Pos logit contributions
rier+0.044
lessly+0.043
ously+0.042
der+0.041
ived+0.041
Neg logit contributions
 hunt-0.040
 park-0.038
 library-0.035
 hall-0.035
 slide-0.035
Token,
Token ID11
Feature activation+2.16e+00

Pos logit contributions
rier+0.089
lessly+0.088
ously+0.085
der+0.084
ived+0.080
Neg logit contributions
 hunt-0.049
 park-0.047
 play-0.046
 slide-0.045
 exhibit-0.044
Token they
Token ID484
Feature activation+5.60e-01

Pos logit contributions
lessly+0.049
ously+0.047
ived+0.045
der+0.045
rier+0.044
Neg logit contributions
 hunt-0.061
 library-0.055
 exhibit-0.054
 park-0.054
 hall-0.053
Token found
Token ID1043
Feature activation+8.55e-02

Pos logit contributions
lessly+0.015
ously+0.015
rier+0.015
ived+0.014
der+0.014
Neg logit contributions
 hunt-0.013
 park-0.012
 library-0.012
 exhibit-0.012
 hall-0.012
Token see
Token ID766
Feature activation-4.77e-01

Pos logit contributions
 hunt+0.040
 exhibit+0.038
 library+0.038
 hall+0.037
house+0.037
Neg logit contributions
ously-0.042
lessly-0.042
rier-0.041
der-0.040
ived-0.039
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
 hunt+0.012
 hall+0.011
 library+0.010
 exhibit+0.010
 park+0.010
Neg logit contributions
lessly-0.013
ously-0.012
der-0.012
rier-0.012
ived-0.011
Token sun
Token ID4252
Feature activation-8.78e-01

Pos logit contributions
rier+0.030
lessly+0.030
der+0.028
ously+0.028
ived+0.028
Neg logit contributions
 hunt-0.031
 exhibit-0.029
 hall-0.028
 library-0.027
 theater-0.027
Token and
Token ID290
Feature activation+7.88e-01

Pos logit contributions
 hunt+0.019
 library+0.017
 exhibit+0.017
 park+0.017
 museum+0.017
Neg logit contributions
lessly-0.025
rier-0.025
ously-0.025
ived-0.024
der-0.024
Token felt
Token ID2936
Feature activation+3.67e+00

Pos logit contributions
lessly+0.021
ously+0.020
der+0.020
rier+0.020
ived+0.019
Neg logit contributions
 hunt-0.019
 exhibit-0.017
 library-0.017
 hall-0.017
 park-0.017
Token happy
Token ID3772
Feature activation+3.34e+00

Pos logit contributions
lessly+0.086
ously+0.085
rier+0.082
ived+0.080
der+0.080
Neg logit contributions
 hunt-0.100
 park-0.093
 library-0.090
 exhibit-0.089
 hall-0.089
Token and
Token ID290
Feature activation+2.93e+00

Pos logit contributions
lessly+0.092
ously+0.090
der+0.089
rier+0.088
ived+0.085
Neg logit contributions
 hunt-0.079
 park-0.071
 library-0.071
 exhibit-0.070
 hall-0.069
Token energ
Token ID19647
Feature activation+4.89e+00

Pos logit contributions
lessly+0.058
ously+0.055
rier+0.054
ived+0.052
der+0.051
Neg logit contributions
 hunt-0.091
 park-0.085
 exhibit-0.085
 library-0.085
 hall-0.083
Tokenized
Token ID1143
Feature activation+2.44e+00

Pos logit contributions
lessly+0.105
ously+0.103
rier+0.099
ived+0.097
der+0.097
Neg logit contributions
 hunt-0.143
 park-0.133
 library-0.129
 exhibit-0.129
 hall-0.128
Token.
Token ID13
Feature activation+9.49e-01

Pos logit contributions
lessly+0.071
ously+0.069
rier+0.069
der+0.067
ived+0.065
Neg logit contributions
 hunt-0.054
 park-0.048
 library-0.048
 exhibit-0.047
 hall-0.047
Token He
Token ID679
Feature activation+1.16e-01

Pos logit contributions
lessly+0.019
ously+0.018
rier+0.018
ived+0.017
der+0.016
Neg logit contributions
 hunt-0.029
 park-0.028
 library-0.027
 exhibit-0.027
 hall-0.027
Token quite
Token ID2407
Feature activation+4.23e+00

Pos logit contributions
lessly+0.074
ously+0.073
rier+0.071
ived+0.069
der+0.069
Neg logit contributions
 hunt-0.080
 park-0.074
 library-0.072
 exhibit-0.072
 hall-0.071
Token noisy
Token ID31210
Feature activation+2.97e+00

Pos logit contributions
ously+0.102
lessly+0.100
rier+0.096
ived+0.095
der+0.094
Neg logit contributions
 hunt-0.112
 park-0.104
 library-0.101
 exhibit-0.098
 hall-0.098
Token!
Token ID0
Feature activation+8.90e-02

Pos logit contributions
lessly+0.078
ously+0.075
der+0.075
rier+0.075
ived+0.072
Neg logit contributions
 hunt-0.073
 park-0.066
 exhibit-0.066
 library-0.065
 hall-0.064
Token She
Token ID1375
Feature activation-1.32e-01

Pos logit contributions
lessly+0.002
ously+0.002
rier+0.002
ived+0.002
der+0.002
Neg logit contributions
 hunt-0.002
 park-0.002
 library-0.002
 exhibit-0.002
 hall-0.002
Token frowned
Token ID32198
Feature activation+4.88e+00

Pos logit contributions
 hunt+0.003
 library+0.003
 park+0.003
 exhibit+0.003
 hall+0.003
Neg logit contributions
lessly-0.003
ously-0.003
der-0.003
ived-0.003
rier-0.003
Token and
Token ID290
Feature activation+1.70e+00

Pos logit contributions
lessly+0.133
ived+0.131
ously+0.130
rier+0.126
der+0.118
Neg logit contributions
 hunt-0.100
 park-0.100
 hall-0.091
 play-0.090
 library-0.089
Token decided
Token ID3066
Feature activation+2.14e+00

Pos logit contributions
lessly+0.041
ously+0.041
der+0.039
rier+0.038
ived+0.038
Neg logit contributions
 hunt-0.045
 exhibit-0.042
 library-0.041
 park-0.040
 hall-0.040
Token she
Token ID673
Feature activation-7.88e-01

Pos logit contributions
lessly+0.059
ously+0.058
rier+0.056
der+0.055
ived+0.055
Neg logit contributions
 hunt-0.050
 park-0.045
 exhibit-0.044
 library-0.044
 hall-0.044
Token didn
Token ID1422
Feature activation+1.69e+00

Pos logit contributions
 hunt+0.019
 park+0.018
 library+0.018
 exhibit+0.017
 hall+0.017
Neg logit contributions
lessly-0.020
ously-0.020
der-0.019
ived-0.019
rier-0.019
Token't
Token ID470
Feature activation+3.70e-01

Pos logit contributions
lessly+0.054
ously+0.053
rier+0.052
ived+0.051
der+0.051
Neg logit contributions
 hunt-0.031
 park-0.028
 library-0.027
 exhibit-0.027
 hall-0.026
Token like
Token ID588
Feature activation-2.67e+00

Pos logit contributions
lessly+0.010
ously+0.010
rier+0.009
ived+0.009
der+0.009
Neg logit contributions
 hunt-0.009
 park-0.008
 exhibit-0.008
 library-0.008
 hall-0.008
Token said
Token ID531
Feature activation+3.25e+00

Pos logit contributions
ously+0.020
lessly+0.019
der+0.019
ived+0.018
rier+0.018
Neg logit contributions
 hunt-0.020
 exhibit-0.019
 hall-0.018
 library-0.018
 park-0.018
Token,
Token ID11
Feature activation+2.01e+00

Pos logit contributions
lessly+0.100
ived+0.097
ously+0.095
rier+0.094
uster+0.089
Neg logit contributions
 hunt-0.056
 park-0.052
 library-0.049
 store-0.049
 play-0.049
Token "
Token ID366
Feature activation-2.78e+00

Pos logit contributions
ously+0.049
ived+0.049
rier+0.048
lessly+0.048
iously+0.043
Neg logit contributions
 hunt-0.049
 park-0.046
 store-0.044
 library-0.044
 hall-0.044
TokenI
Token ID40
Feature activation-2.86e+00

Pos logit contributions
 hunt+0.074
 park+0.069
 library+0.067
 exhibit+0.066
 hall+0.066
Neg logit contributions
lessly-0.066
ously-0.065
rier-0.064
ived-0.062
der-0.062
Token know
Token ID760
Feature activation+4.67e-01

Pos logit contributions
 hunt+0.064
 library+0.058
 park+0.058
 hall+0.058
 exhibit+0.057
Neg logit contributions
lessly-0.080
ously-0.079
ived-0.075
der-0.074
rier-0.073
Token it
Token ID340
Feature activation-2.54e+00

Pos logit contributions
lessly+0.013
ously+0.013
der+0.012
rier+0.012
ived+0.012
Neg logit contributions
 hunt-0.011
 park-0.010
 hall-0.010
 exhibit-0.010
 library-0.009
Token hurts
Token ID20406
Feature activation+1.12e+00

Pos logit contributions
 hunt+0.054
 library+0.050
 park+0.050
 exhibit+0.049
 hall+0.048
Neg logit contributions
der-0.071
lessly-0.071
ously-0.070
ived-0.068
rier-0.067
Token,
Token ID11
Feature activation+2.08e+00

Pos logit contributions
ously+0.031
lessly+0.031
ived+0.030
rier+0.030
der+0.029
Neg logit contributions
 hunt-0.025
 park-0.023
 library-0.021
 hall-0.021
 exhibit-0.021
Token but
Token ID475
Feature activation+2.20e-01

Pos logit contributions
ously+0.061
lessly+0.060
ived+0.058
rier+0.058
der+0.056
Neg logit contributions
 hunt-0.044
 park-0.041
 library-0.039
 hall-0.039
 slide-0.038
Token I
Token ID314
Feature activation-2.18e+00

Pos logit contributions
lessly+0.006
ously+0.006
rier+0.006
der+0.006
ived+0.006
Neg logit contributions
 hunt-0.005
 exhibit-0.004
 park-0.004
 library-0.004
 hall-0.004
Token'm
Token ID1101
Feature activation+3.67e+00

Pos logit contributions
 hunt+0.049
 library+0.045
 park+0.045
 hall+0.045
 exhibit+0.045
Neg logit contributions
lessly-0.060
ously-0.059
der-0.056
ived-0.055
rier-0.054
Token up
Token ID510
Feature activation-7.02e-01

Pos logit contributions
 hunt+0.037
 exhibit+0.033
 park+0.032
 library+0.032
 hall+0.032
Neg logit contributions
lessly-0.051
rier-0.049
ously-0.049
der-0.048
ived-0.046
Token the
Token ID262
Feature activation-3.18e-01

Pos logit contributions
 hunt+0.015
 exhibit+0.014
 park+0.013
 library+0.013
 hall+0.013
Neg logit contributions
lessly-0.021
ously-0.019
rier-0.019
der-0.019
ived-0.018
Token hole
Token ID7604
Feature activation+1.17e+00

Pos logit contributions
 hunt+0.006
 exhibit+0.005
 park+0.005
 library+0.005
 theater+0.005
Neg logit contributions
lessly-0.010
rier-0.010
ously-0.010
der-0.010
ived-0.009
Token.
Token ID13
Feature activation+3.87e-01

Pos logit contributions
lessly+0.031
ously+0.030
rier+0.029
ived+0.029
der+0.029
Neg logit contributions
 hunt-0.028
 park-0.026
 library-0.025
 exhibit-0.025
 hall-0.025
Token When
Token ID1649
Feature activation-3.93e-01

Pos logit contributions
lessly+0.009
ously+0.008
rier+0.008
ived+0.008
der+0.008
Neg logit contributions
 hunt-0.011
 park-0.010
 library-0.010
 exhibit-0.010
 hall-0.010
Token she
Token ID673
Feature activation-6.83e-01

Pos logit contributions
 hunt+0.011
 exhibit+0.010
 park+0.009
 library+0.009
 hall+0.009
Neg logit contributions
lessly-0.009
ously-0.009
ived-0.008
der-0.008
rier-0.008
Token was
Token ID373
Feature activation+2.67e+00

Pos logit contributions
 hunt+0.016
 exhibit+0.015
 library+0.015
 zoo+0.015
 hall+0.014
Neg logit contributions
ously-0.017
ived-0.017
lessly-0.017
der-0.017
iously-0.015
Token done
Token ID1760
Feature activation+2.86e+00

Pos logit contributions
lessly+0.079
ously+0.077
rier+0.076
ived+0.074
der+0.074
Neg logit contributions
 hunt-0.057
 park-0.051
 library-0.050
 exhibit-0.050
 hall-0.049
Token,
Token ID11
Feature activation+2.29e+00

Pos logit contributions
lessly+0.065
ously+0.064
rier+0.062
der+0.061
ived+0.060
Neg logit contributions
 hunt-0.079
 park-0.073
 exhibit-0.072
 library-0.071
 hall-0.069
Token Julie
Token ID21946
Feature activation+2.40e+00

Pos logit contributions
lessly+0.057
ously+0.055
rier+0.053
der+0.052
ived+0.052
Neg logit contributions
 hunt-0.061
 exhibit-0.054
 park-0.053
 library-0.052
 hall-0.050
Token felt
Token ID2936
Feature activation+4.45e+00

Pos logit contributions
ously+0.065
lessly+0.062
ived+0.061
der+0.061
rier+0.057
Neg logit contributions
 hunt-0.055
 library-0.051
 exhibit-0.050
 museum-0.050
 park-0.050
Token birthday
Token ID10955
Feature activation-9.38e-01

Pos logit contributions
 hunt+0.054
 exhibit+0.050
 library+0.048
 park+0.047
 hall+0.046
Neg logit contributions
lessly-0.048
rier-0.046
ously-0.044
ived-0.044
 pleased-0.041
Token hat
Token ID6877
Feature activation-1.24e+00

Pos logit contributions
 hunt+0.016
 exhibit+0.014
|+0.013
 library+0.013
 park+0.013
Neg logit contributions
lessly-0.027
ously-0.027
rier-0.027
ived-0.026
ly-0.025
Token and
Token ID290
Feature activation-8.80e-01

Pos logit contributions
 hunt+0.037
 exhibit+0.034
 library+0.034
 park+0.034
 hall+0.034
Neg logit contributions
lessly-0.025
ously-0.024
der-0.023
rier-0.023
ived-0.022
Token was
Token ID373
Feature activation+6.95e-01

Pos logit contributions
 hunt+0.020
 exhibit+0.018
 park+0.018
 library+0.017
 hall+0.017
Neg logit contributions
lessly-0.023
ously-0.023
der-0.022
ived-0.021
rier-0.021
Token waving
Token ID25849
Feature activation+2.21e+00

Pos logit contributions
lessly+0.016
ously+0.015
rier+0.015
ived+0.015
der+0.015
Neg logit contributions
 hunt-0.019
 exhibit-0.017
 park-0.017
 library-0.017
 hall-0.017
Token to
Token ID284
Feature activation-2.80e+00

Pos logit contributions
lessly+0.061
ously+0.060
rier+0.058
der+0.058
ived+0.058
Neg logit contributions
 hunt-0.049
 park-0.045
 library-0.043
 exhibit-0.043
 hall-0.043
Token all
Token ID477
Feature activation-1.53e+00

Pos logit contributions
 hunt+0.064
 exhibit+0.057
 hall+0.057
 park+0.056
 library+0.056
Neg logit contributions
lessly-0.076
ously-0.073
rier-0.073
der-0.071
ived-0.068
Token his
Token ID465
Feature activation-7.86e-02

Pos logit contributions
 hunt+0.039
 library+0.034
 exhibit+0.034
 park+0.033
 hall+0.032
Neg logit contributions
lessly-0.041
ously-0.038
rier-0.036
ived-0.036
der-0.034
Token friends
Token ID2460
Feature activation-1.06e+00

Pos logit contributions
 hunt+0.002
 park+0.002
 hall+0.002
 exhibit+0.002
 library+0.002
Neg logit contributions
lessly-0.002
ously-0.002
rier-0.002
ived-0.001
der-0.001
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
 hunt+0.025
 park+0.023
 library+0.023
 exhibit+0.023
 hall+0.022
Neg logit contributions
ously-0.028
lessly-0.028
ived-0.026
der-0.025
rier-0.025
Token His
Token ID2399
Feature activation+2.54e-01

Pos logit contributions
 hunt+0.025
 park+0.024
 library+0.023
 exhibit+0.022
 theater+0.022
Neg logit contributions
ously-0.032
lessly-0.032
rier-0.031
ived-0.030
der-0.029
Token have
Token ID423
Feature activation-4.02e+00

Pos logit contributions
rob+0.157
 library+0.134
 exhibit+0.131
 hall+0.127
 hunt+0.125
Neg logit contributions
ived-0.127
ously-0.125
um-0.124
lessly-0.117
rier-0.116
Token a
Token ID257
Feature activation-2.21e+00

Pos logit contributions
 hunt+0.099
 hall+0.094
 library+0.092
 exhibit+0.090
 theater+0.086
Neg logit contributions
ously-0.087
ived-0.086
der-0.085
lessly-0.084
iously-0.072
Token special
Token ID2041
Feature activation-8.46e-01

Pos logit contributions
rob+0.036
 hunt+0.035
nob+0.035
 hall+0.034
e+0.034
Neg logit contributions
ived-0.057
rier-0.055
ously-0.054
lessly-0.054
 pleased-0.053
Token show
Token ID905
Feature activation-2.28e+00

Pos logit contributions
 hunt+0.014
 hall+0.013
 exhibit+0.013
|+0.013
 library+0.013
Neg logit contributions
rier-0.023
ously-0.023
lessly-0.022
der-0.022
ived-0.021
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 hunt+0.058
 library+0.054
 park+0.053
 exhibit+0.052
 hall+0.052
Neg logit contributions
ously-0.055
lessly-0.054
ived-0.051
der-0.051
rier-0.048
Token The
Token ID383
Feature activation-4.63e-01

Pos logit contributions
 hunt+0.035
 park+0.032
 library+0.032
 hall+0.031
 exhibit+0.031
Neg logit contributions
lessly-0.025
ously-0.024
ived-0.023
der-0.022
rier-0.022
Token animals
Token ID4695
Feature activation-1.68e+00

Pos logit contributions
 hunt+0.011
 exhibit+0.010
 hall+0.010
 library+0.009
 park+0.009
Neg logit contributions
rier-0.012
lessly-0.012
der-0.011
ously-0.011
ived-0.011
Token wanted
Token ID2227
Feature activation-1.80e+00

Pos logit contributions
 hunt+0.040
 library+0.038
 park+0.037
 exhibit+0.037
 hall+0.036
Neg logit contributions
ously-0.044
lessly-0.044
ived-0.041
der-0.038
rier-0.037
Token to
Token ID284
Feature activation-5.86e+00

Pos logit contributions
 hunt+0.049
 hall+0.047
 library+0.046
 park+0.045
 exhibit+0.045
Neg logit contributions
ously-0.037
lessly-0.037
ived-0.036
rier-0.034
der-0.034
Token display
Token ID3359
Feature activation-2.02e+00

Pos logit contributions
 exhibit+0.106
 library+0.103
rob+0.100
 hall+0.100
 hunt+0.099
Neg logit contributions
lessly-0.158
ously-0.156
ived-0.155
rier-0.147
 understanding-0.143
Token their
Token ID511
Feature activation-5.43e-01

Pos logit contributions
 hunt+0.050
 library+0.046
 hall+0.045
 park+0.044
 exhibit+0.044
Neg logit contributions
lessly-0.052
rier-0.050
ously-0.049
ived-0.047
der-0.047
Token him
Token ID683
Feature activation-1.30e+00

Pos logit contributions
 hunt+0.059
 hall+0.057
 park+0.056
 library+0.056
 slide+0.055
Neg logit contributions
ously-0.050
lessly-0.050
der-0.047
rier-0.046
iously-0.045
Token.
Token ID13
Feature activation-5.91e-01

Pos logit contributions
 hunt+0.034
 hall+0.034
 library+0.033
rob+0.033
 park+0.033
Neg logit contributions
der-0.025
lessly-0.025
rier-0.024
ously-0.023
iously-0.022
Token
Token ID220
Feature activation+4.92e-01

Pos logit contributions
 hunt+0.014
 exhibit+0.013
 park+0.013
 library+0.013
 hall+0.013
Neg logit contributions
lessly-0.014
ously-0.014
der-0.013
ived-0.013
rier-0.013
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
lessly+0.014
ously+0.014
rier+0.013
ived+0.013
der+0.013
Neg logit contributions
 hunt-0.011
 park-0.010
 library-0.010
 exhibit-0.010
 hall-0.010
Token
Token ID198
Feature activation+5.36e-02

Pos logit contributions
 hunt+0.025
 park+0.022
 library+0.022
 exhibit+0.022
 hall+0.022
Neg logit contributions
lessly-0.025
ously-0.025
ived-0.024
rier-0.023
der-0.023
Token"
Token ID1
Feature activation-4.24e+00

Pos logit contributions
lessly+0.001
ously+0.001
ived+0.001
der+0.001
rier+0.001
Neg logit contributions
 hunt-0.001
 exhibit-0.001
 park-0.001
 library-0.001
 hall-0.001
TokenHi
Token ID17250
Feature activation+2.31e-01

Pos logit contributions
 hunt+0.107
 exhibit+0.102
 library+0.095
 hall+0.093
 theater+0.092
Neg logit contributions
ously-0.100
lessly-0.099
ived-0.094
rier-0.093
der-0.091
Token Max
Token ID5436
Feature activation+2.46e+00

Pos logit contributions
ived+0.005
rier+0.005
ously+0.005
lessly+0.004
der+0.004
Neg logit contributions
 hunt-0.006
 hall-0.006
 library-0.006
 park-0.006
 exhibit-0.006
Token,
Token ID11
Feature activation-7.50e-01

Pos logit contributions
lessly+0.074
ously+0.070
rier+0.070
ived+0.069
der+0.068
Neg logit contributions
 hunt-0.054
 park-0.049
 exhibit-0.046
 library-0.045
 hall-0.045
Token how
Token ID703
Feature activation-2.67e+00

Pos logit contributions
 hunt+0.018
 exhibit+0.017
 hall+0.016
 library+0.016
 park+0.016
Neg logit contributions
lessly-0.018
rier-0.018
der-0.017
ously-0.017
ived-0.017
Token are
Token ID389
Feature activation-2.50e-01

Pos logit contributions
 hunt+0.061
 exhibit+0.056
 park+0.054
 hall+0.054
 zoo+0.053
Neg logit contributions
lessly-0.076
ously-0.067
ived-0.066
der-0.063
rier-0.063
Token hand
Token ID1021
Feature activation-1.81e-01

Pos logit contributions
 hunt+0.057
 exhibit+0.049
 library+0.045
 theater+0.045
 park+0.045
Neg logit contributions
ously-0.089
lessly-0.088
rier-0.085
der-0.083
ived-0.083
Token and
Token ID290
Feature activation-1.41e-01

Pos logit contributions
 hunt+0.005
 exhibit+0.005
 library+0.005
 hall+0.005
 park+0.005
Neg logit contributions
ously-0.004
lessly-0.004
der-0.004
ived-0.003
rier-0.003
Token she
Token ID673
Feature activation-4.64e-01

Pos logit contributions
 hunt+0.004
 exhibit+0.004
 park+0.003
 library+0.003
 theater+0.003
Neg logit contributions
lessly-0.003
ously-0.003
der-0.003
ived-0.003
rier-0.003
Token was
Token ID373
Feature activation+1.46e+00

Pos logit contributions
 hunt+0.011
 park+0.010
 exhibit+0.010
 library+0.010
 hall+0.010
Neg logit contributions
lessly-0.012
ously-0.012
rier-0.011
ived-0.011
der-0.011
Token ready
Token ID3492
Feature activation+1.79e+00

Pos logit contributions
lessly+0.029
rier+0.027
ously+0.027
ived+0.026
der+0.024
Neg logit contributions
 hunt-0.046
 exhibit-0.042
 park-0.041
 library-0.041
 hall-0.040
Token to
Token ID284
Feature activation-5.30e+00

Pos logit contributions
lessly+0.047
ously+0.046
rier+0.044
ived+0.044
der+0.044
Neg logit contributions
 hunt-0.043
 park-0.040
 exhibit-0.039
 library-0.039
 hall-0.039
Token go
Token ID467
Feature activation-1.99e+00

Pos logit contributions
rob+0.124
 exhibit+0.119
 hall+0.116
 library+0.114
|+0.110
Neg logit contributions
ously-0.107
lessly-0.106
ived-0.105
um-0.098
rier-0.097
Token.
Token ID13
Feature activation-7.50e-01

Pos logit contributions
 hunt+0.044
 library+0.042
 exhibit+0.041
 park+0.041
 hall+0.040
Neg logit contributions
lessly-0.052
ously-0.049
ived-0.048
rier-0.047
der-0.045
Token
Token ID198
Feature activation-1.26e+00

Pos logit contributions
 hunt+0.018
 park+0.016
 library+0.016
 exhibit+0.016
 hall+0.015
Neg logit contributions
lessly-0.021
rier-0.020
ously-0.020
der-0.019
ived-0.019
Token
Token ID198
Feature activation-1.96e-01

Pos logit contributions
 hunt+0.027
 park+0.025
 library+0.024
 exhibit+0.024
 hall+0.024
Neg logit contributions
lessly-0.036
ously-0.036
rier-0.035
der-0.034
ived-0.034
TokenMom
Token ID29252
Feature activation-1.38e+00

Pos logit contributions
 hunt+0.005
 park+0.005
 library+0.005
 exhibit+0.005
 hall+0.005
Neg logit contributions
lessly-0.005
rier-0.005
ously-0.005
der-0.004
ived-0.004
Token about
Token ID546
Feature activation-1.85e+00

Pos logit contributions
lessly+0.079
ously+0.077
rier+0.075
der+0.073
ived+0.073
Neg logit contributions
 hunt-0.098
 park-0.090
 library-0.089
 exhibit-0.088
 hall-0.087
Token the
Token ID262
Feature activation+7.96e-01

Pos logit contributions
 hunt+0.044
 exhibit+0.042
 library+0.039
 park+0.038
rob+0.038
Neg logit contributions
der-0.046
lessly-0.044
ously-0.042
rier-0.042
iously-0.040
Token mosquitoes
Token ID35573
Feature activation-1.02e+00

Pos logit contributions
lessly+0.021
rier+0.021
der+0.020
ously+0.019
ived+0.018
Neg logit contributions
 hunt-0.017
 exhibit-0.016
rob-0.014
house-0.014
e-0.014
Token anymore
Token ID7471
Feature activation-9.48e-01

Pos logit contributions
 hunt+0.026
 library+0.025
 exhibit+0.025
 museum+0.025
 park+0.024
Neg logit contributions
lessly-0.024
ously-0.024
der-0.023
ived-0.022
rier-0.021
Token.
Token ID13
Feature activation-3.24e-01

Pos logit contributions
 hunt+0.025
 exhibit+0.023
 park+0.023
 library+0.023
 hall+0.022
Neg logit contributions
der-0.023
lessly-0.022
ously-0.022
ived-0.019
rier-0.018
Token<|endoftext|>
Token ID50256
Feature activation-6.42e+00

Pos logit contributions
 hunt+0.008
 park+0.007
 exhibit+0.007
 library+0.007
 hall+0.007
Neg logit contributions
lessly-0.008
ously-0.008
der-0.008
rier-0.007
ived-0.007
TokenOnce
Token ID7454
Feature activation-3.06e+00

Pos logit contributions
 hunt+0.213
 test+0.189
 library+0.185
 report+0.183
ci+0.182
Neg logit contributions
lessly-0.100
ously-0.091
ived-0.087
der-0.074
um-0.074
Token upon
Token ID2402
Feature activation-3.96e+00

Pos logit contributions
 hunt+0.065
 park+0.064
 hall+0.060
 exhibit+0.060
 library+0.059
Neg logit contributions
rier-0.085
lessly-0.085
ously-0.084
der-0.083
ived-0.080
Token a
Token ID257
Feature activation-7.13e-01

Pos logit contributions
 hunt+0.087
 exhibit+0.080
 hall+0.077
 library+0.076
 park+0.075
Neg logit contributions
lessly-0.111
ously-0.107
der-0.104
ived-0.102
rier-0.101
Token time
Token ID640
Feature activation-6.88e-01

Pos logit contributions
 hunt+0.012
 park+0.011
 library+0.010
 exhibit+0.009
 hall+0.009
Neg logit contributions
rier-0.026
lessly-0.025
ously-0.025
der-0.024
ived-0.024
Token,
Token ID11
Feature activation-3.26e+00

Pos logit contributions
 hunt+0.015
 park+0.014
 exhibit+0.014
 hall+0.014
 library+0.014
Neg logit contributions
lessly-0.020
rier-0.019
ously-0.019
der-0.018
ived-0.018
Token decided
Token ID3066
Feature activation+9.15e-01

Pos logit contributions
 hunt+0.026
 exhibit+0.024
 library+0.024
 hall+0.023
 park+0.023
Neg logit contributions
lessly-0.030
ously-0.030
ived-0.028
der-0.027
rier-0.026
Token to
Token ID284
Feature activation-5.90e+00

Pos logit contributions
lessly+0.025
ously+0.024
rier+0.024
der+0.023
ived+0.023
Neg logit contributions
 hunt-0.022
 park-0.020
 exhibit-0.019
 library-0.019
 hall-0.019
Token take
Token ID1011
Feature activation-2.47e+00

Pos logit contributions
rob+0.128
 exhibit+0.116
 library+0.114
 hunt+0.108
 hall+0.107
Neg logit contributions
um-0.144
ived-0.135
ously-0.134
rier-0.131
lessly-0.130
Token Sus
Token ID8932
Feature activation+1.37e+00

Pos logit contributions
 hunt+0.055
 library+0.048
 park+0.048
 exhibit+0.048
 hall+0.048
Neg logit contributions
lessly-0.066
ived-0.065
ously-0.065
rier-0.064
der-0.062
Tokenie
Token ID494
Feature activation-1.25e+00

Pos logit contributions
lessly+0.030
ously+0.030
rier+0.028
ived+0.028
der+0.027
Neg logit contributions
 hunt-0.040
 park-0.036
 library-0.036
 exhibit-0.035
 hall-0.035
Token to
Token ID284
Feature activation-6.01e+00

Pos logit contributions
rob+0.035
 hunt+0.033
 library+0.033
 theater+0.032
 museum+0.032
Neg logit contributions
lessly-0.022
ously-0.022
ived-0.022
ly-0.019
uster-0.019
Token a
Token ID257
Feature activation-2.90e+00

Pos logit contributions
rob+0.146
 exhibit+0.142
c+0.134
ci+0.134
robat+0.132
Neg logit contributions
 understanding-0.117
rier-0.106
 pleased-0.105
 expression-0.104
 realizing-0.104
Token restaurant
Token ID7072
Feature activation-3.12e+00

Pos logit contributions
rob+0.051
 hunt+0.048
e+0.046
nob+0.046
ci+0.042
Neg logit contributions
rier-0.071
ived-0.069
 louder-0.068
 pleased-0.068
lessly-0.068
Token for
Token ID329
Feature activation-4.01e+00

Pos logit contributions
 hunt+0.081
 exhibit+0.077
c+0.076
rob+0.076
|+0.076
Neg logit contributions
lessly-0.065
ously-0.060
der-0.059
rier-0.059
edly-0.053
Token dinner
Token ID8073
Feature activation-1.47e+00

Pos logit contributions
 hunt+0.082
 exhibit+0.074
 park+0.072
 library+0.072
 hall+0.071
Neg logit contributions
lessly-0.117
ously-0.114
rier-0.113
der-0.111
ived-0.111
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
 hunt+0.035
 exhibit+0.034
 library+0.033
 park+0.033
 hall+0.032
Neg logit contributions
lessly-0.036
der-0.035
ously-0.034
rier-0.034
ived-0.032
Tokenmy
Token ID1820
Feature activation-4.29e-01

Pos logit contributions
 hunt+0.035
 library+0.032
 exhibit+0.031
 hall+0.031
 park+0.031
Neg logit contributions
ously-0.048
lessly-0.047
der-0.044
ived-0.043
rier-0.043
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 hunt+0.009
 park+0.008
 library+0.008
 exhibit+0.008
 hall+0.008
Neg logit contributions
lessly-0.013
ously-0.012
rier-0.012
der-0.012
ived-0.012
Token Tim
Token ID5045
Feature activation-1.55e+00

Pos logit contributions
 hunt+0.024
 park+0.021
 library+0.021
 exhibit+0.021
 hall+0.021
Neg logit contributions
lessly-0.028
ously-0.028
rier-0.027
ived-0.026
der-0.026
Tokenmy
Token ID1820
Feature activation-7.74e-01

Pos logit contributions
 hunt+0.035
 exhibit+0.031
 library+0.031
 theater+0.030
 hall+0.030
Neg logit contributions
lessly-0.041
ously-0.040
der-0.037
ived-0.036
rier-0.035
Token liked
Token ID8288
Feature activation-2.85e+00

Pos logit contributions
 hunt+0.020
 exhibit+0.018
 library+0.018
 park+0.018
 hall+0.017
Neg logit contributions
lessly-0.018
ously-0.018
der-0.017
ived-0.016
rier-0.016
Token to
Token ID284
Feature activation-5.64e+00

Pos logit contributions
 hunt+0.077
 exhibit+0.074
 hall+0.071
house+0.070
 library+0.070
Neg logit contributions
der-0.054
ived-0.054
ously-0.054
lessly-0.053
rier-0.050
Token explore
Token ID7301
Feature activation-4.19e+00

Pos logit contributions
 exhibit+0.070
rob+0.061
 library+0.061
 hunt+0.060
 zoo+0.058
Neg logit contributions
ously-0.178
lessly-0.172
ived-0.172
der-0.167
 expression-0.162
Token and
Token ID290
Feature activation-1.87e+00

Pos logit contributions
 hunt+0.093
 exhibit+0.086
 library+0.084
 hall+0.082
 park+0.080
Neg logit contributions
lessly-0.108
ously-0.103
der-0.101
rier-0.096
ived-0.091
Token go
Token ID467
Feature activation-1.50e+00

Pos logit contributions
 exhibit+0.025
 hunt+0.025
rob+0.024
 library+0.023
 park+0.023
Neg logit contributions
lessly-0.059
ously-0.057
der-0.054
ived-0.052
rier-0.051
Token on
Token ID319
Feature activation-2.78e+00

Pos logit contributions
 hunt+0.032
 library+0.031
 exhibit+0.030
 park+0.030
 theater+0.029
Neg logit contributions
lessly-0.040
ously-0.037
ived-0.036
rier-0.034
der-0.034
Token adventures
Token ID17545
Feature activation-2.75e+00

Pos logit contributions
 hunt+0.048
 exhibit+0.043
 bake+0.042
 library+0.042
 park+0.041
Neg logit contributions
lessly-0.080
ously-0.078
der-0.077
ived-0.074
iously-0.073
Token year
Token ID614
Feature activation+2.04e+00

Pos logit contributions
lessly+0.056
ously+0.055
rier+0.054
ived+0.053
der+0.052
Neg logit contributions
 hunt-0.049
 park-0.046
 exhibit-0.045
 hall-0.045
 library-0.044
Token old
Token ID1468
Feature activation+2.74e-01

Pos logit contributions
ously+0.050
lessly+0.048
ived+0.047
rier+0.046
der+0.045
Neg logit contributions
 hunt-0.053
 park-0.050
 library-0.049
 hall-0.049
 exhibit-0.049
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
lessly+0.007
ously+0.007
rier+0.007
der+0.007
ived+0.007
Neg logit contributions
 hunt-0.007
 park-0.006
 exhibit-0.006
 library-0.006
 hall-0.006
Token He
Token ID679
Feature activation-1.73e+00

Pos logit contributions
 hunt+0.039
 park+0.037
 library+0.036
 exhibit+0.036
 hall+0.036
Neg logit contributions
lessly-0.025
ously-0.025
rier-0.024
ived-0.024
der-0.023
Token wanted
Token ID2227
Feature activation-2.68e+00

Pos logit contributions
 hunt+0.044
 exhibit+0.041
 park+0.038
 hall+0.037
 zoo+0.037
Neg logit contributions
lessly-0.040
der-0.038
ously-0.038
ived-0.034
iously-0.032
Token to
Token ID284
Feature activation-6.46e+00

Pos logit contributions
 hunt+0.073
 exhibit+0.067
 hall+0.066
 library+0.065
 park+0.064
Neg logit contributions
lessly-0.055
der-0.054
ously-0.054
ived-0.051
rier-0.051
Token go
Token ID467
Feature activation-2.37e+00

Pos logit contributions
rob+0.130
 exhibit+0.123
 library+0.115
 hunt+0.111
 hall+0.111
Neg logit contributions
ously-0.154
lessly-0.150
ived-0.150
um-0.149
 expression-0.149
Token to
Token ID284
Feature activation-6.23e+00

Pos logit contributions
 hunt+0.058
 library+0.057
 exhibit+0.056
 hall+0.055
 park+0.054
Neg logit contributions
lessly-0.051
ived-0.047
ously-0.047
rier-0.043
der-0.042
Token the
Token ID262
Feature activation-3.08e+00

Pos logit contributions
house+0.134
ci+0.130
asis+0.129
c+0.127
uch+0.126
Neg logit contributions
der-0.121
 understanding-0.118
lessly-0.117
 widely-0.111
ously-0.111
Token church
Token ID4928
Feature activation-3.53e+00

Pos logit contributions
 hunt+0.039
nob+0.035
e+0.033
c+0.033
|+0.032
Neg logit contributions
rier-0.098
der-0.095
lessly-0.087
um-0.087
 louder-0.087
Token,
Token ID11
Feature activation+2.95e-01

Pos logit contributions
 hunt+0.095
 park+0.088
rob+0.087
 exhibit+0.086
 library+0.086
Neg logit contributions
lessly-0.071
rier-0.069
ously-0.065
der-0.064
iously-0.060