TOP ACTIVATIONS
MAX = 9.772

very careful when you â re in the bath
stay safe when we â re driving . â
when you open it â . Molly opened the box
dizz y when you â re having fun .
the car until you â re big enough to
to sit in Emma â s nice cell .
sitting in the neighbor â s garden .
somewhere safe until it â s gone . â
special place in Lily â s room . Lily
many times . <|endoftext|> â Come on ,
hanging in their daughter â s room . <|endoftext|>
As long as we â re together , we
face danger when you â re on your own
the swoop ing birds â unusual colour .
deer again . <|endoftext|> â Can I play
give up when you â re trying to achieve
were playing in Molly â s garden . Suddenly
can feel when it â s filled with new
they reached the doctor â s office , his
placed safely in Tiny â s pocket and they

MIN ACTIVATIONS
MIN = -8.656

not real , but a stuffed animal that belonged to her
something new ! She thought av oc ados were the most
mom many questions . She teaches him the names of the
girl said . Offic er Roy had an idea
Ben say . Offic er Mike shows them his
's mom smiles . She teaches them how to be safe
his best friend , a stuffed bear named Teddy , went
elephant . That 's a stuffed toy . Someone left it
everywhere . It was a stuffed blue bear with a smile
a big surprise - a stuffed animal ! Esther was so
was inside - cars , stuffed animals , and a game
books , and even a stuffed animal . After a few
leak . The toy was stuffed with stuffing , but the
favourite toy - a giant stuffed animal he had been dreaming
something new ! She thought av oc ados were the most
This story teaches us a valuable lesson :
hands to talk . He teaches Sara some signs .
â s story teaches us an important lesson -
smart and funny . He teaches the mean boy a lesson
ig . This story teaches us that when we work

INTERVAL 5.165 - 9.772
CONTAINS 0.683%

play the fl ute again tomorrow . <|endoftext|> Once upon a
and soon she was tapping her feet along with the music
him . â Please help me
by the lake ? â . The all
and said : I need food

INTERVAL 0.558 - 5.165
CONTAINS 39.705%

to always ask for help when she needed it .
. She made a heart out of red cloth and st
large rock . John was so excited . They
The brother and sister stayed mad at each other . They
pocket . The coin fell out of his mouth and into

INTERVAL -4.049 - 0.558
CONTAINS 57.660%

They escaped the hunter 's view , and were not in
still go swimming . So they went back to the pool
ask Sam if he would like to play with it .
It is so pretty . Please let me play with it
past him and won the race . Max 's

INTERVAL -8.656 - -4.049
CONTAINS 1.952%

sorry . They say : " We are sorry , sir
Bread ," Tom says . " Because we fed it bread
her coat and hat . " I 'm cold and hungry
runs away . " Ha ha !" Tom laughs
said Mom . And a loud cheer erupted from the family
Token very
Token ID845
Feature activation+2.03e+00

Pos logit contributions
 eyel+0.032
gan+0.030
adow+0.029
 Moe+0.028
sing+0.028
Neg logit contributions
Offic-0.049
Mic-0.040
corn-0.038
oc-0.038
ercise-0.038
Token careful
Token ID8161
Feature activation+5.16e-01

Pos logit contributions
 eyel+0.048
gan+0.046
sing+0.045
 spine+0.045
 Moe+0.045
Neg logit contributions
Offic-0.054
ercise-0.043
corn-0.042
Mic-0.042
 permitting-0.042
Token when
Token ID618
Feature activation+3.12e+00

Pos logit contributions
 eyel+0.010
gan+0.010
 spine+0.010
sing+0.010
adow+0.010
Neg logit contributions
Offic-0.016
Mic-0.013
ercise-0.013
Sir-0.012
corn-0.012
Token you
Token ID345
Feature activation+3.94e+00

Pos logit contributions
 eyel+0.089
gan+0.087
sing+0.086
 spine+0.086
 Moe+0.084
Neg logit contributions
Offic-0.064
 permitting-0.050
ercise-0.049
corn-0.048
 allowing-0.047
Tokenâ
Token ID22940
Feature activation+1.63e+00

Pos logit contributions
 eyel+0.096
gan+0.093
 spine+0.091
sing+0.091
 Moe+0.090
Neg logit contributions
Offic-0.099
ercise-0.080
 permitting-0.078
corn-0.077
 whip-0.076
Token
Token ID26391
Feature activation+9.77e+00

Pos logit contributions
 eyel+0.032
gan+0.031
sing+0.030
 spine+0.030
 Moe+0.030
Neg logit contributions
Offic-0.049
ercise-0.041
corn-0.040
 permitting-0.040
Mic-0.039
Token
Token ID8151
Feature activation+3.17e+00

Pos logit contributions
 eyel+0.241
gan+0.230
sing+0.230
 spine+0.228
 Moe+0.225
Neg logit contributions
Offic-0.242
corn-0.194
ercise-0.192
 permitting-0.189
"-0.184
Tokenre
Token ID260
Feature activation+4.48e+00

Pos logit contributions
 eyel+0.075
sing+0.071
gan+0.071
 spine+0.071
 Moe+0.069
Neg logit contributions
Offic-0.083
ercise-0.066
 permitting-0.065
corn-0.065
Mic-0.064
Token in
Token ID287
Feature activation+1.87e+00

Pos logit contributions
 eyel+0.093
gan+0.089
 Moe+0.087
 spine+0.087
sing+0.087
Neg logit contributions
Offic-0.124
 permitting-0.107
ercise-0.106
 allowing-0.103
 whip-0.102
Token the
Token ID262
Feature activation+2.45e+00

Pos logit contributions
 eyel+0.035
 Moe+0.032
 spine+0.032
 toilet+0.031
oppy+0.030
Neg logit contributions
Offic-0.060
ercise-0.050
Mic-0.050
corn-0.048
Sir-0.047
Token bath
Token ID7837
Feature activation+4.38e+00

Pos logit contributions
gan+0.053
sing+0.053
sed+0.052
owed+0.049
 spine+0.049
Neg logit contributions
Offic-0.063
Can-0.060
 whip-0.057
"-0.056
 permitting-0.056
Token stay
Token ID2652
Feature activation+2.24e+00

Pos logit contributions
Offic+0.017
corn+0.014
Mic+0.013
ercise+0.013
 permitting+0.013
Neg logit contributions
 eyel-0.015
gan-0.014
 Moe-0.014
 Mil-0.014
sing-0.014
Token safe
Token ID3338
Feature activation+1.10e+00

Pos logit contributions
 eyel+0.044
gan+0.042
sing+0.042
adow+0.041
 spine+0.041
Neg logit contributions
Offic-0.069
Mic-0.057
corn-0.054
ercise-0.053
Sir-0.053
Token when
Token ID618
Feature activation+3.72e+00

Pos logit contributions
 eyel+0.024
gan+0.024
 spine+0.024
 Moe+0.023
sing+0.023
Neg logit contributions
Offic-0.030
ercise-0.024
Mic-0.024
corn-0.023
 permitting-0.023
Token we
Token ID356
Feature activation+1.09e+00

Pos logit contributions
 eyel+0.105
sing+0.103
gan+0.102
 spine+0.102
adow+0.097
Neg logit contributions
Offic-0.078
 permitting-0.061
corn-0.058
ercise-0.057
 allowing-0.057
Tokenâ
Token ID22940
Feature activation+1.80e+00

Pos logit contributions
 eyel+0.025
gan+0.024
adow+0.023
sing+0.023
 Moe+0.023
Neg logit contributions
Offic-0.030
corn-0.024
Mic-0.023
 permitting-0.023
ercise-0.023
Token
Token ID26391
Feature activation+9.76e+00

Pos logit contributions
 eyel+0.035
gan+0.033
 spine+0.032
sing+0.032
 Moe+0.032
Neg logit contributions
Offic-0.055
Mic-0.045
ercise-0.045
 permitting-0.045
corn-0.044
Token
Token ID8151
Feature activation+3.08e+00

Pos logit contributions
 eyel+0.230
gan+0.224
 spine+0.220
sing+0.216
 Moe+0.214
Neg logit contributions
Offic-0.254
ercise-0.203
Mic-0.202
 permitting-0.198
 whip-0.196
Tokenre
Token ID260
Feature activation+4.39e+00

Pos logit contributions
 eyel+0.081
sing+0.078
 spine+0.077
gan+0.077
 Moe+0.075
Neg logit contributions
Offic-0.072
ercise-0.056
 permitting-0.055
corn-0.055
Mic-0.054
Token driving
Token ID5059
Feature activation+1.78e-02

Pos logit contributions
 eyel+0.090
gan+0.086
sing+0.084
 spine+0.084
 Moe+0.083
Neg logit contributions
Offic-0.129
ercise-0.105
 permitting-0.104
corn-0.103
Mic-0.103
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
 eyel+0.000
gan+0.000
 Moe+0.000
 spine+0.000
sing+0.000
Neg logit contributions
Offic-0.001
Mic-0.000
ercise-0.000
corn-0.000
 permitting-0.000
Tokenâ
Token ID22940
Feature activation-2.21e-01

Pos logit contributions
gan+0.025
 Moe+0.025
 eyel+0.024
 Mil+0.024
 O+0.023
Neg logit contributions
Offic-0.043
ercise-0.036
Mic-0.035
corn-0.035
 whip-0.035
Token when
Token ID618
Feature activation+1.70e+00

Pos logit contributions
Offic+0.030
ercise+0.024
 permitting+0.024
corn+0.024
Mic+0.023
Neg logit contributions
 eyel-0.027
gan-0.026
sing-0.026
 spine-0.026
 Moe-0.025
Token you
Token ID345
Feature activation+3.89e+00

Pos logit contributions
 eyel+0.050
sing+0.049
gan+0.049
 spine+0.048
 Moe+0.047
Neg logit contributions
Offic-0.034
 permitting-0.025
ercise-0.025
corn-0.024
 allowing-0.024
Token open
Token ID1280
Feature activation+8.62e-01

Pos logit contributions
 eyel+0.095
gan+0.091
sing+0.090
 spine+0.090
 Moe+0.089
Neg logit contributions
Offic-0.099
ercise-0.079
 permitting-0.077
corn-0.076
Mic-0.075
Token it
Token ID340
Feature activation+5.96e+00

Pos logit contributions
 eyel+0.020
 Flor+0.020
 Moe+0.020
 Mil+0.019
gan+0.019
Neg logit contributions
Offic-0.023
Mic-0.019
ercise-0.018
Thank-0.018
 permitting-0.018
Tokenâ
Token ID22940
Feature activation+2.16e+00

Pos logit contributions
 eyel+0.121
gan+0.116
 spine+0.113
 Moe+0.112
sing+0.112
Neg logit contributions
Offic-0.177
ercise-0.144
 permitting-0.142
Mic-0.141
corn-0.140
Token
Token ID26391
Feature activation+9.69e+00

Pos logit contributions
 eyel+0.037
gan+0.035
sing+0.034
 spine+0.034
 Moe+0.034
Neg logit contributions
Offic-0.071
ercise-0.060
corn-0.059
 permitting-0.058
Mic-0.058
Token.
Token ID13
Feature activation+1.94e+00

Pos logit contributions
 eyel+0.227
gan+0.217
 spine+0.214
adow+0.210
oppy+0.209
Neg logit contributions
Offic-0.256
ercise-0.205
 permitting-0.203
Mic-0.202
 whip-0.199
Token Molly
Token ID30236
Feature activation+1.53e+00

Pos logit contributions
gan+0.040
 Moe+0.039
 eyel+0.039
 Flor+0.038
 Mil+0.038
Neg logit contributions
Offic-0.056
ercise-0.047
 whip-0.047
corn-0.045
 permitting-0.044
Token opened
Token ID4721
Feature activation-2.04e-03

Pos logit contributions
 eyel+0.033
gan+0.032
sing+0.031
 spine+0.031
 Moe+0.030
Neg logit contributions
Offic-0.043
ercise-0.035
 permitting-0.034
corn-0.034
Mic-0.034
Token the
Token ID262
Feature activation+3.73e+00

Pos logit contributions
Offic+0.000
Mic+0.000
ercise+0.000
 permitting+0.000
corn+0.000
Neg logit contributions
 Moe-0.000
 eyel-0.000
 Flor-0.000
 Mil-0.000
gan-0.000
Token box
Token ID3091
Feature activation+2.55e+00

Pos logit contributions
gan+0.075
 eyel+0.075
sing+0.074
 Moe+0.071
 spine+0.070
Neg logit contributions
Offic-0.108
ercise-0.090
 whip-0.088
 permitting-0.087
corn-0.087
Token dizz
Token ID29527
Feature activation+4.72e-01

Pos logit contributions
 eyel+0.068
gan+0.065
sing+0.064
 spine+0.064
 Moe+0.063
Neg logit contributions
Offic-0.083
ercise-0.066
 permitting-0.065
corn-0.065
Mic-0.065
Tokeny
Token ID88
Feature activation+1.53e+00

Pos logit contributions
 eyel+0.015
 Flor+0.014
sed+0.014
 Moe+0.014
 thirst+0.014
Neg logit contributions
Offic-0.006
"-0.006
Can-0.005
Mic-0.005
 permitting-0.005
Token when
Token ID618
Feature activation+3.73e+00

Pos logit contributions
 eyel+0.030
gan+0.029
sing+0.028
 spine+0.028
 Moe+0.028
Neg logit contributions
Offic-0.046
ercise-0.038
 permitting-0.037
corn-0.037
Mic-0.037
Token you
Token ID345
Feature activation+4.20e+00

Pos logit contributions
gan+0.116
 eyel+0.113
sing+0.112
 spine+0.112
adow+0.108
Neg logit contributions
Offic-0.065
 permitting-0.053
"-0.051
 allowing-0.050
corn-0.048
Tokenâ
Token ID22940
Feature activation+1.51e+00

Pos logit contributions
 eyel+0.106
gan+0.102
 spine+0.101
sing+0.100
 Moe+0.098
Neg logit contributions
Offic-0.096
ercise-0.081
 whip-0.080
 permitting-0.076
"-0.075
Token
Token ID26391
Feature activation+9.55e+00

Pos logit contributions
 eyel+0.028
sing+0.027
gan+0.027
 spine+0.026
 Moe+0.026
Neg logit contributions
Offic-0.047
ercise-0.042
corn-0.039
 whip-0.038
 permitting-0.038
Token
Token ID8151
Feature activation+3.12e+00

Pos logit contributions
 eyel+0.229
gan+0.219
sing+0.216
 spine+0.216
 Moe+0.213
Neg logit contributions
Offic-0.249
ercise-0.197
 permitting-0.193
corn-0.193
Mic-0.191
Tokenre
Token ID260
Feature activation+4.48e+00

Pos logit contributions
 eyel+0.084
sing+0.083
 spine+0.081
sed+0.079
gan+0.078
Neg logit contributions
Offic-0.070
"-0.054
 permitting-0.054
ercise-0.053
Mic-0.053
Token having
Token ID1719
Feature activation+7.25e-01

Pos logit contributions
 eyel+0.096
gan+0.094
 Moe+0.093
sing+0.092
 spine+0.091
Neg logit contributions
Offic-0.111
 permitting-0.102
 allowing-0.101
ercise-0.099
"-0.099
Token fun
Token ID1257
Feature activation+1.61e+00

Pos logit contributions
gan+0.018
 spine+0.018
sing+0.018
 eyel+0.018
 Moe+0.017
Neg logit contributions
Offic-0.017
 permitting-0.014
ercise-0.014
 whip-0.013
 allowing-0.013
Token.
Token ID13
Feature activation+1.88e+00

Pos logit contributions
 eyel+0.030
gan+0.029
sing+0.029
 spine+0.028
 Moe+0.028
Neg logit contributions
Offic-0.049
ercise-0.041
 permitting-0.040
corn-0.040
Mic-0.040
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
 eyel+0.018
gan+0.018
sing+0.017
 spine+0.017
 Moe+0.017
Neg logit contributions
Offic-0.016
ercise-0.013
 permitting-0.012
corn-0.012
Mic-0.012
Token car
Token ID1097
Feature activation-6.68e-01

Pos logit contributions
 eyel+0.036
gan+0.035
sing+0.034
 spine+0.034
 Moe+0.033
Neg logit contributions
Offic-0.043
ercise-0.034
 permitting-0.034
 whip-0.034
corn-0.033
Token until
Token ID1566
Feature activation+3.74e+00

Pos logit contributions
Offic+0.018
ercise+0.014
 permitting+0.014
Mic+0.014
corn+0.014
Neg logit contributions
 eyel-0.015
gan-0.015
sing-0.014
 spine-0.014
 Moe-0.014
Token you
Token ID345
Feature activation+4.21e+00

Pos logit contributions
 eyel+0.099
gan+0.095
 spine+0.093
sing+0.093
 Moe+0.093
Neg logit contributions
Offic-0.088
ercise-0.068
corn-0.066
 permitting-0.066
Mic-0.066
Tokenâ
Token ID22940
Feature activation+1.96e+00

Pos logit contributions
 eyel+0.096
 spine+0.091
sing+0.091
gan+0.090
 Moe+0.087
Neg logit contributions
Offic-0.112
ercise-0.093
 permitting-0.090
corn-0.089
 whip-0.088
Token
Token ID26391
Feature activation+9.41e+00

Pos logit contributions
 eyel+0.029
gan+0.028
sing+0.027
 spine+0.027
 Moe+0.026
Neg logit contributions
Offic-0.068
ercise-0.058
corn-0.057
 permitting-0.057
Mic-0.056
Token
Token ID8151
Feature activation+3.16e+00

Pos logit contributions
 eyel+0.226
gan+0.218
 spine+0.213
sing+0.212
 Moe+0.211
Neg logit contributions
Offic-0.243
ercise-0.192
Mic-0.189
 permitting-0.188
corn-0.185
Tokenre
Token ID260
Feature activation+4.27e+00

Pos logit contributions
 eyel+0.074
sing+0.072
 spine+0.071
sed+0.067
gan+0.066
Neg logit contributions
Offic-0.083
"-0.067
corn-0.066
 permitting-0.066
ercise-0.065
Token big
Token ID1263
Feature activation-1.89e+00

Pos logit contributions
 eyel+0.094
 spine+0.090
sing+0.088
gan+0.087
 Moe+0.087
Neg logit contributions
Offic-0.112
 permitting-0.098
 allowing-0.094
ercise-0.093
"-0.091
Token enough
Token ID1576
Feature activation+9.46e-01

Pos logit contributions
Offic+0.058
Mic+0.047
ercise+0.046
corn+0.045
 permitting+0.045
Neg logit contributions
 eyel-0.037
gan-0.036
 spine-0.035
 Moe-0.035
sing-0.034
Token to
Token ID284
Feature activation+1.22e+00

Pos logit contributions
 eyel+0.021
gan+0.020
 spine+0.020
sing+0.019
 Moe+0.019
Neg logit contributions
Offic-0.027
Mic-0.021
ercise-0.021
corn-0.021
 permitting-0.021
Token to
Token ID284
Feature activation+3.63e-01

Pos logit contributions
Offic+0.041
ercise+0.033
 permitting+0.033
corn+0.032
Mic+0.032
Neg logit contributions
 eyel-0.040
gan-0.039
 spine-0.039
sing-0.038
oppy-0.037
Token sit
Token ID1650
Feature activation+7.69e-01

Pos logit contributions
 eyel+0.009
gan+0.008
sing+0.008
 Moe+0.008
 spine+0.008
Neg logit contributions
Offic-0.010
ercise-0.008
corn-0.008
 permitting-0.007
Mic-0.007
Token in
Token ID287
Feature activation+3.17e+00

Pos logit contributions
 eyel+0.017
gan+0.016
 spine+0.016
sing+0.016
 Moe+0.016
Neg logit contributions
Offic-0.021
ercise-0.017
corn-0.017
 permitting-0.017
Mic-0.017
Token Emma
Token ID18966
Feature activation+4.11e+00

Pos logit contributions
 eyel+0.061
 Moe+0.058
sing+0.056
gan+0.056
 Flor+0.055
Neg logit contributions
Offic-0.097
ercise-0.083
Mic-0.079
corn-0.078
 permitting-0.077
Tokenâ
Token ID22940
Feature activation+1.29e+00

Pos logit contributions
 eyel+0.106
gan+0.103
 Moe+0.101
sing+0.100
 spine+0.100
Neg logit contributions
Offic-0.096
ercise-0.076
corn-0.075
 permitting-0.073
Mic-0.073
Token
Token ID26391
Feature activation+9.36e+00

Pos logit contributions
 eyel+0.025
gan+0.024
 spine+0.024
sing+0.024
 Moe+0.023
Neg logit contributions
Offic-0.039
ercise-0.032
 permitting-0.031
corn-0.031
Mic-0.031
Token
Token ID8151
Feature activation+2.83e+00

Pos logit contributions
 eyel+0.222
gan+0.213
sing+0.210
 spine+0.209
 Moe+0.208
Neg logit contributions
Offic-0.244
ercise-0.193
corn-0.191
 permitting-0.190
Mic-0.188
Tokens
Token ID82
Feature activation+4.28e+00

Pos logit contributions
 eyel+0.063
sing+0.059
gan+0.058
 spine+0.058
 Moe+0.057
Neg logit contributions
Offic-0.078
ercise-0.063
 permitting-0.063
corn-0.062
Mic-0.062
Token nice
Token ID3621
Feature activation-5.07e-01

Pos logit contributions
gan+0.106
ola+0.105
 Moe+0.105
oppy+0.104
agg+0.104
Neg logit contributions
Offic-0.088
 whip-0.083
 saddle-0.075
 newspapers-0.074
 strings-0.071
Token cell
Token ID2685
Feature activation+2.32e+00

Pos logit contributions
Offic+0.013
 whip+0.011
ercise+0.010
corn+0.010
 newspapers+0.010
Neg logit contributions
gan-0.011
sing-0.011
 eyel-0.011
 Moe-0.011
oppy-0.011
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
 eyel+0.050
gan+0.048
sing+0.047
 spine+0.047
 Moe+0.047
Neg logit contributions
Offic-0.065
ercise-0.053
 permitting-0.052
corn-0.052
Mic-0.051
Token sitting
Token ID5586
Feature activation+1.35e+00

Pos logit contributions
 eyel+0.014
gan+0.013
sing+0.013
 spine+0.013
 Moe+0.013
Neg logit contributions
Offic-0.017
 permitting-0.014
ercise-0.014
corn-0.014
 allowing-0.014
Token in
Token ID287
Feature activation+2.68e+00

Pos logit contributions
 eyel+0.036
sing+0.035
 spine+0.035
gan+0.034
 Moe+0.034
Neg logit contributions
Offic-0.031
ercise-0.024
 permitting-0.024
corn-0.024
Can-0.023
Token the
Token ID262
Feature activation+3.66e+00

Pos logit contributions
 eyel+0.054
 Moe+0.050
gan+0.048
 Flor+0.048
 Mil+0.048
Neg logit contributions
Offic-0.080
ercise-0.070
Mic-0.067
corn-0.067
 permitting-0.064
Token neighbor
Token ID4780
Feature activation+1.25e+00

Pos logit contributions
gan+0.093
oppy+0.092
sing+0.092
 Moe+0.089
 eyel+0.088
Neg logit contributions
Offic-0.090
Can-0.072
Thank-0.069
 whip-0.068
ercise-0.068
Tokenâ
Token ID22940
Feature activation+1.32e+00

Pos logit contributions
 eyel+0.031
gan+0.030
sing+0.030
 spine+0.030
 Moe+0.029
Neg logit contributions
Offic-0.031
ercise-0.024
corn-0.024
 permitting-0.024
Mic-0.024
Token
Token ID26391
Feature activation+9.34e+00

Pos logit contributions
 eyel+0.028
gan+0.027
sing+0.027
 Moe+0.026
 spine+0.026
Neg logit contributions
Offic-0.038
ercise-0.031
 permitting-0.030
corn-0.030
Mic-0.030
Token
Token ID8151
Feature activation+2.46e+00

Pos logit contributions
 eyel+0.222
gan+0.213
sing+0.209
 spine+0.209
 Moe+0.206
Neg logit contributions
Offic-0.245
ercise-0.194
 permitting-0.190
corn-0.190
Mic-0.188
Tokens
Token ID82
Feature activation+3.65e+00

Pos logit contributions
 eyel+0.053
gan+0.050
sing+0.050
 spine+0.050
 Moe+0.049
Neg logit contributions
Offic-0.070
ercise-0.056
 permitting-0.056
corn-0.055
Mic-0.055
Token garden
Token ID11376
Feature activation+2.26e+00

Pos logit contributions
gan+0.077
 eyel+0.074
 Moe+0.074
oppy+0.072
sing+0.070
Neg logit contributions
Offic-0.097
 whip-0.081
 newspapers-0.080
 permitting-0.077
"-0.077
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
 eyel+0.042
gan+0.040
sing+0.040
 spine+0.039
 Moe+0.039
Neg logit contributions
Offic-0.070
ercise-0.058
 permitting-0.057
corn-0.057
Mic-0.056
Token
Token ID220
Feature activation+3.26e+00

Pos logit contributions
 eyel+0.033
gan+0.032
 spine+0.031
sing+0.031
 Moe+0.031
Neg logit contributions
Offic-0.047
ercise-0.039
 permitting-0.038
corn-0.038
Mic-0.038
Token somewhere
Token ID7382
Feature activation+2.89e+00

Pos logit contributions
Offic+0.008
ercise+0.006
 permitting+0.006
corn+0.006
Mic+0.006
Neg logit contributions
 eyel-0.005
gan-0.005
sing-0.005
 spine-0.005
 Moe-0.005
Token safe
Token ID3338
Feature activation+1.47e-01

Pos logit contributions
 eyel+0.067
gan+0.064
sing+0.064
 spine+0.063
 Moe+0.062
Neg logit contributions
Offic-0.075
 permitting-0.062
ercise-0.061
corn-0.060
 allowing-0.060
Token until
Token ID1566
Feature activation+4.18e+00

Pos logit contributions
 eyel+0.003
gan+0.003
 spine+0.003
sing+0.003
 Moe+0.003
Neg logit contributions
Offic-0.004
ercise-0.003
corn-0.003
Mic-0.003
 permitting-0.003
Token it
Token ID340
Feature activation+4.32e+00

Pos logit contributions
 eyel+0.120
gan+0.116
sing+0.115
 spine+0.115
 Moe+0.113
Neg logit contributions
Offic-0.088
ercise-0.066
 permitting-0.065
corn-0.064
Mic-0.062
Tokenâ
Token ID22940
Feature activation+1.87e+00

Pos logit contributions
 spine+0.124
sing+0.121
 Moe+0.118
uzz+0.116
 eyel+0.115
Neg logit contributions
"-0.075
Offic-0.069
oc-0.062
corn-0.061
ercise-0.060
Token
Token ID26391
Feature activation+9.27e+00

Pos logit contributions
 eyel+0.029
sing+0.028
gan+0.027
 spine+0.026
 Moe+0.026
Neg logit contributions
Offic-0.064
ercise-0.055
corn-0.054
 permitting-0.053
Mic-0.053
Token
Token ID8151
Feature activation+3.01e+00

Pos logit contributions
 eyel+0.224
gan+0.215
sing+0.212
 spine+0.211
 Moe+0.209
Neg logit contributions
Offic-0.238
ercise-0.188
corn-0.185
 permitting-0.185
Mic-0.182
Tokens
Token ID82
Feature activation+3.35e+00

Pos logit contributions
 eyel+0.079
sing+0.077
 spine+0.075
sed+0.072
gan+0.071
Neg logit contributions
Offic-0.071
"-0.056
 permitting-0.055
corn-0.055
Mic-0.053
Token gone
Token ID3750
Feature activation+2.31e+00

Pos logit contributions
 Moe+0.078
 Mil+0.075
 Flor+0.072
gan+0.072
 Maur+0.071
Neg logit contributions
Offic-0.070
 permitting-0.070
"-0.068
 allowing-0.067
ercise-0.066
Token.
Token ID13
Feature activation+2.12e+00

Pos logit contributions
 eyel+0.060
sing+0.057
gan+0.057
 Moe+0.057
 spine+0.056
Neg logit contributions
Offic-0.053
ercise-0.042
corn-0.042
 permitting-0.042
 whip-0.040
Tokenâ
Token ID22940
Feature activation-2.06e-01

Pos logit contributions
gan+0.037
 Moe+0.036
 eyel+0.035
 O+0.035
 Flor+0.033
Neg logit contributions
Offic-0.066
ercise-0.059
 whip-0.057
corn-0.056
 permitting-0.054
Token special
Token ID2041
Feature activation-9.60e-01

Pos logit contributions
 eyel+0.015
 spine+0.014
sing+0.014
 Moe+0.014
gan+0.014
Neg logit contributions
Offic-0.021
ercise-0.017
 permitting-0.017
corn-0.016
Sir-0.016
Token place
Token ID1295
Feature activation+3.73e+00

Pos logit contributions
Offic+0.028
ercise+0.022
corn+0.022
 permitting+0.022
Mic+0.022
Neg logit contributions
 eyel-0.020
gan-0.020
sing-0.019
 spine-0.019
 Moe-0.019
Token in
Token ID287
Feature activation+2.83e+00

Pos logit contributions
 eyel+0.062
gan+0.060
 spine+0.058
sing+0.058
 Moe+0.057
Neg logit contributions
Offic-0.125
ercise-0.105
Mic-0.105
corn-0.103
 permitting-0.101
Token Lily
Token ID20037
Feature activation+4.28e+00

Pos logit contributions
 Moe+0.053
 eyel+0.053
 Mil+0.052
 Flor+0.052
gan+0.051
Neg logit contributions
Offic-0.087
ercise-0.077
Mic-0.073
corn-0.072
Sir-0.068
Tokenâ
Token ID22940
Feature activation+1.48e+00

Pos logit contributions
 eyel+0.093
gan+0.087
 spine+0.087
adow+0.086
sing+0.084
Neg logit contributions
Offic-0.126
 permitting-0.100
ercise-0.100
Sir-0.099
corn-0.097
Token
Token ID26391
Feature activation+9.24e+00

Pos logit contributions
 eyel+0.027
gan+0.025
 spine+0.025
sing+0.025
 Moe+0.024
Neg logit contributions
Offic-0.047
ercise-0.039
 permitting-0.039
corn-0.039
Mic-0.039
Token
Token ID8151
Feature activation+3.04e+00

Pos logit contributions
 eyel+0.218
gan+0.209
sing+0.206
 spine+0.206
 Moe+0.203
Neg logit contributions
Offic-0.243
ercise-0.193
 permitting-0.190
corn-0.189
Mic-0.188
Tokens
Token ID82
Feature activation+2.91e+00

Pos logit contributions
 eyel+0.064
gan+0.060
sing+0.060
 spine+0.060
 Moe+0.059
Neg logit contributions
Offic-0.087
ercise-0.070
 permitting-0.070
corn-0.069
Mic-0.069
Token room
Token ID2119
Feature activation+3.78e+00

Pos logit contributions
gan+0.061
 Moe+0.060
 eyel+0.060
sing+0.058
 Mil+0.056
Neg logit contributions
Offic-0.076
"-0.064
 whip-0.063
ercise-0.063
 permitting-0.062
Token.
Token ID13
Feature activation+1.76e+00

Pos logit contributions
 eyel+0.067
gan+0.063
 spine+0.061
sing+0.061
 Moe+0.060
Neg logit contributions
Offic-0.123
ercise-0.103
Mic-0.102
corn-0.100
 permitting-0.100
Token Lily
Token ID20037
Feature activation+1.27e-01

Pos logit contributions
 eyel+0.034
gan+0.034
 Moe+0.033
 Mil+0.031
 Flor+0.031
Neg logit contributions
Offic-0.054
ercise-0.045
corn-0.044
 whip-0.043
Mic-0.043
Token many
Token ID867
Feature activation-1.85e+00

Pos logit contributions
Offic+0.009
ercise+0.007
corn+0.007
 permitting+0.007
Mic+0.007
Neg logit contributions
 eyel-0.005
gan-0.005
 spine-0.005
sing-0.005
 Moe-0.005
Token times
Token ID1661
Feature activation+2.25e+00

Pos logit contributions
Offic+0.055
ercise+0.045
Mic+0.044
corn+0.043
 permitting+0.043
Neg logit contributions
 eyel-0.038
 Moe-0.034
adow-0.034
 spine-0.033
gan-0.033
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
 eyel+0.041
gan+0.039
 spine+0.039
sing+0.038
 Moe+0.038
Neg logit contributions
Offic-0.072
ercise-0.059
Mic-0.059
corn-0.058
 permitting-0.057
Token<|endoftext|>
Token ID50256
Feature activation+3.06e+00

Pos logit contributions
 eyel+0.042
gan+0.041
 Moe+0.040
 spine+0.039
sing+0.039
Neg logit contributions
Offic-0.061
ercise-0.051
corn-0.049
Mic-0.049
 permitting-0.049
Tokenâ
Token ID22940
Feature activation+1.28e+00

Pos logit contributions
sing+0.083
 eyel+0.081
 spine+0.080
gan+0.076
sed+0.075
Neg logit contributions
Offic-0.074
"-0.061
Can-0.060
corn-0.060
Sir-0.059
Token
Token ID26391
Feature activation+9.23e+00

Pos logit contributions
 eyel+0.031
sing+0.029
gan+0.029
sed+0.029
oppy+0.028
Neg logit contributions
Offic-0.034
ercise-0.027
Thank-0.026
Sir-0.026
corn-0.026
Token
Token ID129
Feature activation+1.83e+00

Pos logit contributions
 eyel+0.185
gan+0.183
adow+0.176
sing+0.175
 spine+0.175
Neg logit contributions
Offic-0.268
ercise-0.225
 permitting-0.219
Mic-0.214
 whip-0.213
Token
Token ID241
Feature activation+1.44e-01

Pos logit contributions
 eyel+0.043
 Moe+0.042
gan+0.042
 spine+0.041
sing+0.040
Neg logit contributions
Offic-0.046
Mic-0.037
 permitting-0.036
ercise-0.036
corn-0.036
TokenCome
Token ID16773
Feature activation+2.54e+00

Pos logit contributions
 eyel+0.004
gan+0.004
sing+0.004
 spine+0.004
 Moe+0.004
Neg logit contributions
Offic-0.003
ercise-0.002
corn-0.002
 permitting-0.002
Mic-0.002
Token on
Token ID319
Feature activation+1.92e+00

Pos logit contributions
 Moe+0.046
 O+0.045
 toilet+0.043
 eyel+0.043
ow+0.043
Neg logit contributions
Offic-0.077
Mic-0.064
Sir-0.062
ercise-0.061
corn-0.060
Token,
Token ID11
Feature activation+1.83e+00

Pos logit contributions
 eyel+0.039
sing+0.037
 spine+0.037
gan+0.036
adow+0.034
Neg logit contributions
Offic-0.056
corn-0.047
ercise-0.047
"-0.046
 whip-0.046
Token hanging
Token ID10938
Feature activation+1.36e+00

Pos logit contributions
 eyel+0.038
gan+0.037
 Moe+0.036
sing+0.036
 spine+0.036
Neg logit contributions
Offic-0.052
Mic-0.042
ercise-0.041
corn-0.040
Sir-0.040
Token in
Token ID287
Feature activation+3.34e+00

Pos logit contributions
 eyel+0.029
gan+0.027
sing+0.027
 Flor+0.026
tub+0.026
Neg logit contributions
Offic-0.040
Mic-0.033
ercise-0.032
corn-0.032
 permitting-0.031
Token their
Token ID511
Feature activation+5.74e+00

Pos logit contributions
 eyel+0.064
sing+0.059
 Flor+0.058
 Moe+0.058
 spine+0.057
Neg logit contributions
Offic-0.104
ercise-0.090
Mic-0.087
corn-0.084
Sir-0.082
Token daughter
Token ID4957
Feature activation+4.08e+00

Pos logit contributions
 eyel+0.133
gan+0.133
 Moe+0.131
 spine+0.127
sing+0.127
Neg logit contributions
Offic-0.148
ercise-0.119
corn-0.114
 permitting-0.113
 whip-0.113
Tokenâ
Token ID22940
Feature activation+1.38e+00

Pos logit contributions
 eyel+0.092
 spine+0.086
sing+0.086
gan+0.085
adow+0.083
Neg logit contributions
Offic-0.114
ercise-0.091
Mic-0.090
 permitting-0.090
corn-0.089
Token
Token ID26391
Feature activation+9.22e+00

Pos logit contributions
 eyel+0.030
 spine+0.028
gan+0.028
sing+0.028
 Moe+0.027
Neg logit contributions
Offic-0.039
 permitting-0.031
Mic-0.031
ercise-0.031
corn-0.031
Token
Token ID8151
Feature activation+3.08e+00

Pos logit contributions
 eyel+0.205
adow+0.196
 spine+0.192
sing+0.191
gan+0.189
Neg logit contributions
Offic-0.256
ercise-0.210
Mic-0.208
 whip-0.204
Sir-0.204
Tokens
Token ID82
Feature activation+3.23e+00

Pos logit contributions
 eyel+0.067
sing+0.064
 spine+0.063
gan+0.063
 Moe+0.061
Neg logit contributions
Offic-0.086
ercise-0.069
 permitting-0.069
corn-0.068
Mic-0.067
Token room
Token ID2119
Feature activation+4.03e+00

Pos logit contributions
 eyel+0.064
gan+0.061
 Moe+0.060
sing+0.060
 spine+0.059
Neg logit contributions
Offic-0.096
ercise-0.079
 permitting-0.078
corn-0.078
Mic-0.077
Token.
Token ID13
Feature activation+1.77e+00

Pos logit contributions
 eyel+0.078
gan+0.074
sing+0.072
 spine+0.072
 Moe+0.071
Neg logit contributions
Offic-0.123
ercise-0.101
 permitting-0.100
corn-0.099
Mic-0.098
Token<|endoftext|>
Token ID50256
Feature activation+3.11e+00

Pos logit contributions
 eyel+0.037
gan+0.036
 Moe+0.035
 spine+0.034
 Flor+0.033
Neg logit contributions
Offic-0.052
ercise-0.043
corn-0.041
Mic-0.041
 whip-0.041
Token As
Token ID1081
Feature activation+2.12e-01

Pos logit contributions
gan+0.029
 eyel+0.028
 Moe+0.028
 Mil+0.027
 Flor+0.026
Neg logit contributions
Offic-0.042
ercise-0.035
Mic-0.034
corn-0.034
 whip-0.034
Token long
Token ID890
Feature activation+1.25e+00

Pos logit contributions
 eyel+0.005
gan+0.005
 Moe+0.005
 spine+0.005
sing+0.005
Neg logit contributions
Offic-0.005
ercise-0.004
Mic-0.004
corn-0.004
 permitting-0.004
Token as
Token ID355
Feature activation+1.28e+00

Pos logit contributions
 eyel+0.027
gan+0.025
 spine+0.025
 Moe+0.025
sing+0.025
Neg logit contributions
Offic-0.036
ercise-0.029
Mic-0.028
 permitting-0.028
corn-0.028
Token we
Token ID356
Feature activation+1.17e+00

Pos logit contributions
 eyel+0.032
 spine+0.031
sing+0.031
gan+0.030
 Moe+0.029
Neg logit contributions
Offic-0.032
 permitting-0.025
ercise-0.025
corn-0.025
Sir-0.024
Tokenâ
Token ID22940
Feature activation+1.99e+00

Pos logit contributions
 eyel+0.024
gan+0.024
adow+0.024
 spine+0.024
sing+0.024
Neg logit contributions
Offic-0.034
Mic-0.027
corn-0.027
 permitting-0.026
ercise-0.026
Token
Token ID26391
Feature activation+9.20e+00

Pos logit contributions
 eyel+0.040
gan+0.038
sing+0.038
 spine+0.037
 Moe+0.037
Neg logit contributions
Offic-0.059
ercise-0.049
corn-0.048
 permitting-0.048
Mic-0.047
Token
Token ID8151
Feature activation+2.42e+00

Pos logit contributions
 eyel+0.220
gan+0.214
 spine+0.208
adow+0.206
 Moe+0.205
Neg logit contributions
Offic-0.237
ercise-0.188
Mic-0.186
 permitting-0.182
Sir-0.181
Tokenre
Token ID260
Feature activation+4.32e+00

Pos logit contributions
 eyel+0.070
sing+0.070
 spine+0.067
sed+0.065
ses+0.064
Neg logit contributions
Offic-0.050
"-0.041
 permitting-0.038
corn-0.037
ercise-0.037
Token together
Token ID1978
Feature activation+1.65e+00

Pos logit contributions
 eyel+0.092
gan+0.088
sing+0.086
 spine+0.086
 Moe+0.085
Neg logit contributions
Offic-0.124
ercise-0.100
 permitting-0.098
corn-0.098
Mic-0.098
Token,
Token ID11
Feature activation-1.66e-01

Pos logit contributions
 eyel+0.027
gan+0.025
 Moe+0.024
 spine+0.024
sing+0.024
Neg logit contributions
Offic-0.056
ercise-0.047
Mic-0.047
corn-0.046
Sir-0.045
Token we
Token ID356
Feature activation-1.42e+00

Pos logit contributions
Offic+0.004
Mic+0.003
ercise+0.003
corn+0.003
Sir+0.003
Neg logit contributions
 eyel-0.004
gan-0.004
 Moe-0.004
 Mil-0.004
 spine-0.004
Token face
Token ID1986
Feature activation+2.75e-01

Pos logit contributions
Offic+0.027
ercise+0.022
 permitting+0.022
 whip+0.021
 allowing+0.021
Neg logit contributions
 eyel-0.026
gan-0.025
 Moe-0.025
sing-0.024
 spine-0.024
Token danger
Token ID3514
Feature activation-1.40e+00

Pos logit contributions
 eyel+0.006
gan+0.006
sing+0.006
 spine+0.006
 Moe+0.006
Neg logit contributions
Offic-0.007
ercise-0.006
 permitting-0.006
corn-0.006
Mic-0.006
Token when
Token ID618
Feature activation+2.89e+00

Pos logit contributions
Offic+0.039
ercise+0.032
 permitting+0.031
corn+0.031
Mic+0.031
Neg logit contributions
 eyel-0.030
gan-0.029
 spine-0.028
sing-0.028
 Moe-0.028
Token you
Token ID345
Feature activation+3.98e+00

Pos logit contributions
 eyel+0.081
gan+0.080
 spine+0.079
sing+0.079
 Moe+0.076
Neg logit contributions
Offic-0.060
 permitting-0.047
ercise-0.046
corn-0.046
 allowing-0.044
Tokenâ
Token ID22940
Feature activation+1.33e+00

Pos logit contributions
 eyel+0.093
gan+0.089
 spine+0.088
sing+0.088
 Moe+0.087
Neg logit contributions
Offic-0.102
ercise-0.084
 permitting-0.081
 whip-0.080
corn-0.079
Token
Token ID26391
Feature activation+9.17e+00

Pos logit contributions
 eyel+0.029
sing+0.027
gan+0.027
 spine+0.027
 Moe+0.026
Neg logit contributions
Offic-0.037
ercise-0.033
corn-0.030
 permitting-0.030
 whip-0.030
Token
Token ID8151
Feature activation+2.90e+00

Pos logit contributions
 eyel+0.238
gan+0.228
sing+0.225
 spine+0.224
 Moe+0.224
Neg logit contributions
Offic-0.220
ercise-0.171
corn-0.168
 permitting-0.167
Mic-0.164
Tokenre
Token ID260
Feature activation+3.97e+00

Pos logit contributions
 eyel+0.074
sing+0.072
 spine+0.070
sed+0.067
gan+0.067
Neg logit contributions
Offic-0.070
 permitting-0.055
"-0.055
corn-0.055
ercise-0.054
Token on
Token ID319
Feature activation+1.18e+00

Pos logit contributions
 eyel+0.085
gan+0.081
 Moe+0.080
 spine+0.079
sing+0.079
Neg logit contributions
Offic-0.109
 permitting-0.092
ercise-0.092
 allowing-0.089
 whip-0.088
Token your
Token ID534
Feature activation+3.70e+00

Pos logit contributions
 eyel+0.026
gan+0.025
 spine+0.025
sing+0.025
 Moe+0.024
Neg logit contributions
Offic-0.033
ercise-0.027
Mic-0.026
corn-0.026
 permitting-0.026
Token own
Token ID898
Feature activation+3.29e+00

Pos logit contributions
gan+0.090
 Moe+0.086
 Mil+0.083
ola+0.083
ulu+0.083
Neg logit contributions
Offic-0.087
 whip-0.076
ercise-0.075
 permitting-0.072
 allowing-0.070
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
Offic+0.011
ercise+0.008
 permitting+0.008
corn+0.008
 whip+0.008
Neg logit contributions
 eyel-0.012
gan-0.011
 spine-0.011
sing-0.011
 Moe-0.011
Token swoop
Token ID38527
Feature activation-8.81e-01

Pos logit contributions
 eyel+0.053
sing+0.051
gan+0.051
 spine+0.050
 Moe+0.049
Neg logit contributions
Offic-0.052
ercise-0.041
corn-0.040
Mic-0.040
 permitting-0.039
Tokening
Token ID278
Feature activation-4.08e-01

Pos logit contributions
Offic+0.025
ercise+0.021
corn+0.020
 permitting+0.020
Mic+0.020
Neg logit contributions
 eyel-0.019
gan-0.018
 spine-0.017
sing-0.017
 Moe-0.017
Token birds
Token ID10087
Feature activation+7.49e-01

Pos logit contributions
Offic+0.011
ercise+0.009
 permitting+0.009
corn+0.009
Mic+0.009
Neg logit contributions
 eyel-0.009
gan-0.009
 spine-0.008
sing-0.008
 Moe-0.008
Tokenâ
Token ID22940
Feature activation+1.43e+00

Pos logit contributions
 eyel+0.014
sing+0.013
 spine+0.013
gan+0.013
adow+0.012
Neg logit contributions
Offic-0.024
ercise-0.020
Mic-0.020
corn-0.019
 permitting-0.019
Token
Token ID26391
Feature activation+9.17e+00

Pos logit contributions
 eyel+0.028
 Moe+0.027
gan+0.027
sing+0.027
oppy+0.026
Neg logit contributions
Offic-0.042
ercise-0.036
corn-0.034
 permitting-0.034
Mic-0.033
Token
Token ID8151
Feature activation+2.22e+00

Pos logit contributions
 eyel+0.202
gan+0.194
sing+0.191
adow+0.191
 spine+0.190
Neg logit contributions
Offic-0.256
ercise-0.209
 permitting-0.207
Mic-0.205
Sir-0.200
Token unusual
Token ID8468
Feature activation-4.88e-01

Pos logit contributions
gan+0.043
 eyel+0.042
 Moe+0.040
 spine+0.039
sing+0.038
Neg logit contributions
Offic-0.069
ercise-0.058
 permitting-0.055
Mic-0.055
corn-0.055
Token colour
Token ID9568
Feature activation+1.09e+00

Pos logit contributions
Offic+0.013
ercise+0.011
 whip+0.011
corn+0.011
 permitting+0.011
Neg logit contributions
gan-0.010
 eyel-0.010
 Moe-0.010
 spine-0.010
sing-0.010
Token.
Token ID13
Feature activation+1.90e+00

Pos logit contributions
 eyel+0.022
gan+0.021
sing+0.021
 spine+0.021
 Moe+0.020
Neg logit contributions
Offic-0.032
ercise-0.026
corn-0.026
 permitting-0.026
 whip-0.025
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
 eyel+0.037
gan+0.036
 Moe+0.035
 spine+0.034
sing+0.034
Neg logit contributions
Offic-0.058
ercise-0.048
 permitting-0.047
corn-0.047
Mic-0.046
Token deer
Token ID20096
Feature activation-1.36e-01

Pos logit contributions
 eyel+0.039
 Moe+0.038
 Flor+0.038
 swamp+0.038
 spine+0.037
Neg logit contributions
Offic-0.049
ercise-0.040
corn-0.039
 permitting-0.038
Together-0.037
Token again
Token ID757
Feature activation+2.78e+00

Pos logit contributions
Offic+0.004
ercise+0.003
 permitting+0.003
corn+0.003
Mic+0.003
Neg logit contributions
 eyel-0.003
gan-0.003
 spine-0.003
sing-0.003
 Moe-0.003
Token.
Token ID13
Feature activation+2.30e+00

Pos logit contributions
 eyel+0.060
gan+0.059
 spine+0.059
adow+0.059
sing+0.058
Neg logit contributions
Offic-0.077
Mic-0.063
ercise-0.062
corn-0.061
 whip-0.059
Token<|endoftext|>
Token ID50256
Feature activation+3.11e+00

Pos logit contributions
 eyel+0.053
gan+0.051
 spine+0.050
sing+0.050
 Moe+0.049
Neg logit contributions
Offic-0.062
ercise-0.050
 permitting-0.049
corn-0.049
Mic-0.048
Tokenâ
Token ID22940
Feature activation+1.16e+00

Pos logit contributions
sing+0.090
 eyel+0.087
 spine+0.087
gan+0.082
sed+0.081
Neg logit contributions
Offic-0.070
"-0.057
Can-0.056
corn-0.056
Sir-0.054
Token
Token ID26391
Feature activation+9.16e+00

Pos logit contributions
 eyel+0.028
gan+0.026
sing+0.026
 Moe+0.026
sed+0.026
Neg logit contributions
Offic-0.031
ercise-0.025
corn-0.024
Mic-0.024
Thank-0.024
Token
Token ID129
Feature activation+1.65e+00

Pos logit contributions
 eyel+0.189
gan+0.187
 spine+0.180
sing+0.180
 Moe+0.178
Neg logit contributions
Offic-0.262
ercise-0.218
 permitting-0.213
Mic-0.208
 whip-0.206
Token
Token ID241
Feature activation+1.20e-01

Pos logit contributions
 Moe+0.039
 eyel+0.039
gan+0.038
 spine+0.037
sing+0.037
Neg logit contributions
Offic-0.041
 permitting-0.032
Mic-0.032
corn-0.031
ercise-0.031
TokenCan
Token ID6090
Feature activation-7.98e-01

Pos logit contributions
 eyel+0.004
gan+0.004
sing+0.003
 spine+0.003
 Moe+0.003
Neg logit contributions
Offic-0.002
ercise-0.002
corn-0.002
 permitting-0.002
Mic-0.002
Token I
Token ID314
Feature activation+2.33e-01

Pos logit contributions
Offic+0.017
ercise+0.013
 permitting+0.012
corn+0.012
Can+0.012
Neg logit contributions
 eyel-0.023
 spine-0.023
sing-0.023
gan-0.022
 Moe-0.022
Token play
Token ID711
Feature activation-2.11e-01

Pos logit contributions
 eyel+0.005
gan+0.005
adow+0.005
oppy+0.005
 Moe+0.005
Neg logit contributions
Offic-0.007
corn-0.005
Mic-0.005
ercise-0.005
Sir-0.005
Token give
Token ID1577
Feature activation-1.47e+00

Pos logit contributions
Offic+0.004
ercise+0.003
 permitting+0.003
 whip+0.003
 allowing+0.003
Neg logit contributions
 eyel-0.004
gan-0.004
sing-0.004
 Moe-0.004
 spine-0.004
Token up
Token ID510
Feature activation+1.72e+00

Pos logit contributions
Offic+0.040
ercise+0.035
corn+0.033
 permitting+0.033
 whip+0.033
Neg logit contributions
 eyel-0.031
sing-0.030
 spine-0.030
gan-0.029
 Moe-0.028
Token when
Token ID618
Feature activation+3.08e+00

Pos logit contributions
 eyel+0.036
gan+0.034
 spine+0.033
sing+0.033
 Moe+0.033
Neg logit contributions
Offic-0.050
ercise-0.041
corn-0.040
 permitting-0.040
Mic-0.040
Token you
Token ID345
Feature activation+3.84e+00

Pos logit contributions
 eyel+0.090
gan+0.089
sing+0.088
 spine+0.088
 Moe+0.086
Neg logit contributions
Offic-0.061
 permitting-0.046
ercise-0.046
corn-0.045
 allowing-0.044
Tokenâ
Token ID22940
Feature activation+1.19e+00

Pos logit contributions
 eyel+0.091
gan+0.087
sing+0.086
 spine+0.085
 Moe+0.084
Neg logit contributions
Offic-0.100
ercise-0.081
 permitting-0.079
corn-0.078
Mic-0.077
Token
Token ID26391
Feature activation+9.16e+00

Pos logit contributions
 eyel+0.026
gan+0.025
sing+0.025
 Moe+0.024
 spine+0.024
Neg logit contributions
Offic-0.033
ercise-0.028
corn-0.026
 permitting-0.026
 whip-0.026
Token
Token ID8151
Feature activation+3.00e+00

Pos logit contributions
 eyel+0.231
gan+0.222
 spine+0.219
sing+0.218
 Moe+0.215
Neg logit contributions
Offic-0.227
ercise-0.177
 permitting-0.174
corn-0.173
Mic-0.172
Tokenre
Token ID260
Feature activation+3.73e+00

Pos logit contributions
 eyel+0.077
sing+0.075
 spine+0.073
gan+0.072
sed+0.070
Neg logit contributions
Offic-0.072
 permitting-0.056
ercise-0.056
corn-0.056
Mic-0.055
Token trying
Token ID2111
Feature activation+2.08e+00

Pos logit contributions
 eyel+0.082
gan+0.078
sing+0.077
 spine+0.076
 Moe+0.076
Neg logit contributions
Offic-0.102
 permitting-0.086
ercise-0.085
 allowing-0.083
corn-0.082
Token to
Token ID284
Feature activation+1.36e-01

Pos logit contributions
 eyel+0.046
gan+0.044
sing+0.043
 spine+0.043
 Moe+0.042
Neg logit contributions
Offic-0.058
ercise-0.047
 permitting-0.046
corn-0.046
Mic-0.045
Token achieve
Token ID4620
Feature activation+2.32e-01

Pos logit contributions
 eyel+0.003
 spine+0.002
sing+0.002
 Moe+0.002
gan+0.002
Neg logit contributions
Offic-0.004
corn-0.003
ercise-0.003
Mic-0.003
Sir-0.003
Token were
Token ID547
Feature activation+1.86e+00

Pos logit contributions
Offic+0.040
ercise+0.034
Can+0.033
Thank+0.033
"+0.033
Neg logit contributions
 spine-0.044
 eyel-0.043
sing-0.041
gan-0.040
 Moe-0.040
Token playing
Token ID2712
Feature activation+3.83e-01

Pos logit contributions
 spine+0.040
 eyel+0.038
gan+0.035
sing+0.035
 Moe+0.035
Neg logit contributions
Offic-0.052
 permitting-0.045
 whip-0.043
 allowing-0.043
ercise-0.042
Token in
Token ID287
Feature activation+3.24e+00

Pos logit contributions
 eyel+0.009
gan+0.008
sing+0.008
 spine+0.008
 Moe+0.008
Neg logit contributions
Offic-0.010
ercise-0.008
 permitting-0.008
corn-0.008
Mic-0.008
Token Molly
Token ID30236
Feature activation+5.01e+00

Pos logit contributions
 eyel+0.079
gan+0.075
sing+0.074
 spine+0.074
 Moe+0.074
Neg logit contributions
Offic-0.083
ercise-0.066
 permitting-0.065
corn-0.065
Mic-0.064
Tokenâ
Token ID22940
Feature activation+1.03e+00

Pos logit contributions
 eyel+0.121
gan+0.116
sing+0.114
 spine+0.114
 Moe+0.112
Neg logit contributions
Offic-0.130
ercise-0.103
 permitting-0.101
corn-0.100
Mic-0.099
Token
Token ID26391
Feature activation+9.13e+00

Pos logit contributions
 eyel+0.022
gan+0.021
 Moe+0.021
sing+0.021
sed+0.021
Neg logit contributions
Offic-0.029
ercise-0.024
corn-0.024
 permitting-0.023
Mic-0.023
Token
Token ID8151
Feature activation+2.38e+00

Pos logit contributions
 eyel+0.201
gan+0.196
 spine+0.193
 Moe+0.193
sing+0.193
Neg logit contributions
Offic-0.245
corn-0.206
"-0.196
 permitting-0.193
Mic-0.192
Tokens
Token ID82
Feature activation+3.63e+00

Pos logit contributions
 eyel+0.061
sing+0.057
 spine+0.057
 Moe+0.056
gan+0.056
Neg logit contributions
Offic-0.056
 permitting-0.044
Mic-0.043
"-0.042
 allowing-0.042
Token garden
Token ID11376
Feature activation+2.58e+00

Pos logit contributions
agg+0.100
gan+0.097
ur+0.094
utt+0.094
owed+0.093
Neg logit contributions
 whip-0.065
Offic-0.059
 yarn-0.058
 string-0.057
 strings-0.057
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 eyel+0.053
gan+0.050
 spine+0.050
sing+0.049
 Moe+0.049
Neg logit contributions
Offic-0.075
ercise-0.061
 permitting-0.061
corn-0.061
Mic-0.059
Token Suddenly
Token ID24975
Feature activation+7.50e-01

Pos logit contributions
 eyel+0.033
 spine+0.033
sing+0.032
gan+0.031
adow+0.030
Neg logit contributions
Offic-0.042
Can-0.035
corn-0.035
"-0.034
 permitting-0.034
Token can
Token ID460
Feature activation+1.74e+00

Pos logit contributions
 eyel+0.084
gan+0.079
sing+0.079
 spine+0.078
 Moe+0.076
Neg logit contributions
Offic-0.136
ercise-0.113
corn-0.111
 permitting-0.111
Mic-0.109
Token feel
Token ID1254
Feature activation-5.97e-01

Pos logit contributions
 eyel+0.040
gan+0.038
sing+0.037
 spine+0.037
 Moe+0.037
Neg logit contributions
Offic-0.047
ercise-0.038
 permitting-0.037
corn-0.037
Mic-0.037
Token when
Token ID618
Feature activation+2.89e+00

Pos logit contributions
Offic+0.017
ercise+0.014
 permitting+0.014
corn+0.014
Mic+0.014
Neg logit contributions
 eyel-0.012
gan-0.012
sing-0.012
 spine-0.012
 Moe-0.011
Token it
Token ID340
Feature activation+4.38e+00

Pos logit contributions
 eyel+0.070
sing+0.070
gan+0.070
 spine+0.070
 Moe+0.067
Neg logit contributions
Offic-0.070
 permitting-0.056
ercise-0.055
corn-0.055
 allowing-0.054
Tokenâ
Token ID22940
Feature activation+1.31e+00

Pos logit contributions
 spine+0.095
sing+0.094
gan+0.093
 eyel+0.092
 Moe+0.090
Neg logit contributions
Offic-0.114
corn-0.097
ercise-0.097
"-0.096
 permitting-0.093
Token
Token ID26391
Feature activation+9.13e+00

Pos logit contributions
 Moe+0.029
oppy+0.029
gan+0.029
 eyel+0.028
 Mil+0.028
Neg logit contributions
Offic-0.033
ercise-0.030
Thank-0.027
corn-0.027
Can-0.026
Token
Token ID8151
Feature activation+2.89e+00

Pos logit contributions
 eyel+0.211
gan+0.203
sing+0.200
 Moe+0.199
 spine+0.198
Neg logit contributions
Offic-0.243
ercise-0.194
corn-0.192
 permitting-0.190
Mic-0.187
Tokens
Token ID82
Feature activation+3.08e+00

Pos logit contributions
 eyel+0.080
sing+0.079
 spine+0.077
sed+0.074
gan+0.074
Neg logit contributions
Offic-0.062
"-0.048
 permitting-0.048
corn-0.047
ercise-0.047
Token filled
Token ID5901
Feature activation-7.12e-01

Pos logit contributions
gan+0.066
 eyel+0.065
 Moe+0.065
oppy+0.063
 spine+0.063
Neg logit contributions
Offic-0.076
 permitting-0.066
ercise-0.066
 allowing-0.065
 whip-0.064
Token with
Token ID351
Feature activation-1.54e-02

Pos logit contributions
Offic+0.019
ercise+0.015
 permitting+0.015
Mic+0.015
corn+0.015
Neg logit contributions
 eyel-0.017
gan-0.016
 Moe-0.016
sing-0.015
 spine-0.015
Token new
Token ID649
Feature activation-2.71e-01

Pos logit contributions
Offic+0.000
 permitting+0.000
ercise+0.000
corn+0.000
 allowing+0.000
Neg logit contributions
 eyel-0.000
gan-0.000
 spine-0.000
sing-0.000
 Moe-0.000
Token they
Token ID484
Feature activation+1.56e+00

Pos logit contributions
sing+0.041
sed+0.039
adow+0.039
 eyel+0.039
 spine+0.039
Neg logit contributions
Offic-0.031
 permitting-0.024
"-0.024
Sir-0.024
Can-0.023
Token reached
Token ID4251
Feature activation+1.76e+00

Pos logit contributions
 eyel+0.042
sing+0.042
 spine+0.041
sed+0.039
tub+0.039
Neg logit contributions
Offic-0.037
Sir-0.028
 permitting-0.027
"-0.027
ercise-0.027
Token the
Token ID262
Feature activation+3.06e+00

Pos logit contributions
 eyel+0.056
gan+0.055
sed+0.053
sing+0.053
 spine+0.052
Neg logit contributions
Offic-0.026
 permitting-0.022
 exchange-0.022
 whip-0.021
 allowing-0.021
Token doctor
Token ID6253
Feature activation+1.14e+00

Pos logit contributions
gan+0.080
adow+0.080
uzz+0.079
oppy+0.078
agg+0.077
Neg logit contributions
Offic-0.074
Can-0.058
 whip-0.054
Sir-0.053
Will-0.052
Tokenâ
Token ID22940
Feature activation+1.23e+00

Pos logit contributions
 eyel+0.024
gan+0.022
sing+0.022
 spine+0.022
 Moe+0.021
Neg logit contributions
Offic-0.033
corn-0.027
ercise-0.027
 permitting-0.026
 whip-0.026
Token
Token ID26391
Feature activation+9.10e+00

Pos logit contributions
 eyel+0.021
sing+0.019
gan+0.019
 spine+0.019
 Moe+0.018
Neg logit contributions
Offic-0.041
ercise-0.035
corn-0.034
 permitting-0.034
Mic-0.034
Token
Token ID8151
Feature activation+2.41e+00

Pos logit contributions
 eyel+0.231
sing+0.220
gan+0.220
 Moe+0.218
 spine+0.210
Neg logit contributions
Offic-0.228
corn-0.178
"-0.173
oc-0.168
ercise-0.168
Tokens
Token ID82
Feature activation+2.81e+00

Pos logit contributions
 eyel+0.062
sing+0.059
 spine+0.057
sed+0.056
gan+0.055
Neg logit contributions
Offic-0.059
"-0.046
 permitting-0.046
corn-0.045
Mic-0.044
Token office
Token ID2607
Feature activation+5.31e-01

Pos logit contributions
 eyel+0.060
gan+0.060
sing+0.058
 spine+0.058
 Moe+0.057
Neg logit contributions
Offic-0.077
 whip-0.061
 permitting-0.061
ercise-0.060
oc-0.059
Token,
Token ID11
Feature activation+3.20e-02

Pos logit contributions
 eyel+0.010
sing+0.010
oppy+0.009
gan+0.009
 spine+0.009
Neg logit contributions
Offic-0.016
"-0.013
 permitting-0.013
corn-0.013
ercise-0.013
Token his
Token ID465
Feature activation+1.25e+00

Pos logit contributions
 eyel+0.001
gan+0.001
sing+0.001
 spine+0.001
 Moe+0.001
Neg logit contributions
Offic-0.001
ercise-0.001
 permitting-0.001
corn-0.001
Mic-0.001
Token placed
Token ID4624
Feature activation-2.87e+00

Pos logit contributions
 eyel+0.070
gan+0.068
oppy+0.066
 Moe+0.066
 spine+0.065
Neg logit contributions
Offic-0.072
 permitting-0.062
 allowing-0.061
 whip-0.059
ercise-0.059
Token safely
Token ID11512
Feature activation+2.62e+00

Pos logit contributions
Offic+0.072
ercise+0.057
corn+0.056
 permitting+0.056
Mic+0.055
Neg logit contributions
 eyel-0.071
gan-0.068
 spine-0.068
sing-0.068
 Moe-0.066
Token in
Token ID287
Feature activation+2.13e+00

Pos logit contributions
 eyel+0.066
gan+0.065
sing+0.063
 Moe+0.062
sed+0.062
Neg logit contributions
Offic-0.061
 permitting-0.051
 allowing-0.050
ercise-0.049
 whip-0.049
Token Tiny
Token ID20443
Feature activation+3.70e+00

Pos logit contributions
 eyel+0.042
 Moe+0.040
sing+0.039
 spine+0.038
 Flor+0.038
Neg logit contributions
Offic-0.065
ercise-0.056
Mic-0.054
corn-0.053
Sir-0.051
Tokenâ
Token ID22940
Feature activation+1.53e+00

Pos logit contributions
 eyel+0.085
 spine+0.081
gan+0.080
sing+0.080
 Moe+0.078
Neg logit contributions
Offic-0.100
ercise-0.080
 permitting-0.079
corn-0.078
Mic-0.078
Token
Token ID26391
Feature activation+9.07e+00

Pos logit contributions
 eyel+0.029
gan+0.029
 Moe+0.028
sing+0.028
oppy+0.027
Neg logit contributions
Offic-0.047
ercise-0.039
corn-0.038
 permitting-0.037
Mic-0.037
Token
Token ID8151
Feature activation+2.47e+00

Pos logit contributions
 eyel+0.215
gan+0.205
 spine+0.205
sing+0.204
 Moe+0.198
Neg logit contributions
Offic-0.238
ercise-0.189
 permitting-0.186
Mic-0.184
corn-0.184
Tokens
Token ID82
Feature activation+3.34e+00

Pos logit contributions
 eyel+0.055
sing+0.052
 spine+0.051
gan+0.051
 Moe+0.050
Neg logit contributions
Offic-0.068
 permitting-0.054
ercise-0.054
corn-0.054
Mic-0.054
Token pocket
Token ID10000
Feature activation+1.36e+00

Pos logit contributions
gan+0.081
 Moe+0.077
 eyel+0.074
agg+0.072
sing+0.072
Neg logit contributions
Offic-0.077
 whip-0.069
 string-0.065
"-0.065
ercise-0.065
Token and
Token ID290
Feature activation-2.28e+00

Pos logit contributions
 eyel+0.029
gan+0.028
sing+0.027
 spine+0.027
 Moe+0.027
Neg logit contributions
Offic-0.039
ercise-0.032
 permitting-0.031
corn-0.031
Mic-0.031
Token they
Token ID484
Feature activation-8.45e-01

Pos logit contributions
Offic+0.064
ercise+0.052
Mic+0.051
corn+0.051
 permitting+0.051
Neg logit contributions
 eyel-0.049
gan-0.047
 Moe-0.046
sing-0.046
 spine-0.045
Token not
Token ID407
Feature activation-7.79e-01

Pos logit contributions
 eyel+0.049
 spine+0.047
gan+0.047
sing+0.046
 Moe+0.046
Neg logit contributions
Offic-0.072
ercise-0.060
 permitting-0.060
corn-0.058
 whip-0.058
Token real
Token ID1103
Feature activation-3.65e+00

Pos logit contributions
Offic+0.021
 permitting+0.017
ercise+0.017
 whip+0.017
 allowing+0.017
Neg logit contributions
 eyel-0.018
gan-0.017
 Moe-0.017
 spine-0.017
sing-0.017
Token,
Token ID11
Feature activation-1.82e+00

Pos logit contributions
Offic+0.088
oc+0.075
 permitting+0.071
 whip+0.071
 allowing+0.071
Neg logit contributions
gan-0.093
sing-0.089
 eyel-0.087
 spine-0.083
oppy-0.083
Token but
Token ID475
Feature activation-3.82e-01

Pos logit contributions
Offic+0.050
ercise+0.040
corn+0.039
 permitting+0.039
Mic+0.039
Neg logit contributions
 eyel-0.041
gan-0.039
sing-0.038
 spine-0.038
 Moe-0.038
Token a
Token ID257
Feature activation-4.57e+00

Pos logit contributions
Offic+0.009
 permitting+0.007
ercise+0.007
corn+0.007
 whip+0.007
Neg logit contributions
 eyel-0.010
gan-0.009
 spine-0.009
sing-0.009
 Moe-0.009
Token stuffed
Token ID22259
Feature activation-8.66e+00

Pos logit contributions
Offic+0.121
 permitting+0.095
ercise+0.095
 whip+0.095
Mic+0.094
Neg logit contributions
 eyel-0.106
gan-0.106
 Moe-0.101
 spine-0.101
sing-0.101
Token animal
Token ID5044
Feature activation-2.62e+00

Pos logit contributions
Offic+0.216
oc+0.164
Mic+0.161
 whip+0.161
ercise+0.160
Neg logit contributions
 eyel-0.224
gan-0.215
 spine-0.210
sing-0.209
 Moe-0.204
Token that
Token ID326
Feature activation-1.52e+00

Pos logit contributions
Offic+0.064
 whip+0.051
 permitting+0.051
ercise+0.050
corn+0.050
Neg logit contributions
 eyel-0.065
gan-0.064
sing-0.061
 spine-0.061
 Moe-0.061
Token belonged
Token ID19611
Feature activation+3.17e-01

Pos logit contributions
Offic+0.045
ercise+0.037
corn+0.037
 permitting+0.037
Mic+0.036
Neg logit contributions
 eyel-0.030
gan-0.029
 spine-0.029
sing-0.029
 Moe-0.028
Token to
Token ID284
Feature activation+3.77e-02

Pos logit contributions
 eyel+0.009
sing+0.009
 spine+0.009
 Moe+0.009
gan+0.009
Neg logit contributions
Offic-0.006
"-0.005
Can-0.005
 permitting-0.005
 allowing-0.005
Token her
Token ID607
Feature activation+3.61e+00

Pos logit contributions
 eyel+0.001
gan+0.001
sing+0.001
 spine+0.001
 Moe+0.001
Neg logit contributions
Offic-0.001
 permitting-0.001
ercise-0.001
corn-0.001
Mic-0.001
Token something
Token ID1223
Feature activation-5.49e-02

Pos logit contributions
Offic+0.099
ercise+0.080
corn+0.079
Mic+0.079
 permitting+0.079
Neg logit contributions
 eyel-0.064
gan-0.061
 Moe-0.060
 spine-0.059
sing-0.058
Token new
Token ID649
Feature activation-2.95e+00

Pos logit contributions
Offic+0.001
ercise+0.001
corn+0.001
 permitting+0.001
"+0.001
Neg logit contributions
sing-0.001
 spine-0.001
 eyel-0.001
gan-0.001
sed-0.001
Token!
Token ID0
Feature activation+1.36e+00

Pos logit contributions
Offic+0.075
ercise+0.062
 permitting+0.060
corn+0.060
"+0.059
Neg logit contributions
 eyel-0.070
sing-0.068
gan-0.067
 spine-0.067
 Moe-0.064
Token She
Token ID1375
Feature activation-8.34e-01

Pos logit contributions
 eyel+0.029
sing+0.028
gan+0.027
 spine+0.027
 Moe+0.026
Neg logit contributions
Offic-0.040
ercise-0.032
 permitting-0.031
Mic-0.031
corn-0.031
Token thought
Token ID1807
Feature activation-2.75e-01

Pos logit contributions
Offic+0.026
ercise+0.021
Mic+0.021
Sir+0.021
 permitting+0.021
Neg logit contributions
 eyel-0.016
gan-0.015
 Moe-0.015
sing-0.014
oppy-0.014
Token av
Token ID1196
Feature activation-8.34e+00

Pos logit contributions
Offic+0.007
ercise+0.005
Mic+0.005
corn+0.005
 permitting+0.005
Neg logit contributions
 eyel-0.007
gan-0.007
sing-0.007
 spine-0.007
 Moe-0.007
Tokenoc
Token ID420
Feature activation-7.29e+00

Pos logit contributions
Offic+0.173
 permitting+0.155
 allowing+0.132
 letting+0.126
 groom+0.124
Neg logit contributions
adow-0.220
oppy-0.215
by-0.201
pt-0.201
anda-0.201
Tokenados
Token ID22484
Feature activation-4.81e+00

Pos logit contributions
Offic+0.150
 permitting+0.111
ercise+0.107
Can+0.104
corn+0.104
Neg logit contributions
 eyel-0.212
gan-0.208
 Moe-0.204
sed-0.198
sing-0.198
Token were
Token ID547
Feature activation+6.83e-01

Pos logit contributions
Offic+0.164
ercise+0.137
 permitting+0.135
Mic+0.134
corn+0.134
Neg logit contributions
 eyel-0.077
gan-0.072
sing-0.070
 Moe-0.070
 spine-0.070
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
 eyel+0.012
gan+0.012
 Moe+0.011
sing+0.011
 Flor+0.011
Neg logit contributions
Offic-0.023
ercise-0.018
Mic-0.018
corn-0.018
Sir-0.018
Token most
Token ID749
Feature activation-2.68e+00

Pos logit contributions
 eyel+0.006
gan+0.005
 spine+0.005
sing+0.005
 Moe+0.005
Neg logit contributions
Offic-0.019
ercise-0.016
corn-0.016
 permitting-0.015
Mic-0.015
Token mom
Token ID1995
Feature activation-1.77e+00

Pos logit contributions
 eyel+0.033
 spine+0.031
gan+0.031
sing+0.030
 Moe+0.030
Neg logit contributions
Offic-0.045
 whip-0.036
ercise-0.036
 permitting-0.035
Sir-0.035
Token many
Token ID867
Feature activation-4.05e+00

Pos logit contributions
Offic+0.052
ercise+0.042
Mic+0.042
corn+0.041
 permitting+0.041
Neg logit contributions
 eyel-0.036
gan-0.035
 Moe-0.034
 spine-0.034
sing-0.034
Token questions
Token ID2683
Feature activation-9.92e-01

Pos logit contributions
Offic+0.129
Mic+0.110
ercise+0.109
corn+0.107
 permitting+0.107
Neg logit contributions
 eyel-0.073
adow-0.065
 Moe-0.065
 Mil-0.063
 spine-0.063
Token.
Token ID13
Feature activation+4.06e-01

Pos logit contributions
Offic+0.032
Mic+0.027
ercise+0.026
corn+0.026
 permitting+0.026
Neg logit contributions
 eyel-0.018
gan-0.017
sing-0.017
 spine-0.017
 Moe-0.016
Token She
Token ID1375
Feature activation-1.60e+00

Pos logit contributions
 eyel+0.009
gan+0.008
sing+0.008
 spine+0.008
 Moe+0.008
Neg logit contributions
Offic-0.011
ercise-0.009
 permitting-0.009
corn-0.009
Mic-0.009
Token teaches
Token ID17324
Feature activation-8.25e+00

Pos logit contributions
Offic+0.041
Mic+0.032
 permitting+0.032
ercise+0.032
corn+0.031
Neg logit contributions
 eyel-0.039
gan-0.037
 Moe-0.036
sing-0.036
oppy-0.035
Token him
Token ID683
Feature activation+4.13e-01

Pos logit contributions
Offic+0.232
ercise+0.189
corn+0.185
 permitting+0.183
Mic+0.182
Neg logit contributions
 eyel-0.179
 Moe-0.170
gan-0.168
 Mil-0.162
 Flor-0.162
Token the
Token ID262
Feature activation-4.09e-01

Pos logit contributions
 spine+0.012
 eyel+0.012
sing+0.012
gan+0.011
 Maur+0.011
Neg logit contributions
Offic-0.008
"-0.007
ercise-0.007
 newspapers-0.007
corn-0.006
Token names
Token ID3891
Feature activation-7.64e-01

Pos logit contributions
Offic+0.011
ercise+0.009
corn+0.009
Mic+0.009
 permitting+0.009
Neg logit contributions
 eyel-0.009
sing-0.009
 spine-0.009
gan-0.009
 Moe-0.009
Token of
Token ID286
Feature activation-1.72e+00

Pos logit contributions
Offic+0.020
Mic+0.016
ercise+0.016
 permitting+0.016
corn+0.015
Neg logit contributions
 eyel-0.018
gan-0.017
 Moe-0.017
sing-0.017
 spine-0.016
Token the
Token ID262
Feature activation+1.66e+00

Pos logit contributions
Offic+0.049
Mic+0.040
ercise+0.040
corn+0.039
oc+0.038
Neg logit contributions
 eyel-0.035
 Flor-0.034
 Moe-0.034
 Mil-0.034
 toilet-0.034
Token girl
Token ID2576
Feature activation-2.72e-01

Pos logit contributions
Offic+0.117
ercise+0.091
Mic+0.091
Sir+0.090
corn+0.089
Neg logit contributions
 eyel-0.099
sing-0.097
gan-0.095
 spine-0.094
 Moe-0.092
Token said
Token ID531
Feature activation+6.77e-01

Pos logit contributions
Offic+0.005
"+0.003
 whip+0.003
oc+0.003
corn+0.003
Neg logit contributions
 spine-0.009
tub-0.008
sing-0.008
 eyel-0.008
 Moe-0.008
Token.
Token ID13
Feature activation+3.32e-01

Pos logit contributions
 eyel+0.014
 Moe+0.014
gan+0.014
sing+0.013
 spine+0.013
Neg logit contributions
Offic-0.019
 permitting-0.015
 allowing-0.015
"-0.015
Sir-0.015
Token
Token ID198
Feature activation-2.61e+00

Pos logit contributions
 eyel+0.006
 spine+0.006
gan+0.006
 Flor+0.006
sing+0.006
Neg logit contributions
Offic-0.009
ercise-0.009
corn-0.008
 permitting-0.008
 whip-0.008
Token
Token ID198
Feature activation-2.82e+00

Pos logit contributions
Offic+0.082
ercise+0.080
corn+0.077
 permitting+0.077
 allowing+0.075
Neg logit contributions
ia-0.031
gan-0.029
 Flor-0.029
 eyel-0.029
oppy-0.029
TokenOffic
Token ID12710
Feature activation-8.22e+00

Pos logit contributions
 topp+0.051
 gravy+0.048
burn+0.047
 permitting+0.046
 helium+0.046
Neg logit contributions
ia-0.064
:-0.061
azes-0.059
ahl-0.058
 Boo-0.056
Tokener
Token ID263
Feature activation-1.64e+00

Pos logit contributions
Offic+0.221
ercise+0.171
corn+0.169
Sir+0.168
Mic+0.167
Neg logit contributions
 eyel-0.198
sing-0.189
 spine-0.186
gan-0.185
 Moe-0.181
Token Roy
Token ID9817
Feature activation-2.34e-01

Pos logit contributions
Offic+0.043
"+0.034
 whip+0.033
Mic+0.032
Billy+0.032
Neg logit contributions
 eyel-0.039
 spine-0.038
sing-0.037
 Moe-0.036
oppy-0.035
Token had
Token ID550
Feature activation-1.65e+00

Pos logit contributions
Offic+0.006
corn+0.005
"+0.005
ercise+0.005
Sir+0.005
Neg logit contributions
 spine-0.006
 eyel-0.006
sing-0.006
gan-0.005
 Moe-0.005
Token an
Token ID281
Feature activation-2.33e+00

Pos logit contributions
Offic+0.051
ercise+0.042
 permitting+0.042
corn+0.042
Mic+0.041
Neg logit contributions
 eyel-0.031
gan-0.030
sing-0.029
 spine-0.029
 Moe-0.028
Token idea
Token ID2126
Feature activation-1.43e+00

Pos logit contributions
Offic+0.070
ercise+0.059
 whip+0.058
 permitting+0.057
Mic+0.056
Neg logit contributions
 eyel-0.045
gan-0.045
 Moe-0.043
 spine-0.042
 Mil-0.042
Token Ben
Token ID3932
Feature activation+1.82e-01

Pos logit contributions
Offic+0.122
ercise+0.100
corn+0.098
Mic+0.097
 permitting+0.097
Neg logit contributions
 eyel-0.075
gan-0.070
 spine-0.070
sing-0.070
adow-0.067
Token say
Token ID910
Feature activation-9.36e-01

Pos logit contributions
 eyel+0.005
gan+0.005
sing+0.005
 spine+0.005
 Moe+0.005
Neg logit contributions
Offic-0.004
ercise-0.003
 permitting-0.003
corn-0.003
Mic-0.003
Token.
Token ID13
Feature activation+8.80e-02

Pos logit contributions
Offic+0.028
 permitting+0.023
ercise+0.023
corn+0.022
 allowing+0.022
Neg logit contributions
 eyel-0.018
gan-0.018
sing-0.017
 spine-0.017
 Moe-0.017
Token
Token ID198
Feature activation-2.59e+00

Pos logit contributions
 eyel+0.002
gan+0.001
sing+0.001
 spine+0.001
 Moe+0.001
Neg logit contributions
Offic-0.003
ercise-0.002
corn-0.002
Mic-0.002
 permitting-0.002
Token
Token ID198
Feature activation-2.84e+00

Pos logit contributions
Offic+0.079
ercise+0.072
 permitting+0.068
corn+0.067
 whip+0.066
Neg logit contributions
 eyel-0.044
 Moe-0.043
gan-0.043
 Flor-0.041
 Mil-0.041
TokenOffic
Token ID12710
Feature activation-8.18e+00

Pos logit contributions
ercise+0.055
 topp+0.050
ograph+0.050
 redd+0.049
 permitting+0.048
Neg logit contributions
ia-0.062
 Flor-0.059
 Mil-0.059
 Moe-0.059
gan-0.059
Tokener
Token ID263
Feature activation-1.80e+00

Pos logit contributions
Offic+0.192
Sir+0.142
ercise+0.142
corn+0.142
"+0.135
Neg logit contributions
 eyel-0.229
sing-0.220
 spine-0.218
oppy-0.215
adow-0.209
Token Mike
Token ID4995
Feature activation+8.33e-01

Pos logit contributions
Offic+0.046
ercise+0.035
corn+0.034
 permitting+0.034
 whip+0.034
Neg logit contributions
 eyel-0.045
 spine-0.044
sing-0.043
gan-0.042
oppy-0.041
Token shows
Token ID2523
Feature activation-3.45e+00

Pos logit contributions
 eyel+0.020
gan+0.019
 Moe+0.019
sing+0.019
 spine+0.019
Neg logit contributions
Offic-0.021
Mic-0.017
ercise-0.017
 permitting-0.017
corn-0.016
Token them
Token ID606
Feature activation+1.00e+00

Pos logit contributions
Offic+0.074
Sir+0.055
ercise+0.054
corn+0.053
 permitting+0.053
Neg logit contributions
 eyel-0.098
 spine-0.098
sing-0.097
gan-0.093
adow-0.093
Token his
Token ID465
Feature activation+2.66e+00

Pos logit contributions
 eyel+0.025
 spine+0.024
sing+0.024
gan+0.024
 Moe+0.023
Neg logit contributions
Offic-0.025
ercise-0.020
 permitting-0.019
corn-0.019
 whip-0.019
Token's
Token ID338
Feature activation+5.97e-01

Pos logit contributions
 eyel+0.008
gan+0.008
 spine+0.008
sing+0.008
 Moe+0.007
Neg logit contributions
Offic-0.008
ercise-0.007
corn-0.007
 permitting-0.006
Mic-0.006
Token mom
Token ID1995
Feature activation-1.62e+00

Pos logit contributions
adow+0.013
gan+0.013
 eyel+0.013
 Moe+0.013
 spine+0.013
Neg logit contributions
Offic-0.015
 whip-0.013
Can-0.012
 permitting-0.012
ercise-0.012
Token smiles
Token ID21845
Feature activation-2.24e+00

Pos logit contributions
Offic+0.044
ercise+0.035
Mic+0.035
 permitting+0.035
corn+0.034
Neg logit contributions
 eyel-0.038
gan-0.035
 Moe-0.035
sing-0.035
 spine-0.034
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
Offic+0.070
ercise+0.057
Mic+0.056
corn+0.056
 permitting+0.056
Neg logit contributions
 eyel-0.044
gan-0.040
sing-0.040
 spine-0.040
 Moe-0.039
Token She
Token ID1375
Feature activation-1.40e+00

Pos logit contributions
Offic+0.003
ercise+0.003
 permitting+0.003
corn+0.003
Mic+0.003
Neg logit contributions
 eyel-0.002
gan-0.002
 spine-0.002
sing-0.002
 Moe-0.002
Token teaches
Token ID17324
Feature activation-8.17e+00

Pos logit contributions
Offic+0.037
Mic+0.029
ercise+0.029
 permitting+0.029
corn+0.028
Neg logit contributions
 eyel-0.033
gan-0.031
 Moe-0.031
sing-0.031
 Flor-0.030
Token them
Token ID606
Feature activation-9.09e-01

Pos logit contributions
Offic+0.171
ercise+0.129
 permitting+0.127
Sir+0.127
corn+0.126
Neg logit contributions
 eyel-0.234
 spine-0.232
sing-0.230
gan-0.227
adow-0.220
Token how
Token ID703
Feature activation-1.99e-02

Pos logit contributions
Offic+0.016
 newspapers+0.015
ercise+0.015
"+0.015
 whip+0.014
Neg logit contributions
 spine-0.028
sing-0.026
 eyel-0.026
sed-0.025
oppy-0.025
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
Offic+0.000
ercise+0.000
 permitting+0.000
corn+0.000
"+0.000
Neg logit contributions
 eyel-0.001
 spine-0.001
sing-0.001
gan-0.000
 Moe-0.000
Token be
Token ID307
Feature activation+1.22e+00

Pos logit contributions
Offic+0.031
Sir+0.025
Mic+0.025
 permitting+0.024
corn+0.024
Neg logit contributions
 eyel-0.025
sing-0.024
 Moe-0.024
gan-0.024
 spine-0.023
Token safe
Token ID3338
Feature activation+4.98e-01

Pos logit contributions
gan+0.023
 eyel+0.022
adow+0.021
 Flor+0.021
 Moe+0.021
Neg logit contributions
Offic-0.038
Sir-0.030
corn-0.030
Mic-0.030
ercise-0.030
Token his
Token ID465
Feature activation+1.38e+00

Pos logit contributions
Offic+0.090
ercise+0.074
 permitting+0.074
corn+0.073
Mic+0.073
Neg logit contributions
 eyel-0.053
gan-0.050
 spine-0.049
 Moe-0.048
sing-0.048
Token best
Token ID1266
Feature activation-1.69e+00

Pos logit contributions
 eyel+0.034
 Moe+0.034
 Mil+0.033
gan+0.033
adow+0.033
Neg logit contributions
Offic-0.033
 whip-0.028
Mic-0.027
Can-0.025
ercise-0.025
Token friend
Token ID1545
Feature activation-1.33e+00

Pos logit contributions
Offic+0.049
ercise+0.040
 permitting+0.039
Mic+0.039
corn+0.039
Neg logit contributions
 eyel-0.035
gan-0.033
sing-0.033
 spine-0.033
 Moe-0.032
Token,
Token ID11
Feature activation+4.01e-01

Pos logit contributions
Offic+0.029
"+0.024
Billy+0.024
Can+0.023
 permitting+0.023
Neg logit contributions
 spine-0.038
sing-0.037
 eyel-0.037
sed-0.033
 Flor-0.033
Token a
Token ID257
Feature activation-2.79e+00

Pos logit contributions
 eyel+0.011
 spine+0.011
sing+0.011
anda+0.011
oken+0.010
Neg logit contributions
Offic-0.009
Billy-0.008
 permitting-0.007
Mic-0.007
corn-0.007
Token stuffed
Token ID22259
Feature activation-8.07e+00

Pos logit contributions
Offic+0.073
Mic+0.059
 whip+0.059
Can+0.056
 permitting+0.056
Neg logit contributions
 eyel-0.067
gan-0.065
 spine-0.064
adow-0.063
 Moe-0.063
Token bear
Token ID6842
Feature activation-1.70e+00

Pos logit contributions
Offic+0.213
ercise+0.170
 permitting+0.167
corn+0.167
Mic+0.164
Neg logit contributions
 eyel-0.189
gan-0.182
sing-0.179
 spine-0.179
 Moe-0.176
Token named
Token ID3706
Feature activation-7.74e-01

Pos logit contributions
Offic+0.046
ercise+0.036
 permitting+0.036
corn+0.036
Mic+0.035
Neg logit contributions
 eyel-0.039
gan-0.038
 spine-0.037
sing-0.037
 Moe-0.037
Token Teddy
Token ID29345
Feature activation+1.43e+00

Pos logit contributions
Offic+0.020
Mic+0.016
 permitting+0.016
corn+0.016
ercise+0.016
Neg logit contributions
 eyel-0.019
 spine-0.018
sing-0.018
gan-0.017
adow-0.017
Token,
Token ID11
Feature activation-3.06e-01

Pos logit contributions
 eyel+0.028
gan+0.027
 spine+0.026
sing+0.026
 Moe+0.026
Neg logit contributions
Offic-0.043
 permitting-0.036
ercise-0.036
Mic-0.035
corn-0.035
Token went
Token ID1816
Feature activation+2.80e+00

Pos logit contributions
Offic+0.008
 permitting+0.006
Mic+0.006
ercise+0.006
Can+0.006
Neg logit contributions
 eyel-0.007
 spine-0.007
gan-0.007
sing-0.007
 Moe-0.007
Token elephant
Token ID20950
Feature activation-4.12e+00

Pos logit contributions
Offic+0.059
ercise+0.041
 permitting+0.040
Mic+0.040
 whip+0.040
Neg logit contributions
 eyel-0.098
gan-0.094
 spine-0.094
 Moe-0.093
sing-0.092
Token.
Token ID13
Feature activation+8.76e-02

Pos logit contributions
Offic+0.132
ercise+0.110
 permitting+0.108
corn+0.108
Mic+0.108
Neg logit contributions
 eyel-0.074
gan-0.069
sing-0.068
 spine-0.068
 Moe-0.066
Token That
Token ID1320
Feature activation-7.34e-01

Pos logit contributions
 eyel+0.002
sing+0.002
 spine+0.002
adow+0.002
gan+0.002
Neg logit contributions
Offic-0.002
Can-0.002
oc-0.002
ercise-0.002
"-0.002
Token's
Token ID338
Feature activation+7.21e-01

Pos logit contributions
Offic+0.020
ercise+0.016
 permitting+0.016
corn+0.016
Mic+0.016
Neg logit contributions
 eyel-0.016
gan-0.016
sing-0.015
 spine-0.015
 Moe-0.015
Token a
Token ID257
Feature activation-3.46e+00

Pos logit contributions
 eyel+0.016
 spine+0.016
 Moe+0.015
sing+0.015
gan+0.015
Neg logit contributions
Offic-0.020
ercise-0.016
 permitting-0.016
 whip-0.016
corn-0.016
Token stuffed
Token ID22259
Feature activation-8.06e+00

Pos logit contributions
Offic+0.095
ercise+0.080
corn+0.077
 permitting+0.077
 allowing+0.074
Neg logit contributions
 eyel-0.076
sing-0.072
gan-0.069
 spine-0.069
 Moe-0.068
Token toy
Token ID13373
Feature activation-1.68e+00

Pos logit contributions
Offic+0.183
ercise+0.144
corn+0.140
 permitting+0.140
Mic+0.136
Neg logit contributions
 eyel-0.215
gan-0.209
sing-0.206
 spine-0.204
 Moe-0.204
Token.
Token ID13
Feature activation+5.92e-01

Pos logit contributions
Offic+0.049
ercise+0.040
 permitting+0.040
corn+0.039
Mic+0.039
Neg logit contributions
 eyel-0.034
gan-0.033
 spine-0.032
sing-0.032
 Moe-0.031
Token Someone
Token ID17877
Feature activation-2.40e+00

Pos logit contributions
 eyel+0.013
gan+0.013
sing+0.012
 spine+0.012
 Moe+0.012
Neg logit contributions
Offic-0.016
ercise-0.013
 permitting-0.013
corn-0.013
Mic-0.013
Token left
Token ID1364
Feature activation-5.34e-01

Pos logit contributions
Offic+0.061
ercise+0.048
 permitting+0.047
corn+0.047
Mic+0.047
Neg logit contributions
 eyel-0.058
gan-0.056
 spine-0.055
sing-0.055
 Moe-0.055
Token it
Token ID340
Feature activation+3.89e+00

Pos logit contributions
Offic+0.013
Mic+0.010
ercise+0.009
corn+0.009
Together+0.009
Neg logit contributions
 eyel-0.014
gan-0.014
 spine-0.014
 Moe-0.013
sing-0.013
Token everywhere
Token ID8347
Feature activation+4.39e+00

Pos logit contributions
 eyel+0.099
gan+0.097
 Moe+0.094
 spine+0.093
sing+0.092
Neg logit contributions
Offic-0.117
ercise-0.099
 permitting-0.098
 whip-0.097
corn-0.096
Token.
Token ID13
Feature activation+2.38e+00

Pos logit contributions
 eyel+0.091
gan+0.087
sing+0.085
 spine+0.085
 Moe+0.083
Neg logit contributions
Offic-0.127
ercise-0.104
 permitting-0.103
corn-0.103
Mic-0.100
Token It
Token ID632
Feature activation-2.86e-01

Pos logit contributions
 eyel+0.047
gan+0.045
 Moe+0.044
sing+0.043
 spine+0.043
Neg logit contributions
Offic-0.072
ercise-0.059
 permitting-0.058
corn-0.058
Mic-0.057
Token was
Token ID373
Feature activation+2.29e+00

Pos logit contributions
Offic+0.009
ercise+0.008
corn+0.008
 permitting+0.007
Mic+0.007
Neg logit contributions
 eyel-0.005
gan-0.005
 spine-0.005
sing-0.005
 Moe-0.005
Token a
Token ID257
Feature activation-2.22e+00

Pos logit contributions
 eyel+0.045
gan+0.043
sing+0.042
 spine+0.042
 Moe+0.042
Neg logit contributions
Offic-0.069
ercise-0.057
corn-0.056
 permitting-0.056
Mic-0.055
Token stuffed
Token ID22259
Feature activation-8.05e+00

Pos logit contributions
Offic+0.063
ercise+0.051
 permitting+0.050
corn+0.050
Mic+0.049
Neg logit contributions
 eyel-0.048
sing-0.046
 spine-0.045
gan-0.044
 Moe-0.044
Token blue
Token ID4171
Feature activation-2.84e+00

Pos logit contributions
Offic+0.222
ercise+0.182
 permitting+0.178
corn+0.176
Mic+0.173
Neg logit contributions
 eyel-0.177
gan-0.169
sing-0.168
 spine-0.167
 Moe-0.166
Token bear
Token ID6842
Feature activation-1.82e+00

Pos logit contributions
Offic+0.079
ercise+0.063
 permitting+0.062
Mic+0.061
corn+0.061
Neg logit contributions
 eyel-0.065
 spine-0.061
sing-0.061
gan-0.060
 Moe-0.059
Token with
Token ID351
Feature activation-1.73e+00

Pos logit contributions
Offic+0.054
ercise+0.045
Mic+0.043
 permitting+0.043
Sir+0.043
Neg logit contributions
 eyel-0.037
gan-0.035
sing-0.034
 spine-0.034
 Moe-0.034
Token a
Token ID257
Feature activation-1.20e+00

Pos logit contributions
Offic+0.051
ercise+0.042
 permitting+0.041
corn+0.041
Mic+0.041
Neg logit contributions
 eyel-0.035
gan-0.034
 spine-0.033
sing-0.033
 Moe-0.033
Token smile
Token ID8212
Feature activation-2.28e+00

Pos logit contributions
Offic+0.033
ercise+0.027
 permitting+0.026
corn+0.026
Sir+0.026
Neg logit contributions
 eyel-0.028
sing-0.026
 spine-0.026
gan-0.025
 Moe-0.025
Token a
Token ID257
Feature activation-1.76e+00

Pos logit contributions
 spine+0.064
 eyel+0.063
oppy+0.062
gan+0.061
sing+0.061
Neg logit contributions
Offic-0.079
 permitting-0.066
 allowing-0.064
ercise-0.064
corn-0.064
Token big
Token ID1263
Feature activation-3.82e+00

Pos logit contributions
Offic+0.045
ercise+0.035
corn+0.035
 permitting+0.035
 whip+0.035
Neg logit contributions
 eyel-0.042
gan-0.041
 spine-0.040
 Moe-0.040
sing-0.040
Token surprise
Token ID5975
Feature activation-1.07e+00

Pos logit contributions
Offic+0.109
ercise+0.089
corn+0.087
 permitting+0.087
Mic+0.086
Neg logit contributions
 eyel-0.081
gan-0.078
sing-0.076
 spine-0.076
 Moe-0.075
Token -
Token ID532
Feature activation+8.86e-01

Pos logit contributions
Offic+0.028
 permitting+0.023
corn+0.022
 whip+0.022
 allowing+0.022
Neg logit contributions
 eyel-0.024
gan-0.024
 spine-0.024
sing-0.023
oppy-0.023
Token a
Token ID257
Feature activation-2.86e+00

Pos logit contributions
 eyel+0.017
gan+0.016
sing+0.015
 spine+0.015
 Moe+0.015
Neg logit contributions
Offic-0.028
ercise-0.023
corn-0.022
 permitting-0.022
Mic-0.022
Token stuffed
Token ID22259
Feature activation-8.04e+00

Pos logit contributions
Offic+0.074
ercise+0.059
 permitting+0.058
corn+0.058
Mic+0.057
Neg logit contributions
 eyel-0.068
gan-0.066
 spine-0.065
sing-0.065
 Moe-0.064
Token animal
Token ID5044
Feature activation-1.79e+00

Pos logit contributions
Offic+0.203
ercise+0.161
 permitting+0.157
corn+0.156
Mic+0.155
Neg logit contributions
 eyel-0.198
gan-0.190
sing-0.187
 spine-0.187
 Moe-0.186
Token!
Token ID0
Feature activation+1.09e+00

Pos logit contributions
Offic+0.047
corn+0.038
 permitting+0.038
 whip+0.038
ercise+0.038
Neg logit contributions
 eyel-0.040
gan-0.039
 spine-0.039
sing-0.038
 Moe-0.037
Token Esther
Token ID47533
Feature activation+1.01e+00

Pos logit contributions
 eyel+0.021
gan+0.021
 Moe+0.020
 spine+0.020
sing+0.020
Neg logit contributions
Offic-0.033
ercise-0.027
corn-0.027
 permitting-0.027
Mic-0.027
Token was
Token ID373
Feature activation+2.26e+00

Pos logit contributions
 spine+0.022
sing+0.022
 eyel+0.021
gan+0.020
 Moe+0.020
Neg logit contributions
Offic-0.028
"-0.024
corn-0.023
oc-0.023
 whip-0.022
Token so
Token ID523
Feature activation+1.46e+00

Pos logit contributions
 spine+0.051
oppy+0.051
 eyel+0.049
uffy+0.048
tub+0.047
Neg logit contributions
Offic-0.057
 permitting-0.050
 allowing-0.049
 whip-0.045
"-0.045
Token was
Token ID373
Feature activation+3.28e+00

Pos logit contributions
 eyel+0.016
 spine+0.016
sing+0.016
gan+0.015
oppy+0.015
Neg logit contributions
Offic-0.020
 permitting-0.017
corn-0.017
ercise-0.017
 whip-0.017
Token inside
Token ID2641
Feature activation+2.88e+00

Pos logit contributions
 eyel+0.071
gan+0.069
sing+0.068
 spine+0.067
 Moe+0.066
Neg logit contributions
Offic-0.090
 permitting-0.075
ercise-0.073
 allowing-0.073
corn-0.073
Token -
Token ID532
Feature activation+7.39e-01

Pos logit contributions
 eyel+0.056
gan+0.053
 Moe+0.052
 spine+0.052
sing+0.052
Neg logit contributions
Offic-0.088
ercise-0.072
Mic-0.071
corn-0.070
 permitting-0.070
Token cars
Token ID5006
Feature activation-3.77e+00

Pos logit contributions
 eyel+0.016
gan+0.015
 Moe+0.015
sing+0.015
 spine+0.015
Neg logit contributions
Offic-0.021
ercise-0.017
Mic-0.017
corn-0.017
 permitting-0.016
Token,
Token ID11
Feature activation-8.92e-01

Pos logit contributions
Offic+0.116
ercise+0.095
Mic+0.094
corn+0.093
 permitting+0.093
Neg logit contributions
 eyel-0.072
gan-0.068
 spine-0.067
sing-0.067
 Moe-0.066
Token stuffed
Token ID22259
Feature activation-8.02e+00

Pos logit contributions
Offic+0.024
ercise+0.020
Mic+0.019
corn+0.019
Sir+0.019
Neg logit contributions
 eyel-0.020
 Moe-0.019
gan-0.019
sing-0.019
 spine-0.019
Token animals
Token ID4695
Feature activation-2.77e+00

Pos logit contributions
Offic+0.189
ercise+0.151
 permitting+0.146
Mic+0.142
corn+0.141
Neg logit contributions
 eyel-0.202
 Moe-0.202
gan-0.199
 Flor-0.194
 Mil-0.190
Token,
Token ID11
Feature activation-9.01e-01

Pos logit contributions
Offic+0.086
ercise+0.071
Mic+0.069
corn+0.068
Sir+0.068
Neg logit contributions
 eyel-0.053
 spine-0.050
gan-0.049
 Moe-0.049
sing-0.049
Token and
Token ID290
Feature activation-2.77e+00

Pos logit contributions
Offic+0.025
ercise+0.020
Mic+0.020
corn+0.019
Sir+0.019
Neg logit contributions
 eyel-0.021
gan-0.019
sing-0.019
 Moe-0.019
 spine-0.019
Token a
Token ID257
Feature activation-3.69e+00

Pos logit contributions
Offic+0.076
ercise+0.061
Mic+0.060
corn+0.059
 permitting+0.058
Neg logit contributions
 eyel-0.063
gan-0.059
 Moe-0.059
sing-0.058
 spine-0.057
Token game
Token ID983
Feature activation-2.52e+00

Pos logit contributions
Offic+0.099
ercise+0.080
 permitting+0.078
corn+0.078
Mic+0.077
Neg logit contributions
 eyel-0.085
gan-0.081
sing-0.080
 spine-0.080
 Moe-0.079
Token books
Token ID3835
Feature activation-2.17e+00

Pos logit contributions
Offic+0.001
ercise+0.001
Mic+0.001
corn+0.001
ograph+0.001
Neg logit contributions
 eyel-0.001
 Mil-0.001
gan-0.001
sing-0.001
 Moe-0.001
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
Offic+0.081
ercise+0.069
Mic+0.069
Sir+0.066
corn+0.065
Neg logit contributions
 Moe-0.027
 eyel-0.027
gan-0.027
 spine-0.026
tub-0.026
Token and
Token ID290
Feature activation-2.29e+00

Pos logit contributions
Offic+0.007
ercise+0.005
Mic+0.005
ograph+0.005
corn+0.005
Neg logit contributions
 eyel-0.004
 spine-0.004
gan-0.004
 Mil-0.004
sing-0.004
Token even
Token ID772
Feature activation-2.50e+00

Pos logit contributions
Offic+0.063
ercise+0.050
Mic+0.049
corn+0.048
 permitting+0.048
Neg logit contributions
 eyel-0.053
 Moe-0.050
gan-0.050
 Mil-0.049
sing-0.048
Token a
Token ID257
Feature activation-2.92e+00

Pos logit contributions
Offic+0.071
ercise+0.056
corn+0.055
Mic+0.054
 permitting+0.054
Neg logit contributions
 eyel-0.056
gan-0.052
 Moe-0.052
 spine-0.051
sing-0.050
Token stuffed
Token ID22259
Feature activation-8.00e+00

Pos logit contributions
Offic+0.077
ercise+0.062
 permitting+0.060
corn+0.060
Mic+0.060
Neg logit contributions
 eyel-0.069
sing-0.065
gan-0.065
 spine-0.065
 Moe-0.063
Token animal
Token ID5044
Feature activation-1.96e+00

Pos logit contributions
Offic+0.211
ercise+0.170
 permitting+0.167
corn+0.165
Mic+0.163
Neg logit contributions
 eyel-0.188
gan-0.179
sing-0.177
 spine-0.177
 Moe-0.176
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
Offic+0.056
ercise+0.045
 permitting+0.044
Mic+0.044
corn+0.044
Neg logit contributions
 eyel-0.042
gan-0.040
sing-0.039
 spine-0.039
 Moe-0.039
Token After
Token ID2293
Feature activation+5.38e-01

Pos logit contributions
 eyel+0.040
sing+0.038
 spine+0.037
gan+0.037
 Moe+0.035
Neg logit contributions
Offic-0.049
ercise-0.040
 permitting-0.039
corn-0.039
Mic-0.039
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
 eyel+0.012
 spine+0.012
gan+0.012
sing+0.012
adow+0.011
Neg logit contributions
Offic-0.015
 permitting-0.012
corn-0.011
ercise-0.011
 allowing-0.011
Token few
Token ID1178
Feature activation-8.01e-01

Pos logit contributions
Offic+0.025
ercise+0.018
 permitting+0.017
corn+0.016
Mic+0.016
Neg logit contributions
 eyel-0.039
 spine-0.037
sing-0.037
gan-0.037
 Moe-0.037
Token leak
Token ID13044
Feature activation-2.44e+00

Pos logit contributions
Offic+0.037
ercise+0.030
 permitting+0.029
corn+0.029
Mic+0.028
Neg logit contributions
 eyel-0.031
sing-0.030
 spine-0.029
gan-0.029
 Moe-0.029
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
Offic+0.076
ercise+0.063
 permitting+0.062
corn+0.062
Mic+0.061
Neg logit contributions
 eyel-0.046
gan-0.044
sing-0.043
 spine-0.043
 Moe-0.042
Token The
Token ID383
Feature activation-1.27e+00

Pos logit contributions
 eyel+0.030
sing+0.029
 spine+0.028
oppy+0.028
adow+0.028
Neg logit contributions
Offic-0.032
Can-0.024
Sir-0.024
corn-0.024
"-0.023
Token toy
Token ID13373
Feature activation-2.41e+00

Pos logit contributions
Offic+0.036
 whip+0.030
ercise+0.029
Can+0.029
 permitting+0.029
Neg logit contributions
 eyel-0.027
gan-0.027
adow-0.026
 Moe-0.026
 spine-0.025
Token was
Token ID373
Feature activation+1.83e+00

Pos logit contributions
Offic+0.070
ercise+0.057
 whip+0.056
corn+0.056
 permitting+0.056
Neg logit contributions
 eyel-0.050
gan-0.048
 spine-0.048
 Moe-0.047
sing-0.047
Token stuffed
Token ID22259
Feature activation-7.99e+00

Pos logit contributions
 eyel+0.037
gan+0.035
sing+0.035
 spine+0.034
 Moe+0.034
Neg logit contributions
Offic-0.055
ercise-0.045
corn-0.044
Mic-0.044
 permitting-0.044
Token with
Token ID351
Feature activation-1.94e+00

Pos logit contributions
Offic+0.197
ercise+0.150
Mic+0.150
corn+0.150
 permitting+0.147
Neg logit contributions
 eyel-0.208
 spine-0.197
gan-0.196
sing-0.195
 Moe-0.188
Token stuffing
Token ID43672
Feature activation-2.30e+00

Pos logit contributions
Offic+0.055
ercise+0.044
 permitting+0.043
corn+0.043
Mic+0.043
Neg logit contributions
 eyel-0.042
gan-0.040
 spine-0.040
sing-0.040
 Moe-0.039
Token,
Token ID11
Feature activation-5.70e-02

Pos logit contributions
Offic+0.061
ercise+0.048
 permitting+0.047
corn+0.047
Mic+0.047
Neg logit contributions
 eyel-0.054
gan-0.052
 spine-0.051
sing-0.051
 Moe-0.050
Token but
Token ID475
Feature activation-2.31e-01

Pos logit contributions
Offic+0.002
ercise+0.001
Mic+0.001
corn+0.001
Sir+0.001
Neg logit contributions
 eyel-0.001
sing-0.001
 spine-0.001
gan-0.001
 Moe-0.001
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
Offic+0.005
ercise+0.004
 permitting+0.004
corn+0.004
 allowing+0.004
Neg logit contributions
 eyel-0.006
 spine-0.006
gan-0.006
sing-0.006
oppy-0.006
Token favourite
Token ID12507
Feature activation-3.21e+00

Pos logit contributions
 Maur+0.043
 eyel+0.043
gan+0.043
ahl+0.043
 Mil+0.042
Neg logit contributions
Offic-0.042
 whip-0.040
 permitting-0.036
 ribbon-0.036
 string-0.036
Token toy
Token ID13373
Feature activation-1.36e+00

Pos logit contributions
Offic+0.075
corn+0.060
 whip+0.060
 permitting+0.059
ercise+0.059
Neg logit contributions
 eyel-0.081
gan-0.081
 spine-0.079
sing-0.078
oppy-0.077
Token -
Token ID532
Feature activation+6.76e-01

Pos logit contributions
Offic+0.030
 whip+0.025
 permitting+0.024
corn+0.024
ercise+0.023
Neg logit contributions
 eyel-0.036
gan-0.035
 spine-0.034
sing-0.034
 Maur-0.033
Token a
Token ID257
Feature activation-3.02e+00

Pos logit contributions
 eyel+0.013
 Moe+0.013
gan+0.013
sing+0.012
 spine+0.012
Neg logit contributions
Offic-0.020
Mic-0.017
ercise-0.017
corn-0.016
 permitting-0.016
Token giant
Token ID6175
Feature activation-3.43e+00

Pos logit contributions
Offic+0.079
ercise+0.063
 permitting+0.062
corn+0.062
Mic+0.061
Neg logit contributions
 eyel-0.071
sing-0.067
gan-0.067
 spine-0.067
 Moe-0.066
Token stuffed
Token ID22259
Feature activation-7.97e+00

Pos logit contributions
Offic+0.092
ercise+0.073
 permitting+0.072
Mic+0.071
corn+0.071
Neg logit contributions
 eyel-0.080
gan-0.076
 spine-0.075
sing-0.075
 Moe-0.074
Token animal
Token ID5044
Feature activation-1.64e+00

Pos logit contributions
Offic+0.201
ercise+0.160
 permitting+0.157
corn+0.155
Mic+0.154
Neg logit contributions
 eyel-0.194
gan-0.187
 Moe-0.186
sing-0.185
 spine-0.184
Token he
Token ID339
Feature activation+1.45e+00

Pos logit contributions
Offic+0.043
ercise+0.034
corn+0.034
 permitting+0.033
Mic+0.033
Neg logit contributions
 eyel-0.039
gan-0.037
sing-0.037
 spine-0.037
 Moe-0.036
Token had
Token ID550
Feature activation+3.70e-01

Pos logit contributions
 eyel+0.027
gan+0.025
sing+0.025
 spine+0.025
 Moe+0.024
Neg logit contributions
Offic-0.046
ercise-0.038
corn-0.037
 permitting-0.037
Mic-0.037
Token been
Token ID587
Feature activation+3.20e+00

Pos logit contributions
 eyel+0.008
gan+0.007
 spine+0.007
sing+0.007
 Moe+0.007
Neg logit contributions
Offic-0.011
ercise-0.009
Mic-0.009
corn-0.009
 permitting-0.009
Token dreaming
Token ID28368
Feature activation+6.87e-01

Pos logit contributions
 eyel+0.067
gan+0.063
sing+0.062
 spine+0.062
 Moe+0.061
Neg logit contributions
Offic-0.094
ercise-0.076
 permitting-0.075
corn-0.075
Mic-0.074
Token something
Token ID1223
Feature activation+7.86e-03

Pos logit contributions
Offic+0.097
ercise+0.079
Mic+0.078
corn+0.078
 permitting+0.078
Neg logit contributions
 eyel-0.060
gan-0.057
 Moe-0.056
 spine-0.054
sing-0.054
Token new
Token ID649
Feature activation-2.72e+00

Pos logit contributions
 spine+0.000
sing+0.000
 eyel+0.000
gan+0.000
sed+0.000
Neg logit contributions
Offic-0.000
corn-0.000
ercise-0.000
 permitting-0.000
"-0.000
Token!
Token ID0
Feature activation+1.54e+00

Pos logit contributions
Offic+0.070
ercise+0.057
corn+0.056
 permitting+0.056
"+0.055
Neg logit contributions
 eyel-0.065
sing-0.062
gan-0.061
 spine-0.061
 Moe-0.059
Token She
Token ID1375
Feature activation-7.10e-01

Pos logit contributions
 eyel+0.033
sing+0.032
gan+0.031
 spine+0.031
 Moe+0.030
Neg logit contributions
Offic-0.044
ercise-0.036
 permitting-0.035
Mic-0.035
corn-0.035
Token thought
Token ID1807
Feature activation-1.26e-01

Pos logit contributions
Offic+0.022
ercise+0.019
Mic+0.018
Sir+0.018
 permitting+0.018
Neg logit contributions
 eyel-0.013
gan-0.013
 Moe-0.012
sing-0.012
oppy-0.012
Token av
Token ID1196
Feature activation-7.97e+00

Pos logit contributions
Offic+0.003
ercise+0.002
Mic+0.002
corn+0.002
 permitting+0.002
Neg logit contributions
 eyel-0.003
gan-0.003
 Moe-0.003
sing-0.003
 spine-0.003
Tokenoc
Token ID420
Feature activation-7.14e+00

Pos logit contributions
Offic+0.164
 permitting+0.146
 allowing+0.124
 letting+0.116
 groom+0.113
Neg logit contributions
adow-0.212
oppy-0.211
anda-0.197
pt-0.197
by-0.195
Tokenados
Token ID22484
Feature activation-4.61e+00

Pos logit contributions
Offic+0.144
 permitting+0.106
ercise+0.103
corn+0.099
Mic+0.099
Neg logit contributions
 eyel-0.211
gan-0.207
 Moe-0.203
sing-0.197
sed-0.195
Token were
Token ID547
Feature activation+8.55e-01

Pos logit contributions
Offic+0.157
ercise+0.131
 permitting+0.129
Mic+0.128
corn+0.128
Neg logit contributions
 eyel-0.074
gan-0.069
sing-0.067
 Moe-0.067
 spine-0.067
Token the
Token ID262
Feature activation+6.11e-01

Pos logit contributions
 eyel+0.015
gan+0.014
 Moe+0.014
 Flor+0.013
 Mil+0.013
Neg logit contributions
Offic-0.029
ercise-0.023
Mic-0.023
oc-0.023
Sir-0.023
Token most
Token ID749
Feature activation-2.48e+00

Pos logit contributions
 eyel+0.007
gan+0.007
 spine+0.006
sing+0.006
 Moe+0.006
Neg logit contributions
Offic-0.024
ercise-0.020
corn-0.020
 permitting-0.020
Mic-0.020
Token
Token ID198
Feature activation-2.06e+00

Pos logit contributions
 eyel+0.022
sing+0.021
 spine+0.021
gan+0.021
sed+0.020
Neg logit contributions
Offic-0.024
corn-0.018
ercise-0.018
 permitting-0.018
Mic-0.018
Token
Token ID220
Feature activation+2.12e+00

Pos logit contributions
Offic+0.059
ercise+0.048
 permitting+0.047
corn+0.047
Mic+0.047
Neg logit contributions
 eyel-0.044
gan-0.042
sing-0.041
 spine-0.041
 Moe-0.041
Token
Token ID198
Feature activation-2.43e+00

Pos logit contributions
 eyel+0.055
 spine+0.052
sing+0.052
gan+0.050
 Moe+0.049
Neg logit contributions
Offic-0.051
 permitting-0.040
ercise-0.039
"-0.039
corn-0.039
TokenThis
Token ID1212
Feature activation-1.47e+00

Pos logit contributions
Offic+0.045
ercise+0.037
 newspapers+0.036
 whip+0.035
 permitting+0.034
Neg logit contributions
gan-0.069
 Moe-0.069
 Flor-0.066
 eyel-0.065
 Mil-0.065
Token story
Token ID1621
Feature activation-4.19e-01

Pos logit contributions
Offic+0.049
Mic+0.043
ercise+0.041
 permitting+0.041
corn+0.039
Neg logit contributions
 eyel-0.024
 Moe-0.021
adow-0.021
gan-0.020
sing-0.020
Token teaches
Token ID17324
Feature activation-7.93e+00

Pos logit contributions
Offic+0.009
Mic+0.006
ercise+0.006
 permitting+0.006
corn+0.006
Neg logit contributions
 eyel-0.012
gan-0.012
 Moe-0.012
sing-0.011
 spine-0.011
Token us
Token ID514
Feature activation+8.29e-01

Pos logit contributions
Offic+0.192
ercise+0.149
 permitting+0.145
corn+0.145
Mic+0.144
Neg logit contributions
 eyel-0.206
gan-0.198
sing-0.195
 spine-0.194
 Moe-0.193
Token a
Token ID257
Feature activation-2.79e+00

Pos logit contributions
 eyel+0.024
 spine+0.024
sing+0.023
gan+0.023
 Moe+0.022
Neg logit contributions
Offic-0.015
ercise-0.013
 permitting-0.013
"-0.013
 allowing-0.012
Token valuable
Token ID8119
Feature activation-4.26e+00

Pos logit contributions
Offic+0.073
ercise+0.057
 permitting+0.056
Mic+0.056
corn+0.056
Neg logit contributions
 eyel-0.068
sing-0.064
gan-0.064
 spine-0.063
 Moe-0.062
Token lesson
Token ID11483
Feature activation-3.30e+00

Pos logit contributions
Offic+0.112
ercise+0.089
 permitting+0.087
corn+0.087
Mic+0.086
Neg logit contributions
 eyel-0.101
gan-0.097
sing-0.095
 spine-0.095
 Moe-0.094
Token:
Token ID25
Feature activation+3.29e-01

Pos logit contributions
Offic+0.101
ercise+0.083
 permitting+0.082
corn+0.082
Sir+0.080
Neg logit contributions
 eyel-0.064
gan-0.061
sing-0.060
 spine-0.060
 Moe-0.058
Token hands
Token ID2832
Feature activation-1.47e+00

Pos logit contributions
 eyel+0.051
gan+0.049
sing+0.048
 spine+0.048
 Moe+0.047
Neg logit contributions
Offic-0.068
ercise-0.055
 permitting-0.054
corn-0.054
Mic-0.054
Token to
Token ID284
Feature activation-1.06e-01

Pos logit contributions
Offic+0.040
ercise+0.033
corn+0.032
 permitting+0.032
 whip+0.032
Neg logit contributions
 eyel-0.033
gan-0.031
sing-0.030
 spine-0.030
 Moe-0.030
Token talk
Token ID1561
Feature activation-1.11e+00

Pos logit contributions
Offic+0.003
ercise+0.002
corn+0.002
Mic+0.002
 permitting+0.002
Neg logit contributions
 eyel-0.003
sing-0.002
 spine-0.002
gan-0.002
 Flor-0.002
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
Offic+0.034
ercise+0.028
Mic+0.028
 permitting+0.027
corn+0.027
Neg logit contributions
 eyel-0.022
gan-0.021
sing-0.020
 spine-0.020
 Moe-0.020
Token He
Token ID679
Feature activation-1.53e+00

Pos logit contributions
sing+0.037
 eyel+0.037
 spine+0.036
ail+0.034
sed+0.034
Neg logit contributions
Offic-0.040
Can-0.034
"-0.033
Sir-0.032
corn-0.031
Token teaches
Token ID17324
Feature activation-7.93e+00

Pos logit contributions
Offic+0.040
ercise+0.031
 permitting+0.031
Mic+0.031
corn+0.031
Neg logit contributions
 eyel-0.037
gan-0.035
sing-0.034
 spine-0.034
 Moe-0.034
Token Sara
Token ID24799
Feature activation-1.75e-02

Pos logit contributions
Offic+0.171
Sir+0.132
ercise+0.128
 permitting+0.127
corn+0.126
Neg logit contributions
 spine-0.223
sing-0.222
 eyel-0.219
gan-0.213
adow-0.210
Token some
Token ID617
Feature activation-3.05e+00

Pos logit contributions
Offic+0.000
corn+0.000
"+0.000
ercise+0.000
 whip+0.000
Neg logit contributions
 eyel-0.000
 spine-0.000
sing-0.000
 Moe-0.000
gan-0.000
Token signs
Token ID5895
Feature activation-3.59e+00

Pos logit contributions
Offic+0.080
Mic+0.062
ercise+0.062
 permitting+0.061
corn+0.061
Neg logit contributions
 eyel-0.075
sing-0.069
gan-0.069
 spine-0.068
 Moe-0.068
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
Offic+0.107
Mic+0.087
ercise+0.086
corn+0.085
 permitting+0.085
Neg logit contributions
 eyel-0.073
gan-0.071
sing-0.069
 spine-0.068
 Moe-0.068
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
 eyel+0.027
sing+0.026
 spine+0.026
gan+0.025
adow+0.024
Neg logit contributions
Offic-0.034
Can-0.028
Sir-0.027
"-0.027
corn-0.026
Tokenâ
Token ID22940
Feature activation-8.95e-01

Pos logit contributions
Offic+0.022
ercise+0.017
corn+0.017
 permitting+0.017
"+0.017
Neg logit contributions
 eyel-0.017
 spine-0.017
sing-0.016
gan-0.016
 Moe-0.016
Token
Token ID26391
Feature activation+5.98e+00

Pos logit contributions
Offic+0.024
ercise+0.020
Thank+0.019
corn+0.019
Mic+0.018
Neg logit contributions
 eyel-0.021
sing-0.020
 Moe-0.020
oppy-0.019
tub-0.019
Token
Token ID8151
Feature activation-8.75e-03

Pos logit contributions
 eyel+0.146
gan+0.139
sing+0.138
 spine+0.138
 Moe+0.136
Neg logit contributions
Offic-0.153
ercise-0.120
corn-0.118
 permitting-0.118
Mic-0.117
Tokens
Token ID82
Feature activation+1.32e-01

Pos logit contributions
Offic+0.000
"+0.000
Every+0.000
Together+0.000
 permitting+0.000
Neg logit contributions
 eyel-0.000
sing-0.000
 spine-0.000
sed-0.000
ses-0.000
Token story
Token ID1621
Feature activation-1.29e-01

Pos logit contributions
 spine+0.003
 eyel+0.003
gan+0.003
sing+0.003
tub+0.003
Neg logit contributions
Offic-0.003
 whip-0.003
"-0.003
ercise-0.003
 permitting-0.003
Token teaches
Token ID17324
Feature activation-7.93e+00

Pos logit contributions
Offic+0.004
"+0.003
ercise+0.003
 permitting+0.003
corn+0.003
Neg logit contributions
 eyel-0.003
sing-0.002
 spine-0.002
oppy-0.002
gan-0.002
Token us
Token ID514
Feature activation+6.34e-01

Pos logit contributions
Offic+0.185
ercise+0.142
 permitting+0.138
corn+0.138
Mic+0.137
Neg logit contributions
 eyel-0.213
gan-0.206
sing-0.202
 spine-0.202
 Moe-0.200
Token an
Token ID281
Feature activation-3.14e+00

Pos logit contributions
 spine+0.019
 eyel+0.018
sing+0.018
gan+0.017
sed+0.017
Neg logit contributions
Offic-0.011
"-0.010
ercise-0.010
 permitting-0.010
 allowing-0.009
Token important
Token ID1593
Feature activation-4.12e+00

Pos logit contributions
Offic+0.081
ercise+0.065
 permitting+0.064
 whip+0.063
corn+0.062
Neg logit contributions
 eyel-0.075
gan-0.072
 spine-0.071
sing-0.070
 Moe-0.070
Token lesson
Token ID11483
Feature activation-3.64e+00

Pos logit contributions
Offic+0.094
 whip+0.074
ercise+0.073
corn+0.071
 permitting+0.071
Neg logit contributions
gan-0.109
 eyel-0.109
 spine-0.104
 Moe-0.103
sing-0.103
Token -
Token ID532
Feature activation+1.39e-01

Pos logit contributions
Offic+0.103
 permitting+0.085
"+0.084
ercise+0.083
corn+0.083
Neg logit contributions
 eyel-0.078
sing-0.075
 spine-0.074
gan-0.073
 Moe-0.070
Token smart
Token ID4451
Feature activation-3.72e+00

Pos logit contributions
 eyel+0.044
gan+0.042
sing+0.042
 spine+0.041
 Moe+0.041
Neg logit contributions
Offic-0.058
ercise-0.047
corn-0.046
 permitting-0.046
Mic-0.046
Token and
Token ID290
Feature activation-1.94e+00

Pos logit contributions
Offic+0.110
ercise+0.089
Mic+0.089
Sir+0.085
corn+0.085
Neg logit contributions
 eyel-0.077
gan-0.074
 spine-0.074
 Moe-0.072
sing-0.072
Token funny
Token ID8258
Feature activation-1.81e+00

Pos logit contributions
Offic+0.054
ercise+0.043
 permitting+0.043
Mic+0.042
corn+0.042
Neg logit contributions
 eyel-0.043
gan-0.041
sing-0.041
 spine-0.040
 Moe-0.040
Token.
Token ID13
Feature activation+7.25e-01

Pos logit contributions
Offic+0.050
ercise+0.041
 permitting+0.040
corn+0.040
Mic+0.039
Neg logit contributions
 eyel-0.040
gan-0.038
sing-0.038
 spine-0.038
 Moe-0.037
Token He
Token ID679
Feature activation-1.76e+00

Pos logit contributions
 eyel+0.016
gan+0.015
sing+0.015
 spine+0.015
 Moe+0.015
Neg logit contributions
Offic-0.020
ercise-0.016
 permitting-0.016
corn-0.016
Mic-0.016
Token teaches
Token ID17324
Feature activation-7.93e+00

Pos logit contributions
Offic+0.045
ercise+0.037
corn+0.036
 permitting+0.035
 whip+0.035
Neg logit contributions
 eyel-0.042
gan-0.041
 spine-0.041
sing-0.040
 Moe-0.039
Token the
Token ID262
Feature activation-1.62e+00

Pos logit contributions
Offic+0.196
ercise+0.154
corn+0.149
Mic+0.149
 permitting+0.148
Neg logit contributions
 eyel-0.201
 Moe-0.191
gan-0.190
 Mil-0.183
sing-0.182
Token mean
Token ID1612
Feature activation-4.41e+00

Pos logit contributions
Offic+0.042
 permitting+0.033
ercise+0.033
corn+0.032
 whip+0.032
Neg logit contributions
 eyel-0.039
gan-0.038
 spine-0.037
sing-0.037
 Moe-0.036
Token boy
Token ID2933
Feature activation-7.63e-01

Pos logit contributions
Offic+0.126
ercise+0.102
corn+0.101
 permitting+0.101
Sir+0.099
Neg logit contributions
 eyel-0.095
gan-0.089
 spine-0.089
sing-0.088
 Moe-0.086
Token a
Token ID257
Feature activation-2.22e+00

Pos logit contributions
Offic+0.019
ercise+0.015
corn+0.015
 permitting+0.015
 whip+0.014
Neg logit contributions
 eyel-0.019
 spine-0.018
sing-0.018
gan-0.018
 Moe-0.018
Token lesson
Token ID11483
Feature activation-2.74e+00

Pos logit contributions
Offic+0.057
ercise+0.045
 permitting+0.044
corn+0.044
Mic+0.043
Neg logit contributions
 eyel-0.054
gan-0.052
 spine-0.051
sing-0.051
 Moe-0.051
Tokenig
Token ID328
Feature activation+4.80e+00

Pos logit contributions
 eyel+0.025
gan+0.023
sing+0.023
 spine+0.023
 Moe+0.023
Neg logit contributions
Offic-0.040
ercise-0.033
corn-0.032
 permitting-0.032
Mic-0.032
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
 eyel+0.103
gan+0.099
sing+0.097
 spine+0.097
 Moe+0.096
Neg logit contributions
Offic-0.136
ercise-0.110
 permitting-0.108
corn-0.108
Mic-0.107
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
 eyel+0.032
gan+0.031
 Moe+0.030
 spine+0.030
sing+0.030
Neg logit contributions
Offic-0.053
ercise-0.044
corn-0.043
 permitting-0.043
Mic-0.043
TokenThis
Token ID1212
Feature activation-1.34e+00

Pos logit contributions
Offic+0.059
ercise+0.053
corn+0.051
 whip+0.050
 newspapers+0.050
Neg logit contributions
 Moe-0.027
gan-0.025
 eyel-0.025
oppy-0.025
adow-0.024
Token story
Token ID1621
Feature activation-3.04e-01

Pos logit contributions
Offic+0.041
ercise+0.034
 permitting+0.033
Mic+0.033
corn+0.033
Neg logit contributions
 eyel-0.026
gan-0.024
sing-0.024
 spine-0.024
 Moe-0.024
Token teaches
Token ID17324
Feature activation-7.90e+00

Pos logit contributions
Offic+0.007
ercise+0.005
Mic+0.005
 permitting+0.005
corn+0.005
Neg logit contributions
 eyel-0.009
gan-0.008
 Moe-0.008
sing-0.008
 spine-0.008
Token us
Token ID514
Feature activation+7.89e-01

Pos logit contributions
Offic+0.181
ercise+0.137
 permitting+0.133
corn+0.133
Mic+0.133
Neg logit contributions
 eyel-0.215
gan-0.207
 Moe-0.204
sing-0.201
 spine-0.201
Token that
Token ID326
Feature activation-9.89e-01

Pos logit contributions
 spine+0.024
 eyel+0.024
sing+0.023
gan+0.023
sed+0.022
Neg logit contributions
Offic-0.013
ercise-0.012
"-0.012
 permitting-0.011
 allowing-0.011
Token when
Token ID618
Feature activation+1.98e+00

Pos logit contributions
Offic+0.024
Mic+0.018
ercise+0.018
corn+0.018
Sir+0.017
Neg logit contributions
 eyel-0.026
sing-0.025
gan-0.025
 spine-0.025
 Moe-0.024
Token we
Token ID356
Feature activation+4.81e-01

Pos logit contributions
 eyel+0.055
gan+0.053
 spine+0.053
sing+0.053
 Moe+0.051
Neg logit contributions
Offic-0.043
ercise-0.033
 permitting-0.033
corn-0.032
 allowing-0.031
Token work
Token ID670
Feature activation-1.08e-02

Pos logit contributions
 eyel+0.011
gan+0.010
sing+0.010
 spine+0.010
 Moe+0.010
Neg logit contributions
Offic-0.013
ercise-0.010
corn-0.010
 permitting-0.010
Mic-0.010
Token play
Token ID711
Feature activation-4.56e-01

Pos logit contributions
 eyel+0.009
gan+0.009
 spine+0.009
sing+0.009
 Moe+0.008
Neg logit contributions
Offic-0.009
ercise-0.007
 permitting-0.007
corn-0.007
Mic-0.007
Token the
Token ID262
Feature activation+2.97e+00

Pos logit contributions
Offic+0.011
ercise+0.009
 permitting+0.009
corn+0.009
Mic+0.009
Neg logit contributions
 eyel-0.011
gan-0.011
sing-0.011
 spine-0.011
 Moe-0.011
Token fl
Token ID781
Feature activation+6.25e-01

Pos logit contributions
 eyel+0.057
sing+0.053
 toilet+0.051
 Flor+0.051
 spine+0.050
Neg logit contributions
Offic-0.090
Mic-0.075
oc-0.074
ercise-0.073
Sir-0.071
Tokenute
Token ID1133
Feature activation+1.78e+00

Pos logit contributions
 eyel+0.010
sing+0.009
gan+0.009
 spine+0.009
oppy+0.009
Neg logit contributions
Offic-0.022
ercise-0.018
corn-0.018
Mic-0.018
 permitting-0.018
Token again
Token ID757
Feature activation+4.04e+00

Pos logit contributions
 eyel+0.039
gan+0.037
 spine+0.036
sing+0.036
 Moe+0.036
Neg logit contributions
Offic-0.049
ercise-0.041
 permitting-0.040
corn-0.040
 whip-0.039
Token tomorrow
Token ID9439
Feature activation+5.74e+00

Pos logit contributions
 eyel+0.062
gan+0.061
adow+0.059
 spine+0.059
oppy+0.057
Neg logit contributions
Offic-0.139
ercise-0.116
Mic-0.115
 permitting-0.114
corn-0.113
Token.
Token ID13
Feature activation+2.17e+00

Pos logit contributions
 eyel+0.098
gan+0.093
 spine+0.092
sing+0.091
 Moe+0.089
Neg logit contributions
Offic-0.189
ercise-0.158
 permitting-0.155
Mic-0.155
corn-0.155
Token<|endoftext|>
Token ID50256
Feature activation+3.67e+00

Pos logit contributions
 eyel+0.056
sing+0.056
 spine+0.054
gan+0.053
anda+0.050
Neg logit contributions
Offic-0.055
oc-0.044
Thank-0.042
"-0.042
ercise-0.042
TokenOnce
Token ID7454
Feature activation+5.87e+00

Pos logit contributions
 eyel+0.098
sing+0.096
 spine+0.095
gan+0.094
sed+0.090
Neg logit contributions
Offic-0.086
corn-0.066
Can-0.065
Sir-0.065
 permitting-0.064
Token upon
Token ID2402
Feature activation+1.34e+00

Pos logit contributions
 eyel+0.122
sing+0.110
 spine+0.109
 thirst+0.102
sed+0.101
Neg logit contributions
"-0.159
Offic-0.159
Every-0.144
corn-0.136
 permitting-0.136
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
 spine+0.031
 eyel+0.030
sing+0.029
gan+0.029
 Moe+0.029
Neg logit contributions
Offic-0.036
Can-0.030
 permitting-0.029
"-0.029
corn-0.028
Token and
Token ID290
Feature activation-7.79e-01

Pos logit contributions
Offic+0.009
ercise+0.007
Mic+0.007
corn+0.007
 permitting+0.007
Neg logit contributions
 eyel-0.008
gan-0.007
 spine-0.007
sing-0.007
 Moe-0.007
Token soon
Token ID2582
Feature activation+1.61e+00

Pos logit contributions
Offic+0.021
Mic+0.018
ercise+0.017
 permitting+0.016
corn+0.016
Neg logit contributions
 eyel-0.018
gan-0.017
 Moe-0.017
 Mil-0.016
sing-0.016
Token she
Token ID673
Feature activation+1.34e+00

Pos logit contributions
 eyel+0.044
sing+0.044
 spine+0.043
gan+0.041
sed+0.040
Neg logit contributions
Offic-0.037
corn-0.029
 permitting-0.028
ercise-0.028
 allowing-0.027
Token was
Token ID373
Feature activation+4.03e+00

Pos logit contributions
 eyel+0.023
gan+0.022
sing+0.022
 spine+0.021
 Moe+0.021
Neg logit contributions
Offic-0.044
ercise-0.037
 permitting-0.036
corn-0.036
Mic-0.036
Token tapping
Token ID24435
Feature activation-1.09e+00

Pos logit contributions
 eyel+0.089
 spine+0.087
sing+0.085
gan+0.085
 Moe+0.084
Neg logit contributions
Offic-0.106
 permitting-0.090
 allowing-0.087
ercise-0.085
 whip-0.083
Token her
Token ID607
Feature activation+5.94e+00

Pos logit contributions
Offic+0.026
ercise+0.022
 permitting+0.021
corn+0.021
 whip+0.020
Neg logit contributions
 eyel-0.027
gan-0.026
sing-0.026
 spine-0.026
oppy-0.025
Token feet
Token ID3625
Feature activation+6.10e-01

Pos logit contributions
 eyel+0.137
gan+0.136
 Moe+0.133
sing+0.131
 spine+0.129
Neg logit contributions
Offic-0.151
ercise-0.124
 permitting-0.121
 whip-0.120
corn-0.119
Token along
Token ID1863
Feature activation+1.49e+00

Pos logit contributions
 eyel+0.013
gan+0.012
 spine+0.012
sing+0.012
 Moe+0.012
Neg logit contributions
Offic-0.018
ercise-0.014
corn-0.014
Mic-0.014
 permitting-0.014
Token with
Token ID351
Feature activation-1.78e-01

Pos logit contributions
 eyel+0.031
gan+0.030
sing+0.029
 spine+0.029
 Moe+0.029
Neg logit contributions
Offic-0.043
ercise-0.035
 permitting-0.034
corn-0.034
Mic-0.034
Token the
Token ID262
Feature activation+2.15e+00

Pos logit contributions
Offic+0.005
ercise+0.004
corn+0.004
Mic+0.004
 permitting+0.004
Neg logit contributions
 eyel-0.004
gan-0.004
sing-0.004
 Moe-0.004
 spine-0.004
Token music
Token ID2647
Feature activation+1.99e-01

Pos logit contributions
 eyel+0.054
 spine+0.049
sing+0.048
tub+0.045
 Moe+0.045
Neg logit contributions
Offic-0.059
Mic-0.046
oc-0.046
corn-0.046
ercise-0.044
Token him
Token ID683
Feature activation+4.59e+00

Pos logit contributions
 eyel+0.011
 spine+0.011
gan+0.011
sing+0.011
oppy+0.011
Neg logit contributions
Offic-0.011
Sir-0.009
 permitting-0.009
ercise-0.008
corn-0.008
Token.
Token ID13
Feature activation+8.78e-01

Pos logit contributions
gan+0.110
 eyel+0.109
 Moe+0.106
 thirst+0.106
agg+0.104
Neg logit contributions
Offic-0.107
"-0.091
 whip-0.087
Can-0.084
oc-0.083
Token
Token ID198
Feature activation-1.84e+00

Pos logit contributions
 eyel+0.022
 spine+0.021
gan+0.021
sing+0.021
oppy+0.021
Neg logit contributions
Offic-0.022
ercise-0.017
corn-0.017
"-0.016
 permitting-0.016
Token
Token ID198
Feature activation-2.12e+00

Pos logit contributions
Offic+0.060
ercise+0.054
 whip+0.053
 permitting+0.053
Mic+0.052
Neg logit contributions
 Moe-0.028
ia-0.027
gan-0.026
 Mil-0.025
adow-0.025
Tokenâ
Token ID22940
Feature activation-3.73e-01

Pos logit contributions
Offic+0.033
burn+0.030
 whip+0.029
 permitting+0.028
ograph+0.028
Neg logit contributions
ia-0.064
gan-0.062
adow-0.061
 Moe-0.061
agg-0.060
Token
Token ID26391
Feature activation+6.36e+00

Pos logit contributions
Offic+0.011
ercise+0.009
corn+0.009
Sir+0.008
Thank+0.008
Neg logit contributions
 eyel-0.008
gan-0.008
sing-0.007
 Moe-0.007
oppy-0.007
Token
Token ID129
Feature activation-5.44e-01

Pos logit contributions
 eyel+0.140
gan+0.135
 spine+0.133
sing+0.133
 Moe+0.130
Neg logit contributions
Offic-0.175
ercise-0.143
 permitting-0.140
Mic-0.139
corn-0.139
Token
Token ID241
Feature activation-1.58e+00

Pos logit contributions
Offic+0.014
ercise+0.011
corn+0.011
 permitting+0.011
Mic+0.011
Neg logit contributions
 eyel-0.013
gan-0.012
sing-0.012
 spine-0.012
 Moe-0.012
TokenPlease
Token ID5492
Feature activation+9.42e-01

Pos logit contributions
Offic+0.029
ercise+0.021
Mic+0.020
 permitting+0.020
corn+0.020
Neg logit contributions
 eyel-0.049
gan-0.048
 Moe-0.047
 spine-0.047
sing-0.047
Token help
Token ID1037
Feature activation-1.73e+00

Pos logit contributions
 eyel+0.021
sing+0.020
 spine+0.020
gan+0.020
 Moe+0.019
Neg logit contributions
Offic-0.026
ercise-0.021
 permitting-0.021
corn-0.021
Mic-0.021
Token me
Token ID502
Feature activation+3.60e+00

Pos logit contributions
Offic+0.040
ercise+0.031
 permitting+0.030
corn+0.030
Mic+0.030
Neg logit contributions
 eyel-0.047
gan-0.045
sing-0.044
 spine-0.044
 Moe-0.044
Token by
Token ID416
Feature activation+2.23e+00

Pos logit contributions
 eyel+0.064
gan+0.060
 spine+0.060
 Moe+0.059
sing+0.058
Neg logit contributions
Offic-0.081
 permitting-0.070
 whip-0.068
ercise-0.067
 allowing-0.067
Token the
Token ID262
Feature activation+1.63e+00

Pos logit contributions
 eyel+0.053
 Moe+0.050
gan+0.049
 spine+0.049
sing+0.048
Neg logit contributions
Offic-0.059
ercise-0.049
Mic-0.048
Sir-0.047
corn-0.046
Token lake
Token ID13546
Feature activation+1.08e+00

Pos logit contributions
 swamp+0.030
 Moe+0.030
 toilet+0.029
 eyel+0.028
tub+0.027
Neg logit contributions
Offic-0.053
Mic-0.043
corn-0.043
ercise-0.043
oc-0.042
Token?
Token ID30
Feature activation+1.57e+00

Pos logit contributions
 eyel+0.028
 spine+0.026
gan+0.026
sing+0.026
 Moe+0.025
Neg logit contributions
Offic-0.026
ercise-0.021
corn-0.021
 permitting-0.021
 whip-0.020
Tokenâ
Token ID22940
Feature activation-3.01e-01

Pos logit contributions
 eyel+0.029
 Moe+0.029
gan+0.029
 Flor+0.027
 Mil+0.027
Neg logit contributions
Offic-0.048
ercise-0.040
 permitting-0.039
corn-0.038
Mic-0.038
Token
Token ID26391
Feature activation+6.22e+00

Pos logit contributions
Offic+0.010
ercise+0.008
corn+0.008
 permitting+0.008
Mic+0.008
Neg logit contributions
 eyel-0.005
gan-0.005
sing-0.005
 spine-0.005
 Moe-0.005
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
gan+0.148
adow+0.147
 eyel+0.146
oppy+0.143
 Moe+0.142
Neg logit contributions
Offic-0.151
ercise-0.122
 permitting-0.121
 whip-0.120
Mic-0.118
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
 Moe+0.020
 eyel+0.019
gan+0.019
 Flor+0.018
 Mil+0.018
Neg logit contributions
Offic-0.031
ercise-0.026
 whip-0.025
 permitting-0.025
Mic-0.025
Token
Token ID198
Feature activation-2.19e+00

Pos logit contributions
Offic+0.048
ercise+0.038
corn+0.038
 permitting+0.037
Mic+0.037
Neg logit contributions
 eyel-0.043
gan-0.041
sing-0.040
 spine-0.040
 Moe-0.039
TokenThe
Token ID464
Feature activation-1.54e+00

Pos logit contributions
Offic+0.058
ercise+0.049
 whip+0.049
 newspapers+0.048
 photographer+0.047
Neg logit contributions
 Moe-0.049
gan-0.045
ola-0.043
 Nick-0.043
 eyel-0.043
Token all
Token ID477
Feature activation-2.43e+00

Pos logit contributions
Offic+0.045
ercise+0.037
corn+0.036
 permitting+0.035
Mic+0.035
Neg logit contributions
 eyel-0.031
 Moe-0.031
gan-0.031
sing-0.029
 Flor-0.029
Token and
Token ID290
Feature activation-2.21e+00

Pos logit contributions
 eyel+0.054
gan+0.051
 spine+0.051
sing+0.051
 Moe+0.050
Neg logit contributions
Offic-0.075
ercise-0.061
Mic-0.060
corn-0.059
 permitting-0.059
Token said
Token ID531
Feature activation+8.51e-01

Pos logit contributions
Offic+0.053
ercise+0.041
 permitting+0.040
corn+0.040
Mic+0.039
Neg logit contributions
 eyel-0.058
gan-0.055
 spine-0.055
sing-0.055
 Moe-0.054
Token:
Token ID25
Feature activation+6.25e-01

Pos logit contributions
 spine+0.019
sing+0.019
 eyel+0.018
adow+0.018
gan+0.017
Neg logit contributions
Offic-0.022
"-0.020
ercise-0.018
Sir-0.018
corn-0.018
Token
Token ID6184
Feature activation-1.28e+00

Pos logit contributions
 eyel+0.016
gan+0.015
sing+0.015
 Moe+0.015
 spine+0.015
Neg logit contributions
Offic-0.016
ercise-0.012
corn-0.012
 permitting-0.012
Mic-0.012
Token
Token ID95
Feature activation-1.35e+00

Pos logit contributions
Offic+0.035
ercise+0.029
corn+0.028
"+0.028
Sir+0.027
Neg logit contributions
 eyel-0.029
sing-0.027
 spine-0.027
gan-0.027
sed-0.026
Token
Token ID26391
Feature activation+5.56e+00

Pos logit contributions
Offic+0.037
ercise+0.030
corn+0.029
 permitting+0.029
Mic+0.029
Neg logit contributions
 eyel-0.031
gan-0.029
sing-0.029
 spine-0.029
 Moe-0.028
Token
Token ID129
Feature activation-1.10e+00

Pos logit contributions
 eyel+0.127
gan+0.127
adow+0.122
 spine+0.121
sing+0.121
Neg logit contributions
Offic-0.146
ercise-0.119
 permitting-0.117
Mic-0.115
 whip-0.114
Token
Token ID241
Feature activation-1.84e+00

Pos logit contributions
Offic+0.025
ercise+0.019
 permitting+0.018
corn+0.018
Mic+0.018
Neg logit contributions
 eyel-0.030
gan-0.029
sing-0.029
 spine-0.029
 Moe-0.029
TokenI
Token ID40
Feature activation-1.46e+00

Pos logit contributions
Offic+0.039
ercise+0.029
 permitting+0.028
corn+0.028
Sir+0.027
Neg logit contributions
 eyel-0.053
sing-0.051
gan-0.051
 spine-0.051
 Moe-0.049
Token need
Token ID761
Feature activation-3.34e+00

Pos logit contributions
Offic+0.043
ercise+0.036
corn+0.035
 permitting+0.035
Mic+0.035
Neg logit contributions
 eyel-0.029
gan-0.028
 spine-0.027
sing-0.027
 Moe-0.027
Token food
Token ID2057
Feature activation-2.83e+00

Pos logit contributions
Offic+0.091
ercise+0.073
 permitting+0.072
corn+0.071
Mic+0.070
Neg logit contributions
 eyel-0.076
gan-0.073
sing-0.072
 spine-0.072
 Moe-0.071
Token to
Token ID284
Feature activation-2.17e+00

Pos logit contributions
Offic+0.057
 permitting+0.044
Sir+0.043
ercise+0.043
 allowing+0.043
Neg logit contributions
 eyel-0.063
sing-0.062
gan-0.060
 spine-0.060
 Moe-0.059
Token always
Token ID1464
Feature activation-9.70e-01

Pos logit contributions
Offic+0.066
corn+0.056
ercise+0.053
Mic+0.052
oc+0.052
Neg logit contributions
 eyel-0.043
 spine-0.040
gan-0.040
sing-0.040
adow-0.038
Token ask
Token ID1265
Feature activation+5.02e-01

Pos logit contributions
Offic+0.032
corn+0.026
Mic+0.026
ercise+0.025
Sir+0.025
Neg logit contributions
 eyel-0.018
 spine-0.016
adow-0.016
sing-0.016
gan-0.015
Token for
Token ID329
Feature activation-1.75e+00

Pos logit contributions
 eyel+0.010
gan+0.010
 spine+0.010
sing+0.009
 Moe+0.009
Neg logit contributions
Offic-0.015
ercise-0.012
corn-0.012
Mic-0.012
 permitting-0.012
Token help
Token ID1037
Feature activation-9.92e-01

Pos logit contributions
Offic+0.055
corn+0.048
ercise+0.047
Mic+0.047
Aud+0.046
Neg logit contributions
 eyel-0.031
sing-0.027
 spine-0.027
 toilet-0.027
 Flor-0.027
Token when
Token ID618
Feature activation+2.58e+00

Pos logit contributions
Offic+0.035
ercise+0.030
corn+0.029
Mic+0.028
 permitting+0.028
Neg logit contributions
 eyel-0.016
 spine-0.014
gan-0.014
sing-0.014
adow-0.013
Token she
Token ID673
Feature activation+2.95e+00

Pos logit contributions
gan+0.069
 spine+0.069
sing+0.069
 eyel+0.068
adow+0.066
Neg logit contributions
Offic-0.056
 permitting-0.047
 allowing-0.045
"-0.044
Can-0.043
Token needed
Token ID2622
Feature activation-7.91e-01

Pos logit contributions
 eyel+0.061
gan+0.059
sing+0.058
 spine+0.058
 Moe+0.057
Neg logit contributions
Offic-0.087
ercise-0.071
corn-0.069
 permitting-0.069
Sir-0.068
Token it
Token ID340
Feature activation+3.49e+00

Pos logit contributions
Offic+0.020
Mic+0.015
ercise+0.015
corn+0.015
oc+0.014
Neg logit contributions
 eyel-0.021
 spine-0.019
gan-0.019
 Moe-0.019
sing-0.019
Token.
Token ID13
Feature activation+2.21e+00

Pos logit contributions
 eyel+0.070
gan+0.067
sing+0.066
 Moe+0.065
 spine+0.065
Neg logit contributions
Offic-0.102
ercise-0.085
 permitting-0.085
corn-0.083
 allowing-0.082
Token
Token ID198
Feature activation-1.49e+00

Pos logit contributions
 eyel+0.057
sing+0.056
 spine+0.055
gan+0.054
sed+0.051
Neg logit contributions
Offic-0.055
Mic-0.042
corn-0.042
ercise-0.041
"-0.041
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
Offic+0.054
ercise+0.045
 permitting+0.044
Mic+0.044
corn+0.044
Neg logit contributions
 eyel-0.031
gan-0.029
 spine-0.029
sing-0.029
 Moe-0.028
Token She
Token ID1375
Feature activation-1.07e+00

Pos logit contributions
 eyel+0.038
sing+0.036
gan+0.036
 spine+0.036
 Moe+0.035
Neg logit contributions
Offic-0.048
ercise-0.039
corn-0.038
 permitting-0.038
Mic-0.038
Token made
Token ID925
Feature activation-2.02e+00

Pos logit contributions
Offic+0.029
ercise+0.023
corn+0.023
 permitting+0.023
Mic+0.022
Neg logit contributions
 eyel-0.025
 spine-0.024
gan-0.024
sing-0.023
 Moe-0.023
Token a
Token ID257
Feature activation-1.95e+00

Pos logit contributions
Offic+0.052
ercise+0.043
 whip+0.043
 permitting+0.042
corn+0.042
Neg logit contributions
 spine-0.047
 eyel-0.046
gan-0.045
sing-0.045
adow-0.043
Token heart
Token ID2612
Feature activation-4.52e-01

Pos logit contributions
Offic+0.054
ercise+0.043
 permitting+0.043
corn+0.043
Mic+0.042
Neg logit contributions
 eyel-0.044
gan-0.042
sing-0.041
 spine-0.041
 Moe-0.040
Token out
Token ID503
Feature activation+1.29e+00

Pos logit contributions
Offic+0.012
ercise+0.010
Mic+0.009
corn+0.009
 permitting+0.009
Neg logit contributions
 eyel-0.011
gan-0.010
sing-0.010
 spine-0.010
 Moe-0.010
Token of
Token ID286
Feature activation-1.54e+00

Pos logit contributions
 eyel+0.027
gan+0.025
sing+0.025
 Moe+0.024
 Flor+0.024
Neg logit contributions
Offic-0.038
ercise-0.032
Mic-0.032
corn-0.030
Sir-0.029
Token red
Token ID2266
Feature activation-1.49e+00

Pos logit contributions
Offic+0.043
ercise+0.035
corn+0.034
Mic+0.034
 permitting+0.034
Neg logit contributions
 eyel-0.035
sing-0.032
gan-0.032
 spine-0.032
 Moe-0.032
Token cloth
Token ID16270
Feature activation+9.09e-01

Pos logit contributions
Offic+0.036
ercise+0.028
corn+0.027
 permitting+0.027
 whip+0.027
Neg logit contributions
 eyel-0.038
gan-0.038
sing-0.037
 spine-0.036
 Moe-0.036
Token and
Token ID290
Feature activation-1.86e+00

Pos logit contributions
 eyel+0.022
gan+0.021
sing+0.020
 spine+0.020
 Moe+0.020
Neg logit contributions
Offic-0.024
ercise-0.019
 permitting-0.019
corn-0.019
Mic-0.018
Token st
Token ID336
Feature activation+9.74e-01

Pos logit contributions
Offic+0.050
Mic+0.039
Sir+0.039
corn+0.038
ercise+0.038
Neg logit contributions
 eyel-0.048
sing-0.042
gan-0.041
 spine-0.041
 Mil-0.041
Token large
Token ID1588
Feature activation-2.38e+00

Pos logit contributions
 eyel+0.007
gan+0.007
sing+0.007
 spine+0.007
 Moe+0.007
Neg logit contributions
Offic-0.010
ercise-0.008
 permitting-0.008
corn-0.008
Mic-0.007
Token rock
Token ID3881
Feature activation-8.30e-02

Pos logit contributions
Offic+0.067
ercise+0.054
corn+0.053
 permitting+0.053
Mic+0.053
Neg logit contributions
 eyel-0.051
gan-0.049
sing-0.048
 spine-0.048
 Moe-0.047
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
Offic+0.002
ercise+0.002
corn+0.002
 permitting+0.002
Mic+0.002
Neg logit contributions
 eyel-0.002
gan-0.002
sing-0.002
 spine-0.002
 Moe-0.002
Token John
Token ID1757
Feature activation-2.19e-01

Pos logit contributions
 eyel+0.030
gan+0.030
 Moe+0.029
 spine+0.029
sing+0.029
Neg logit contributions
Offic-0.040
ercise-0.033
 permitting-0.032
corn-0.032
Mic-0.032
Token was
Token ID373
Feature activation+2.26e+00

Pos logit contributions
Offic+0.006
"+0.005
corn+0.005
oc+0.005
 whip+0.005
Neg logit contributions
 eyel-0.005
 spine-0.005
sing-0.005
gan-0.005
 Moe-0.005
Token so
Token ID523
Feature activation+1.27e+00

Pos logit contributions
 spine+0.057
oppy+0.055
uffy+0.055
 eyel+0.054
ahl+0.052
Neg logit contributions
Offic-0.050
 permitting-0.046
 allowing-0.045
"-0.041
 letting-0.041
Token excited
Token ID6568
Feature activation-3.10e+00

Pos logit contributions
 spine+0.036
 eyel+0.036
sing+0.033
uffy+0.033
gan+0.033
Neg logit contributions
Offic-0.025
"-0.021
 permitting-0.021
Thank-0.021
 allowing-0.020
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
Offic+0.071
corn+0.057
ercise+0.057
 permitting+0.056
 allowing+0.055
Neg logit contributions
 eyel-0.082
sing-0.078
gan-0.078
 spine-0.076
oppy-0.075
Token
Token ID198
Feature activation-1.55e+00

Pos logit contributions
 eyel+0.038
sing+0.036
 spine+0.035
gan+0.035
oppy+0.033
Neg logit contributions
Offic-0.056
ercise-0.045
 permitting-0.045
Sir-0.045
Mic-0.045
Token
Token ID198
Feature activation-1.93e+00

Pos logit contributions
Offic+0.048
ercise+0.043
corn+0.040
 whip+0.040
 permitting+0.040
Neg logit contributions
 Moe-0.027
gan-0.026
 eyel-0.025
 Mil-0.024
adow-0.024
TokenThey
Token ID2990
Feature activation-1.15e+00

Pos logit contributions
Offic+0.044
ercise+0.044
 whip+0.042
corn+0.041
 stamps+0.041
Neg logit contributions
gan-0.040
 Moe-0.039
agg-0.037
 Megan-0.037
 Nick-0.036
TokenThe
Token ID464
Feature activation-1.01e+00

Pos logit contributions
Offic+0.046
ercise+0.040
corn+0.038
 permitting+0.038
 whip+0.037
Neg logit contributions
 eyel-0.029
gan-0.028
 Moe-0.028
 Mil-0.026
oppy-0.026
Token brother
Token ID3956
Feature activation-2.47e+00

Pos logit contributions
Offic+0.032
ercise+0.026
corn+0.026
 permitting+0.026
Mic+0.026
Neg logit contributions
 eyel-0.019
sing-0.018
gan-0.017
 spine-0.017
 Moe-0.017
Token and
Token ID290
Feature activation-3.35e+00

Pos logit contributions
Offic+0.067
ercise+0.053
corn+0.053
 permitting+0.052
Mic+0.052
Neg logit contributions
 eyel-0.057
 spine-0.054
sing-0.054
gan-0.053
 Moe-0.052
Token sister
Token ID6621
Feature activation-2.01e+00

Pos logit contributions
Offic+0.118
ercise+0.100
 permitting+0.099
corn+0.098
Mic+0.098
Neg logit contributions
 eyel-0.049
gan-0.046
 spine-0.046
sing-0.045
 Moe-0.044
Token stayed
Token ID9658
Feature activation+2.36e+00

Pos logit contributions
Offic+0.059
ercise+0.049
 permitting+0.048
corn+0.047
 whip+0.047
Neg logit contributions
 eyel-0.041
 spine-0.039
gan-0.038
sing-0.038
 Moe-0.037
Token mad
Token ID8805
Feature activation+8.86e-01

Pos logit contributions
 eyel+0.052
gan+0.050
 spine+0.049
sing+0.049
 Moe+0.048
Neg logit contributions
Offic-0.066
ercise-0.053
 permitting-0.053
corn-0.052
 whip-0.051
Token at
Token ID379
Feature activation-1.10e-01

Pos logit contributions
 eyel+0.024
gan+0.023
oppy+0.023
sing+0.022
adow+0.022
Neg logit contributions
Offic-0.018
 allowing-0.015
 permitting-0.015
"-0.015
 letting-0.014
Token each
Token ID1123
Feature activation-9.11e-02

Pos logit contributions
Offic+0.002
 permitting+0.002
 whip+0.002
ercise+0.002
 allowing+0.002
Neg logit contributions
 eyel-0.003
 spine-0.003
gan-0.003
sing-0.003
 Moe-0.003
Token other
Token ID584
Feature activation+3.16e+00

Pos logit contributions
Offic+0.003
ercise+0.003
corn+0.003
Aud+0.003
ograph+0.003
Neg logit contributions
 eyel-0.001
gan-0.001
 Moe-0.001
 toilet-0.001
 spine-0.001
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
 eyel+0.075
gan+0.072
sing+0.070
 Moe+0.069
 spine+0.069
Neg logit contributions
Offic-0.081
 permitting-0.065
ercise-0.065
corn-0.064
 whip-0.063
Token They
Token ID1119
Feature activation-1.63e+00

Pos logit contributions
adow+0.043
oppy+0.042
 eyel+0.041
ulu+0.040
gan+0.040
Neg logit contributions
Offic-0.031
ercise-0.024
Can-0.024
Together-0.023
Thank-0.023
Token pocket
Token ID10000
Feature activation+2.55e+00

Pos logit contributions
gan+0.096
 Moe+0.095
 eyel+0.095
ola+0.089
ahl+0.089
Neg logit contributions
Offic-0.136
 whip-0.121
ercise-0.115
oc-0.115
corn-0.114
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
 eyel+0.038
 spine+0.037
gan+0.036
sing+0.035
adow+0.035
Neg logit contributions
Offic-0.090
ercise-0.077
Mic-0.075
corn-0.075
 permitting-0.075
Token The
Token ID383
Feature activation-1.84e+00

Pos logit contributions
 eyel+0.029
sing+0.027
 spine+0.027
adow+0.026
gan+0.026
Neg logit contributions
Offic-0.036
Sir-0.028
Can-0.028
"-0.028
corn-0.028
Token coin
Token ID10752
Feature activation-2.29e+00

Pos logit contributions
Offic+0.044
ercise+0.034
 permitting+0.034
 whip+0.033
corn+0.033
Neg logit contributions
 eyel-0.048
gan-0.046
 spine-0.045
sing-0.045
 Moe-0.045
Token fell
Token ID3214
Feature activation+1.07e+00

Pos logit contributions
Offic+0.064
corn+0.053
 whip+0.053
ercise+0.052
 permitting+0.051
Neg logit contributions
 eyel-0.048
 spine-0.046
gan-0.045
 Moe-0.044
sing-0.044
Token out
Token ID503
Feature activation+3.91e+00

Pos logit contributions
 eyel+0.028
gan+0.027
 Moe+0.027
oppy+0.027
adow+0.027
Neg logit contributions
Offic-0.024
 whip-0.019
 permitting-0.019
 allowing-0.019
"-0.018
Token of
Token ID286
Feature activation+1.48e-01

Pos logit contributions
 eyel+0.102
gan+0.098
sing+0.097
 spine+0.097
 Moe+0.095
Neg logit contributions
Offic-0.094
 permitting-0.072
ercise-0.072
corn-0.071
Mic-0.070
Token his
Token ID465
Feature activation+5.98e+00

Pos logit contributions
 eyel+0.003
gan+0.003
sing+0.003
 spine+0.003
 Moe+0.003
Neg logit contributions
Offic-0.004
ercise-0.003
corn-0.003
Mic-0.003
 permitting-0.003
Token mouth
Token ID5422
Feature activation+2.88e+00

Pos logit contributions
 Moe+0.136
gan+0.135
 eyel+0.130
ola+0.127
ahl+0.126
Neg logit contributions
Offic-0.141
 whip-0.133
 string-0.122
 exchange-0.120
 stamp-0.118
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 eyel+0.055
gan+0.052
 spine+0.051
sing+0.051
 Moe+0.049
Neg logit contributions
Offic-0.090
ercise-0.075
corn-0.073
 permitting-0.073
Mic-0.073
Token into
Token ID656
Feature activation+8.61e-01

Pos logit contributions
Offic+0.035
ercise+0.028
 permitting+0.028
corn+0.028
Mic+0.027
Neg logit contributions
 eyel-0.029
gan-0.028
 spine-0.027
sing-0.027
 Moe-0.027
Token They
Token ID1119
Feature activation-1.87e+00

Pos logit contributions
 eyel+0.028
gan+0.026
sing+0.026
 spine+0.026
 Moe+0.025
Neg logit contributions
Offic-0.034
ercise-0.028
 permitting-0.027
corn-0.027
Mic-0.027
Token escaped
Token ID13537
Feature activation-1.22e-01

Pos logit contributions
Offic+0.049
ercise+0.039
corn+0.039
 permitting+0.038
Mic+0.038
Neg logit contributions
 eyel-0.044
gan-0.042
sing-0.041
 Moe-0.041
 spine-0.041
Token the
Token ID262
Feature activation+2.39e+00

Pos logit contributions
Offic+0.003
 allowing+0.002
 permitting+0.002
corn+0.002
Sir+0.002
Neg logit contributions
 eyel-0.003
oppy-0.003
gan-0.003
 spine-0.003
sing-0.003
Token hunter
Token ID19177
Feature activation-8.19e-01

Pos logit contributions
 eyel+0.053
 spine+0.051
sing+0.050
gan+0.049
 Moe+0.049
Neg logit contributions
Offic-0.066
ercise-0.054
 permitting-0.052
corn-0.052
Mic-0.051
Token's
Token ID338
Feature activation+3.14e+00

Pos logit contributions
Offic+0.020
ercise+0.015
 permitting+0.015
corn+0.015
Mic+0.015
Neg logit contributions
 eyel-0.021
gan-0.020
sing-0.020
 spine-0.020
 Moe-0.020
Token view
Token ID1570
Feature activation+2.16e-01

Pos logit contributions
gan+0.074
 Mil+0.069
 eyel+0.068
 Moe+0.067
 Maur+0.066
Neg logit contributions
Offic-0.082
 whip-0.067
oc-0.066
Mic-0.064
 permitting-0.064
Token,
Token ID11
Feature activation+9.85e-02

Pos logit contributions
 eyel+0.005
gan+0.005
 spine+0.005
sing+0.005
 Moe+0.005
Neg logit contributions
Offic-0.006
ercise-0.005
Mic-0.004
 permitting-0.004
corn-0.004
Token and
Token ID290
Feature activation-2.08e+00

Pos logit contributions
 eyel+0.002
sing+0.002
gan+0.002
 spine+0.002
 Moe+0.002
Neg logit contributions
Offic-0.003
ercise-0.002
Mic-0.002
corn-0.002
Sir-0.002
Token were
Token ID547
Feature activation+1.25e+00

Pos logit contributions
Offic+0.052
ercise+0.040
 permitting+0.040
corn+0.040
Mic+0.039
Neg logit contributions
 eyel-0.053
gan-0.050
 spine-0.050
sing-0.049
 Moe-0.048
Token not
Token ID407
Feature activation-1.05e+00

Pos logit contributions
 eyel+0.026
 spine+0.025
gan+0.025
sing+0.025
 Moe+0.024
Neg logit contributions
Offic-0.036
ercise-0.029
 permitting-0.029
corn-0.029
Mic-0.028
Token in
Token ID287
Feature activation-6.54e-01

Pos logit contributions
Offic+0.028
 permitting+0.024
 allowing+0.024
 whip+0.023
 letting+0.023
Neg logit contributions
 eyel-0.023
 spine-0.022
oppy-0.021
sing-0.021
gan-0.021
Token still
Token ID991
Feature activation+1.84e+00

Pos logit contributions
 eyel+0.022
gan+0.021
 spine+0.021
sing+0.021
 Moe+0.021
Neg logit contributions
Offic-0.022
ercise-0.017
 permitting-0.017
 whip-0.017
corn-0.017
Token go
Token ID467
Feature activation+2.71e+00

Pos logit contributions
 eyel+0.048
gan+0.046
 spine+0.046
 Moe+0.045
sing+0.045
Neg logit contributions
Offic-0.043
ercise-0.034
 whip-0.033
 permitting-0.033
corn-0.032
Token swimming
Token ID14899
Feature activation+2.31e+00

Pos logit contributions
 eyel+0.057
gan+0.053
sing+0.052
 spine+0.052
 Moe+0.050
Neg logit contributions
Offic-0.079
ercise-0.065
 permitting-0.065
 whip-0.064
Can-0.064
Token.
Token ID13
Feature activation+1.93e+00

Pos logit contributions
 eyel+0.051
sing+0.048
gan+0.048
 spine+0.048
sed+0.046
Neg logit contributions
Offic-0.064
ercise-0.052
corn-0.051
 permitting-0.051
Can-0.050
Token So
Token ID1406
Feature activation+3.48e-03

Pos logit contributions
 eyel+0.035
gan+0.035
 Moe+0.033
 spine+0.032
sing+0.032
Neg logit contributions
Offic-0.061
ercise-0.050
 permitting-0.050
Mic-0.049
Sir-0.049
Token they
Token ID484
Feature activation-6.40e-01

Pos logit contributions
 eyel+0.000
gan+0.000
sing+0.000
 spine+0.000
 Moe+0.000
Neg logit contributions
Offic-0.000
ercise-0.000
 permitting-0.000
corn-0.000
Mic-0.000
Token went
Token ID1816
Feature activation+2.73e+00

Pos logit contributions
Offic+0.017
ercise+0.014
 permitting+0.014
corn+0.014
Mic+0.013
Neg logit contributions
 eyel-0.015
gan-0.014
sing-0.014
 spine-0.014
 Moe-0.014
Token back
Token ID736
Feature activation+4.92e+00

Pos logit contributions
 eyel+0.060
gan+0.056
sing+0.056
 spine+0.055
sed+0.053
Neg logit contributions
Offic-0.077
 permitting-0.063
ercise-0.063
"-0.061
 whip-0.061
Token to
Token ID284
Feature activation+2.01e+00

Pos logit contributions
 eyel+0.139
sing+0.130
 spine+0.127
gan+0.127
sed+0.122
Neg logit contributions
Offic-0.110
ercise-0.085
"-0.085
 permitting-0.084
corn-0.084
Token the
Token ID262
Feature activation+3.49e+00

Pos logit contributions
 eyel+0.048
gan+0.046
sing+0.045
 spine+0.045
 Moe+0.045
Neg logit contributions
Offic-0.053
ercise-0.042
corn-0.041
 permitting-0.041
Mic-0.041
Token pool
Token ID5933
Feature activation+1.37e+00

Pos logit contributions
 eyel+0.074
gan+0.071
sing+0.070
 spine+0.070
 Moe+0.069
Neg logit contributions
Offic-0.101
ercise-0.082
 permitting-0.081
corn-0.081
Mic-0.080
Token ask
Token ID1265
Feature activation+9.04e-01

Pos logit contributions
Offic+0.002
corn+0.002
Sir+0.002
Mic+0.002
ercise+0.002
Neg logit contributions
 eyel-0.002
 Moe-0.002
gan-0.002
sing-0.002
oppy-0.002
Token Sam
Token ID3409
Feature activation+9.24e-01

Pos logit contributions
 eyel+0.018
gan+0.017
 Moe+0.017
sing+0.016
 spine+0.016
Neg logit contributions
Offic-0.027
ercise-0.023
corn-0.022
Mic-0.022
 permitting-0.022
Token if
Token ID611
Feature activation-1.29e+00

Pos logit contributions
 eyel+0.026
 spine+0.025
sing+0.025
gan+0.024
sed+0.024
Neg logit contributions
Offic-0.020
ercise-0.016
"-0.015
 permitting-0.015
corn-0.015
Token he
Token ID339
Feature activation+1.78e+00

Pos logit contributions
Offic+0.034
ercise+0.027
 permitting+0.027
corn+0.026
Sir+0.026
Neg logit contributions
 eyel-0.031
 spine-0.030
sing-0.030
gan-0.029
 Moe-0.028
Token would
Token ID561
Feature activation+9.14e-01

Pos logit contributions
 spine+0.045
 eyel+0.045
sing+0.043
gan+0.042
 Moe+0.041
Neg logit contributions
Offic-0.043
ercise-0.035
Can-0.034
corn-0.034
"-0.033
Token like
Token ID588
Feature activation-7.29e-01

Pos logit contributions
 eyel+0.023
gan+0.022
sing+0.022
 spine+0.022
 Moe+0.021
Neg logit contributions
Offic-0.023
ercise-0.018
corn-0.018
Mic-0.017
 permitting-0.017
Token to
Token ID284
Feature activation+4.08e-01

Pos logit contributions
Offic+0.021
Mic+0.017
Aud+0.016
Together+0.016
ercise+0.016
Neg logit contributions
 eyel-0.016
 toilet-0.016
 Moe-0.015
 Flor-0.015
 O-0.015
Token play
Token ID711
Feature activation+8.64e-01

Pos logit contributions
 eyel+0.010
 Moe+0.009
sing+0.009
gan+0.009
oppy+0.009
Neg logit contributions
Offic-0.011
corn-0.009
 permitting-0.009
ercise-0.009
Sir-0.009
Token with
Token ID351
Feature activation+6.58e-01

Pos logit contributions
 eyel+0.018
 Moe+0.018
tub+0.017
 toilet+0.017
sing+0.017
Neg logit contributions
Offic-0.025
Mic-0.020
 permitting-0.020
ercise-0.020
Sir-0.019
Token it
Token ID340
Feature activation+3.96e+00

Pos logit contributions
 eyel+0.017
 Moe+0.017
gan+0.016
sing+0.016
 Mil+0.016
Neg logit contributions
Offic-0.016
ercise-0.012
 permitting-0.012
Mic-0.012
corn-0.012
Token.
Token ID13
Feature activation+1.83e+00

Pos logit contributions
 eyel+0.073
gan+0.069
sing+0.068
 spine+0.068
 Moe+0.067
Neg logit contributions
Offic-0.125
ercise-0.103
 permitting-0.102
corn-0.102
Mic-0.101
Token It
Token ID632
Feature activation-3.06e-01

Pos logit contributions
 eyel+0.014
gan+0.014
 Moe+0.013
 Mil+0.013
 Flor+0.013
Neg logit contributions
Offic-0.021
ercise-0.018
 permitting-0.017
corn-0.017
Mic-0.017
Token is
Token ID318
Feature activation+2.23e+00

Pos logit contributions
Offic+0.009
ercise+0.007
corn+0.007
 whip+0.007
 permitting+0.007
Neg logit contributions
 eyel-0.006
 spine-0.006
sing-0.006
gan-0.006
 Moe-0.006
Token so
Token ID523
Feature activation+1.26e+00

Pos logit contributions
 eyel+0.048
gan+0.046
sing+0.045
 spine+0.045
 Moe+0.044
Neg logit contributions
Offic-0.063
ercise-0.051
 permitting-0.051
corn-0.050
Mic-0.050
Token pretty
Token ID2495
Feature activation-2.87e+00

Pos logit contributions
 spine+0.033
 eyel+0.033
sing+0.033
gan+0.032
 Moe+0.031
Neg logit contributions
Offic-0.028
corn-0.022
ercise-0.021
 permitting-0.021
"-0.021
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
Offic+0.085
Mic+0.068
ercise+0.067
 permitting+0.066
Sir+0.065
Neg logit contributions
 eyel-0.062
gan-0.060
 Moe-0.058
sing-0.056
 spine-0.056
Token Please
Token ID4222
Feature activation+3.02e-01

Pos logit contributions
gan+0.027
 eyel+0.027
 Flor+0.027
 Mil+0.027
 Moe+0.027
Neg logit contributions
Offic-0.032
ercise-0.027
 permitting-0.026
 whip-0.026
corn-0.026
Token let
Token ID1309
Feature activation-1.79e+00

Pos logit contributions
 eyel+0.007
gan+0.006
oppy+0.006
 Moe+0.006
 Flor+0.006
Neg logit contributions
Offic-0.009
Mic-0.007
ercise-0.007
Sir-0.007
 permitting-0.007
Token me
Token ID502
Feature activation+3.98e+00

Pos logit contributions
Offic+0.052
 permitting+0.041
Mic+0.040
 allowing+0.040
ercise+0.039
Neg logit contributions
 eyel-0.038
gan-0.038
 Moe-0.037
oppy-0.036
 Maur-0.035
Token play
Token ID711
Feature activation-3.16e-01

Pos logit contributions
 eyel+0.094
gan+0.091
oppy+0.087
adow+0.086
sing+0.086
Neg logit contributions
Offic-0.111
Mic-0.087
corn-0.087
Sir-0.085
ercise-0.084
Token with
Token ID351
Feature activation-3.61e-01

Pos logit contributions
Offic+0.009
Mic+0.008
ercise+0.008
 permitting+0.007
Sir+0.007
Neg logit contributions
 Flor-0.006
 Moe-0.006
 eyel-0.006
gan-0.006
 toilet-0.006
Token it
Token ID340
Feature activation+3.20e+00

Pos logit contributions
Offic+0.010
Mic+0.008
ercise+0.008
 permitting+0.008
oc+0.007
Neg logit contributions
gan-0.008
 eyel-0.008
 Flor-0.008
 Moe-0.008
 Mil-0.008
Token past
Token ID1613
Feature activation-4.94e-01

Pos logit contributions
Offic+0.001
ercise+0.001
 permitting+0.001
corn+0.001
Mic+0.001
Neg logit contributions
 eyel-0.001
gan-0.001
sing-0.001
 spine-0.001
 Moe-0.001
Token him
Token ID683
Feature activation+5.22e+00

Pos logit contributions
Offic+0.013
corn+0.010
ercise+0.010
 permitting+0.010
Sir+0.010
Neg logit contributions
 eyel-0.012
 spine-0.011
sing-0.011
gan-0.011
oppy-0.011
Token and
Token ID290
Feature activation-2.00e+00

Pos logit contributions
 eyel+0.115
gan+0.110
 Moe+0.107
sing+0.107
 spine+0.106
Neg logit contributions
Offic-0.143
ercise-0.116
 permitting-0.115
corn-0.114
 whip-0.113
Token won
Token ID1839
Feature activation+5.97e-01

Pos logit contributions
Offic+0.054
ercise+0.044
Mic+0.044
corn+0.043
 permitting+0.042
Neg logit contributions
 eyel-0.044
gan-0.043
 Moe-0.042
sing-0.041
 spine-0.041
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
 eyel+0.016
 spine+0.015
gan+0.015
sing+0.015
 Moe+0.015
Neg logit contributions
Offic-0.014
 permitting-0.011
ercise-0.011
corn-0.011
Mic-0.010
Token race
Token ID3234
Feature activation-1.26e+00

Pos logit contributions
 eyel+0.043
gan+0.041
sing+0.041
 spine+0.041
 Moe+0.040
Neg logit contributions
Offic-0.035
ercise-0.027
 permitting-0.026
corn-0.026
Mic-0.026
Token.
Token ID13
Feature activation+1.76e+00

Pos logit contributions
Offic+0.039
ercise+0.032
Mic+0.032
corn+0.032
 permitting+0.031
Neg logit contributions
 eyel-0.023
gan-0.023
 Moe-0.022
sing-0.022
 spine-0.022
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
 eyel+0.038
sing+0.036
 spine+0.035
gan+0.035
adow+0.033
Neg logit contributions
Offic-0.051
corn-0.040
Mic-0.040
ercise-0.040
 permitting-0.040
Token
Token ID198
Feature activation-2.11e+00

Pos logit contributions
Offic+0.048
ercise+0.042
 permitting+0.040
corn+0.040
 whip+0.039
Neg logit contributions
 eyel-0.030
 Moe-0.030
gan-0.029
 Mil-0.028
oppy-0.028
TokenMax
Token ID11518
Feature activation+5.22e-01

Pos logit contributions
Offic+0.041
ercise+0.037
 whip+0.034
 newspapers+0.034
 stamps+0.033
Neg logit contributions
 Moe-0.055
gan-0.053
ulu-0.052
ola-0.052
 Mil-0.051
Token's
Token ID338
Feature activation+8.58e-01

Pos logit contributions
 eyel+0.011
 spine+0.011
sing+0.011
gan+0.010
 Moe+0.010
Neg logit contributions
Offic-0.015
corn-0.012
ercise-0.012
"-0.012
 permitting-0.012
Token sorry
Token ID7926
Feature activation-1.21e+00

Pos logit contributions
 spine+0.026
 eyel+0.025
 Moe+0.024
oppy+0.024
ahl+0.024
Neg logit contributions
Offic-0.023
 permitting-0.021
 allowing-0.020
"-0.020
 letting-0.019
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
"+0.021
Offic+0.019
 allowing+0.018
Can+0.018
 permitting+0.017
Neg logit contributions
gan-0.035
 spine-0.034
 eyel-0.034
sed-0.034
 Maur-0.033
Token They
Token ID1119
Feature activation-2.69e+00

Pos logit contributions
 eyel+0.004
 spine+0.004
sing+0.004
oppy+0.004
gan+0.004
Neg logit contributions
Offic-0.004
Can-0.003
Sir-0.003
"-0.003
corn-0.003
Token say
Token ID910
Feature activation-1.04e+00

Pos logit contributions
Offic+0.061
 whip+0.048
ercise+0.047
 permitting+0.045
Mic+0.045
Neg logit contributions
gan-0.071
 eyel-0.071
 spine-0.070
tub-0.070
sing-0.070
Token:
Token ID25
Feature activation+9.47e-02

Pos logit contributions
Offic+0.030
"+0.026
ercise+0.025
Sir+0.025
 permitting+0.025
Neg logit contributions
 spine-0.022
 eyel-0.022
sing-0.021
gan-0.020
adow-0.020
Token "
Token ID366
Feature activation-6.46e+00

Pos logit contributions
 eyel+0.002
gan+0.002
sing+0.002
 spine+0.002
 Moe+0.002
Neg logit contributions
Offic-0.002
ercise-0.002
 permitting-0.002
corn-0.002
Mic-0.002
TokenWe
Token ID1135
Feature activation-1.57e+00

Pos logit contributions
Offic+0.128
ercise+0.107
 permitting+0.103
 allowing+0.098
corn+0.096
Neg logit contributions
gan-0.182
 Moe-0.175
 eyel-0.175
 Mil-0.173
 Flor-0.170
Token are
Token ID389
Feature activation+1.06e+00

Pos logit contributions
Offic+0.042
ercise+0.035
 permitting+0.034
 whip+0.034
Can+0.033
Neg logit contributions
 eyel-0.036
 spine-0.035
sing-0.034
gan-0.034
 Moe-0.033
Token sorry
Token ID7926
Feature activation-1.59e+00

Pos logit contributions
 spine+0.022
 eyel+0.022
tub+0.020
adow+0.020
gan+0.020
Neg logit contributions
Offic-0.030
 permitting-0.026
 whip-0.026
 allowing-0.025
Mic-0.025
Token,
Token ID11
Feature activation-3.15e+00

Pos logit contributions
Offic+0.036
"+0.035
 permitting+0.030
corn+0.030
Can+0.030
Neg logit contributions
 spine-0.040
 eyel-0.039
sing-0.037
adow-0.036
gan-0.036
Token sir
Token ID15967
Feature activation-2.63e+00

Pos logit contributions
Offic+0.098
ercise+0.089
corn+0.087
Mic+0.085
Aud+0.081
Neg logit contributions
gan-0.056
sing-0.054
 Mil-0.052
 eyel-0.052
 spine-0.051
Token Bread
Token ID28731
Feature activation-1.11e+00

Pos logit contributions
 eyel+0.053
gan+0.052
sing+0.051
 Moe+0.050
 spine+0.049
Neg logit contributions
Offic-0.096
 permitting-0.079
Mic-0.079
ercise-0.078
 allowing-0.076
Token,"
Token ID553
Feature activation-2.64e+00

Pos logit contributions
Offic+0.035
 permitting+0.028
Mic+0.028
ercise+0.027
Sir+0.027
Neg logit contributions
 eyel-0.020
gan-0.020
 Moe-0.020
sing-0.019
adow-0.019
Token Tom
Token ID4186
Feature activation-6.47e-01

Pos logit contributions
Offic+0.082
ercise+0.070
 permitting+0.069
Mic+0.068
corn+0.067
Neg logit contributions
 eyel-0.048
gan-0.048
 Moe-0.047
 Mil-0.045
sing-0.044
Token says
Token ID1139
Feature activation-1.39e-01

Pos logit contributions
Offic+0.016
ercise+0.012
 permitting+0.012
corn+0.012
Mic+0.012
Neg logit contributions
 eyel-0.017
gan-0.016
sing-0.016
 spine-0.016
 Moe-0.016
Token.
Token ID13
Feature activation-4.85e-01

Pos logit contributions
Offic+0.003
"+0.003
 permitting+0.003
 allowing+0.003
 whip+0.003
Neg logit contributions
 spine-0.003
 eyel-0.003
sing-0.003
 Moe-0.003
gan-0.003
Token "
Token ID366
Feature activation-5.74e+00

Pos logit contributions
Offic+0.015
ercise+0.013
 permitting+0.013
corn+0.013
Mic+0.012
Neg logit contributions
gan-0.009
 eyel-0.009
 Moe-0.008
 Mil-0.008
 Flor-0.008
TokenBecause
Token ID8128
Feature activation-2.78e-01

Pos logit contributions
Offic+0.121
 permitting+0.103
 allowing+0.097
ercise+0.093
corn+0.092
Neg logit contributions
gan-0.154
 Mil-0.148
 Flor-0.143
 Moe-0.142
 Maur-0.141
Token we
Token ID356
Feature activation-1.48e+00

Pos logit contributions
Offic+0.008
Mic+0.007
ercise+0.006
Aud+0.006
Together+0.006
Neg logit contributions
 O-0.006
 Flor-0.006
 Mil-0.006
 Moe-0.006
 eyel-0.006
Token fed
Token ID11672
Feature activation-3.66e+00

Pos logit contributions
Offic+0.046
corn+0.037
 permitting+0.037
Mic+0.036
Sir+0.036
Neg logit contributions
pt-0.027
sing-0.026
adow-0.026
oppy-0.026
gan-0.026
Token it
Token ID340
Feature activation+2.38e+00

Pos logit contributions
Offic+0.105
 permitting+0.082
Mic+0.079
 allowing+0.078
ercise+0.078
Neg logit contributions
 O-0.079
 Flor-0.078
gan-0.078
 eyel-0.077
 Moe-0.077
Token bread
Token ID8509
Feature activation-1.46e+00

Pos logit contributions
 eyel+0.050
gan+0.049
sing+0.048
 Flor+0.047
 Moe+0.046
Neg logit contributions
Offic-0.071
 permitting-0.056
Mic-0.055
ercise-0.054
Sir-0.054
Token her
Token ID607
Feature activation+4.62e+00

Pos logit contributions
 eyel+0.017
gan+0.016
sing+0.016
 spine+0.016
 Moe+0.016
Neg logit contributions
Offic-0.021
ercise-0.018
corn-0.017
Mic-0.017
 permitting-0.017
Token coat
Token ID13209
Feature activation+4.44e-01

Pos logit contributions
 Moe+0.131
ola+0.129
ahl+0.124
ulu+0.122
gan+0.121
Neg logit contributions
 whip-0.086
Offic-0.082
 saddle-0.079
 string-0.075
 ribbon-0.073
Token and
Token ID290
Feature activation-2.54e+00

Pos logit contributions
 eyel+0.011
gan+0.011
 Moe+0.011
sing+0.011
oppy+0.011
Neg logit contributions
Offic-0.011
ercise-0.008
 permitting-0.008
corn-0.008
 whip-0.008
Token hat
Token ID6877
Feature activation-1.60e+00

Pos logit contributions
Offic+0.072
ercise+0.058
Mic+0.057
 permitting+0.057
corn+0.057
Neg logit contributions
 eyel-0.056
gan-0.053
sing-0.052
 spine-0.052
 Moe-0.051
Token.
Token ID13
Feature activation+5.95e-01

Pos logit contributions
Offic+0.042
 whip+0.035
ercise+0.034
 permitting+0.034
corn+0.034
Neg logit contributions
 eyel-0.035
oppy-0.034
 Moe-0.034
gan-0.033
 spine-0.033
Token "
Token ID366
Feature activation-5.72e+00

Pos logit contributions
 eyel+0.013
 spine+0.013
gan+0.013
sing+0.013
 Moe+0.012
Neg logit contributions
Offic-0.016
ercise-0.013
 permitting-0.013
corn-0.013
Can-0.013
TokenI
Token ID40
Feature activation-6.58e-01

Pos logit contributions
Offic+0.115
corn+0.090
Mic+0.088
ercise+0.087
 permitting+0.087
Neg logit contributions
gan-0.167
 eyel-0.163
 Mil-0.160
 Moe-0.156
sing-0.154
Token'm
Token ID1101
Feature activation+2.38e+00

Pos logit contributions
Offic+0.017
ercise+0.013
 permitting+0.013
corn+0.013
Mic+0.013
Neg logit contributions
 eyel-0.016
gan-0.015
 spine-0.015
sing-0.015
 Moe-0.015
Token cold
Token ID4692
Feature activation+7.27e-01

Pos logit contributions
 eyel+0.053
 spine+0.051
 Moe+0.050
gan+0.050
sing+0.049
Neg logit contributions
Offic-0.064
 permitting-0.054
 allowing-0.052
 whip-0.051
ercise-0.051
Token and
Token ID290
Feature activation-7.18e-01

Pos logit contributions
 eyel+0.016
gan+0.016
sing+0.015
 spine+0.015
 Moe+0.015
Neg logit contributions
Offic-0.020
ercise-0.016
corn-0.016
 permitting-0.016
Mic-0.016
Token hungry
Token ID14720
Feature activation-1.77e+00

Pos logit contributions
Offic+0.024
ercise+0.020
corn+0.020
 permitting+0.020
Mic+0.020
Neg logit contributions
 eyel-0.012
gan-0.011
sing-0.011
 spine-0.011
 Moe-0.010
Token runs
Token ID4539
Feature activation-3.96e-01

Pos logit contributions
Offic+0.007
ercise+0.006
corn+0.006
 permitting+0.006
 whip+0.005
Neg logit contributions
 eyel-0.007
gan-0.007
 spine-0.007
sing-0.007
 Moe-0.007
Token away
Token ID1497
Feature activation+3.59e+00

Pos logit contributions
Offic+0.010
ercise+0.008
 permitting+0.008
corn+0.008
Mic+0.008
Neg logit contributions
 eyel-0.010
gan-0.009
sing-0.009
 spine-0.009
 Moe-0.009
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
 eyel+0.087
oppy+0.085
 spine+0.083
adow+0.083
sing+0.082
Neg logit contributions
Offic-0.085
"-0.073
 allowing-0.069
 permitting-0.068
 letting-0.066
Token
Token ID198
Feature activation-1.74e+00

Pos logit contributions
 eyel+0.031
gan+0.030
sing+0.029
 spine+0.029
 Moe+0.029
Neg logit contributions
Offic-0.041
ercise-0.034
corn-0.033
 permitting-0.033
Mic-0.033
Token
Token ID198
Feature activation-2.20e+00

Pos logit contributions
Offic+0.046
ercise+0.037
 permitting+0.036
corn+0.036
Mic+0.036
Neg logit contributions
 eyel-0.041
gan-0.039
 spine-0.039
sing-0.038
 Moe-0.038
Token"
Token ID1
Feature activation-6.35e+00

Pos logit contributions
 whip+0.043
ercise+0.041
 groom+0.040
Offic+0.040
 permitting+0.039
Neg logit contributions
gan-0.057
ia-0.053
 Moe-0.053
 Mil-0.050
 Megan-0.050
TokenHa
Token ID23303
Feature activation-1.04e+00

Pos logit contributions
Offic+0.137
ercise+0.118
 permitting+0.109
oc+0.108
 newspapers+0.108
Neg logit contributions
 eyel-0.167
gan-0.162
 Moe-0.158
 Mil-0.157
 Flor-0.152
Token ha
Token ID387
Feature activation+6.30e-01

Pos logit contributions
Offic+0.029
ercise+0.024
corn+0.024
 permitting+0.024
 whip+0.023
Neg logit contributions
 eyel-0.022
 spine-0.021
gan-0.021
sing-0.021
 Moe-0.020
Token!"
Token ID2474
Feature activation-9.90e-01

Pos logit contributions
 eyel+0.015
 spine+0.015
sing+0.015
gan+0.015
 Moe+0.015
Neg logit contributions
Offic-0.016
corn-0.012
ercise-0.012
 permitting-0.012
"-0.012
Token Tom
Token ID4186
Feature activation+3.10e-04

Pos logit contributions
Offic+0.024
Sir+0.018
"+0.018
Thank+0.017
Can+0.017
Neg logit contributions
 eyel-0.026
 spine-0.026
oppy-0.026
sing-0.025
adow-0.025
Token laughs
Token ID22051
Feature activation-1.63e+00

Pos logit contributions
 eyel+0.000
 spine+0.000
sing+0.000
gan+0.000
oppy+0.000
Neg logit contributions
Offic-0.000
corn-0.000
 whip-0.000
"-0.000
ercise-0.000
Token said
Token ID531
Feature activation+7.77e-01

Pos logit contributions
Offic+0.065
"+0.051
Sir+0.051
corn+0.050
 whip+0.050
Neg logit contributions
 eyel-0.056
 spine-0.054
sing-0.054
adow-0.052
gan-0.050
Token Mom
Token ID11254
Feature activation-1.27e+00

Pos logit contributions
 spine+0.019
adow+0.019
 eyel+0.018
sing+0.018
adows+0.018
Neg logit contributions
Offic-0.019
 whip-0.015
Sir-0.015
ercise-0.014
"-0.014
Token.
Token ID13
Feature activation-7.73e-01

Pos logit contributions
Offic+0.034
ercise+0.027
corn+0.027
 permitting+0.027
Mic+0.026
Neg logit contributions
 eyel-0.029
sing-0.028
 spine-0.027
 Moe-0.027
gan-0.027
Token And
Token ID843
Feature activation-1.76e+00

Pos logit contributions
Offic+0.021
ercise+0.017
corn+0.016
 permitting+0.016
Mic+0.016
Neg logit contributions
 eyel-0.018
gan-0.018
 spine-0.017
sing-0.017
 Moe-0.017
Token a
Token ID257
Feature activation-4.15e+00

Pos logit contributions
Offic+0.044
 permitting+0.036
 allowing+0.035
ercise+0.034
corn+0.034
Neg logit contributions
 eyel-0.043
 spine-0.042
sing-0.041
gan-0.040
oppy-0.039
Token loud
Token ID7812
Feature activation-4.97e+00

Pos logit contributions
Offic+0.111
ercise+0.090
 permitting+0.087
corn+0.087
Mic+0.086
Neg logit contributions
 eyel-0.096
gan-0.091
sing-0.091
 spine-0.090
 Moe-0.089
Token cheer
Token ID14042
Feature activation-4.76e+00

Pos logit contributions
Offic+0.132
 whip+0.107
corn+0.104
ercise+0.102
 permitting+0.102
Neg logit contributions
gan-0.114
 eyel-0.113
oppy-0.108
 Moe-0.107
 spine-0.107
Token erupted
Token ID23681
Feature activation-4.06e-01

Pos logit contributions
Offic+0.139
 permitting+0.115
ercise+0.114
corn+0.112
 allowing+0.111
Neg logit contributions
 eyel-0.097
gan-0.092
sing-0.091
 spine-0.090
 Moe-0.089
Token from
Token ID422
Feature activation+2.20e+00

Pos logit contributions
Offic+0.010
 permitting+0.008
corn+0.008
 whip+0.008
"+0.008
Neg logit contributions
 eyel-0.010
 spine-0.009
sing-0.009
oppy-0.009
gan-0.009
Token the
Token ID262
Feature activation+2.91e+00

Pos logit contributions
 eyel+0.057
gan+0.055
 spine+0.055
adow+0.054
sing+0.053
Neg logit contributions
Offic-0.053
 permitting-0.041
 whip-0.041
ercise-0.040
corn-0.039
Token family
Token ID1641
Feature activation-9.23e-01

Pos logit contributions
 eyel+0.066
gan+0.066
 Moe+0.062
sing+0.062
oppy+0.062
Neg logit contributions
Offic-0.079
 permitting-0.062
ercise-0.062
 whip-0.062
Sir-0.061