TOP ACTIVATIONS
MAX = 0.473

. The girl mar ve lled at how big the
, and it was pri ve leg ed to be part
was known as the cle ve rest mouse in the whole
said , " The cle ve rest sheep ever !" <|endoftext|>
prints I â ve made ! â Everyone
got to the orange gro ve , Jane and her mom
of the colours and mar ve lled at the way the
! I â ve been staring at it for
she said , " Re ve al !" Tim jumped out
I â ve sent you this squash .
looked it over , mar ve lling at how perfect it
prove I 'm the cle ve rest ! I bet I
stone I â ve ever seen â .
. You â ve done a great job .
I â ve been waiting for somebody to
. I â ve been counting the days until
were a deep , vel ve ty purple .
the village . She mar ve lled at the bright ,
g to be the cle ve rest . But in the
proudly , and everyone mar ve lled at his kindness and

MIN ACTIVATIONS
MIN = -0.369

Fe eling much cal mer , Amy held the
because soon the fish was cal mer and more relaxed .
icker and soon felt much cal mer . Her brother eventually
around him and he felt cal mer . His mom knew
leaves and he felt much cal mer . He hopped up
. The water was much cal mer now . Everyone was
just curious kids . He cal ms down and explains to
, she felt herself getting cal mer and cal mer .
herself getting cal mer and cal mer . She kept taking
happier and the house felt cal mer . Life was so
Dad sigh s and cal ms down . He sees
down . She felt much cal mer now , knowing that
see the sun together , alright ?" So ,
cookie . Anna cal ms down and forg ives
. She stops them and cal ms them down . She
. The kitten cal ms down and pur rs
rh ino eats it and cal ms down .
, there . Everything is alright . If you go back
made the little monkey feel cal mer and sensing a warm
was finished she felt much cal mer . The girl was

INTERVAL 0.262 - 0.473
CONTAINS 0.024%

tissue and rubbed the scrib bles away .
. He saw the new er aser in her hand .
He grabs the pe b bles from Anna 's hand and
sh iver go down his spine . He looked around but
He tug ged and tug ged , but he couldn 't

INTERVAL 0.052 - 0.262
CONTAINS 16.825%

's house and bring her your dictionary . Maybe you can
She felt very lucky to find them . She wanted to
The frog cro aked excited ly and put the bracelet around
, she wanted to call out to them but she was
quickly took out the pe b ble and p atted it

INTERVAL -0.158 - 0.052
CONTAINS 81.150%

finally have some tasty vegetables to buy . They ate them
to be a witch !" Lily said . She put on
go to Lily 's room . Lily gives Ben a pair
of his hat . He went to a park and saw
J enny was happy and decided to write a letter

INTERVAL -0.369 - -0.158
CONTAINS 2.001%

water and ate some more cookies . Lily said , "
told her she needed to hurry to take a bath .
but she felt a little better . " I
on his snacks . He slowly crept over to M omm
troubled . She was thinking about how she wished her life
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
 cal+0.000
 alright+0.000
 swe+0.000
 ok+0.000
 lets+0.000
Neg logit contributions
ve-0.000
rie-0.000
iana-0.000
ged-0.000
makers-0.000
Token
Token ID198
Feature activation-2.76e-03

Pos logit contributions
 cal+0.028
 swe+0.027
 alright+0.026
 ok+0.025
 calmed+0.025
Neg logit contributions
ve-0.032
rie-0.026
iana-0.025
ged-0.023
makers-0.023
TokenThe
Token ID464
Feature activation-1.55e-02

Pos logit contributions
 cal+0.004
 swe+0.004
 alright+0.004
 ok+0.004
 lets+0.004
Neg logit contributions
ve-0.004
rie-0.003
iana-0.003
makers-0.003
ged-0.003
Token girl
Token ID2576
Feature activation+9.55e-02

Pos logit contributions
 cal+0.022
 alright+0.020
 swe+0.020
 ok+0.019
 lets+0.019
Neg logit contributions
ve-0.026
rie-0.020
iana-0.020
makers-0.019
ged-0.018
Token mar
Token ID1667
Feature activation+2.67e-01

Pos logit contributions
ve+0.155
rie+0.119
iana+0.113
ged+0.111
makers+0.102
Neg logit contributions
 cal-0.139
 swe-0.132
 alright-0.131
 ok-0.124
 hurricanes-0.123
Tokenve
Token ID303
Feature activation+4.73e-01

Pos logit contributions
ve+0.226
rie+0.210
 spine+0.208
makers+0.182
 ante+0.174
Neg logit contributions
 cal-0.467
 alright-0.460
 shall-0.458
 calmed-0.451
 ok-0.443
Tokenlled
Token ID3353
Feature activation+1.70e-01

Pos logit contributions
ve+0.816
eg+0.680
inky+0.678
rie+0.676
sed+0.659
Neg logit contributions
 cal-0.575
 calmed-0.550
 alright-0.530
 lets-0.524
 shall-0.508
Token at
Token ID379
Feature activation-3.24e-02

Pos logit contributions
ve+0.306
rie+0.245
iana+0.236
ged+0.232
makers+0.226
Neg logit contributions
 cal-0.212
 swe-0.196
 alright-0.195
 lets-0.190
 ok-0.188
Token how
Token ID703
Feature activation-1.37e-01

Pos logit contributions
 cal+0.042
 alright+0.039
 swe+0.038
 ok+0.037
 lets+0.036
Neg logit contributions
ve-0.057
rie-0.046
iana-0.046
ged-0.044
makers-0.043
Token big
Token ID1263
Feature activation+4.55e-02

Pos logit contributions
 cal+0.188
 alright+0.178
 swe+0.178
 cleans+0.175
 lets+0.175
Neg logit contributions
ve-0.233
iana-0.173
rie-0.168
ged-0.160
chie-0.160
Token the
Token ID262
Feature activation-2.13e-02

Pos logit contributions
ve+0.071
rie+0.058
iana+0.058
ged+0.056
sel+0.052
Neg logit contributions
 cal-0.064
 alright-0.061
 ok-0.058
 swe-0.057
 shall-0.055
Token,
Token ID11
Feature activation+1.21e-03

Pos logit contributions
ve+0.203
rie+0.171
iana+0.162
makers+0.156
eg+0.155
Neg logit contributions
 cal-0.131
 alright-0.125
 swe-0.121
 ok-0.119
 calmed-0.113
Token and
Token ID290
Feature activation-5.33e-03

Pos logit contributions
ve+0.002
rie+0.001
iana+0.001
makers+0.001
ged+0.001
Neg logit contributions
 cal-0.002
 swe-0.002
 alright-0.002
 lets-0.002
 cleans-0.002
Token it
Token ID340
Feature activation-4.33e-02

Pos logit contributions
 cal+0.007
 swe+0.007
 alright+0.007
 ok+0.006
 lets+0.006
Neg logit contributions
ve-0.009
rie-0.007
iana-0.007
makers-0.007
ged-0.007
Token was
Token ID373
Feature activation+5.04e-03

Pos logit contributions
 cal+0.059
 swe+0.054
 alright+0.054
 ok+0.052
 lets+0.051
Neg logit contributions
ve-0.074
rie-0.059
iana-0.057
makers-0.054
ged-0.054
Token pri
Token ID1293
Feature activation-7.46e-02

Pos logit contributions
ve+0.008
rie+0.006
iana+0.006
makers+0.006
ged+0.006
Neg logit contributions
 cal-0.007
 swe-0.006
 alright-0.006
 lets-0.006
 calmed-0.006
Tokenve
Token ID303
Feature activation+4.71e-01

Pos logit contributions
 cal+0.122
 alright+0.119
 ok+0.115
 swe+0.113
 calmed+0.112
Neg logit contributions
ve-0.096
rie-0.075
iana-0.073
 ante-0.068
makers-0.068
Tokenleg
Token ID1455
Feature activation-2.45e-02

Pos logit contributions
ve+0.783
rie+0.620
iana+0.587
makers+0.571
ged+0.566
Neg logit contributions
 cal-0.661
 swe-0.619
 alright-0.611
 ok-0.588
 lets-0.585
Tokened
Token ID276
Feature activation+2.41e-01

Pos logit contributions
 cal+0.043
 swe+0.041
 alright+0.041
 ok+0.039
 lets+0.039
Neg logit contributions
ve-0.033
rie-0.024
iana-0.022
ged-0.022
makers-0.021
Token to
Token ID284
Feature activation-3.45e-02

Pos logit contributions
ve+0.372
rie+0.292
ged+0.273
iana+0.272
makers+0.267
Neg logit contributions
 cal-0.347
 swe-0.323
 alright-0.322
 lets-0.316
 calmed-0.310
Token be
Token ID307
Feature activation+1.26e-01

Pos logit contributions
 cal+0.037
 alright+0.034
 ok+0.032
 swe+0.031
 shall+0.031
Neg logit contributions
ve-0.073
rie-0.061
iana-0.058
makers-0.057
ged-0.057
Token part
Token ID636
Feature activation+3.29e-02

Pos logit contributions
ve+0.202
rie+0.157
ged+0.148
iana+0.148
sel+0.146
Neg logit contributions
 cal-0.172
 swe-0.163
 lets-0.159
 seasons-0.154
 alright-0.154
Token was
Token ID373
Feature activation-9.21e-04

Pos logit contributions
ve+0.093
rie+0.073
iana+0.069
ged+0.066
makers+0.065
Neg logit contributions
 cal-0.085
 swe-0.082
 alright-0.082
 ok-0.078
 flour-0.076
Token known
Token ID1900
Feature activation+4.04e-02

Pos logit contributions
 swe+0.001
 shall+0.001
 cal+0.001
 seasons+0.001
 wipes+0.001
Neg logit contributions
ve-0.001
makers-0.001
ged-0.001
rie-0.001
chie-0.001
Token as
Token ID355
Feature activation+9.42e-02

Pos logit contributions
ve+0.062
rie+0.048
iana+0.046
ged+0.043
makers+0.043
Neg logit contributions
 cal-0.064
 alright-0.060
 swe-0.059
 ok-0.058
 lets-0.056
Token the
Token ID262
Feature activation-2.69e-02

Pos logit contributions
ve+0.149
rie+0.116
iana+0.114
makers+0.107
ged+0.103
Neg logit contributions
 cal-0.129
 swe-0.126
 alright-0.118
 shall-0.117
 calmed-0.117
Token cle
Token ID1190
Feature activation-6.23e-03

Pos logit contributions
 cal+0.034
 swe+0.032
 calmed+0.031
 alright+0.031
 lets+0.030
Neg logit contributions
ve-0.047
rie-0.038
iana-0.036
makers-0.035
sel-0.033
Tokenve
Token ID303
Feature activation+4.70e-01

Pos logit contributions
 cal+0.012
 alright+0.012
 ok+0.012
 okay+0.011
adows+0.011
Neg logit contributions
ve-0.005
rie-0.005
ged-0.004
sed-0.004
ky-0.004
Tokenrest
Token ID2118
Feature activation-1.52e-02

Pos logit contributions
ve+0.857
rie+0.710
iana+0.693
eg+0.669
sed+0.657
Neg logit contributions
 cal-0.553
 alright-0.523
 lets-0.499
 ok-0.496
 calmed-0.487
Token mouse
Token ID10211
Feature activation+6.17e-02

Pos logit contributions
 cal+0.020
 swe+0.018
 alright+0.018
 ok+0.017
 lets+0.017
Neg logit contributions
ve-0.027
rie-0.022
iana-0.021
makers-0.020
ged-0.020
Token in
Token ID287
Feature activation-3.79e-02

Pos logit contributions
ve+0.107
rie+0.086
iana+0.082
makers+0.079
ged+0.079
Neg logit contributions
 cal-0.082
 swe-0.076
 alright-0.075
 ok-0.072
 lets-0.071
Token the
Token ID262
Feature activation-2.59e-02

Pos logit contributions
 cal+0.041
 alright+0.036
 ok+0.035
 swe+0.034
 lets+0.034
Neg logit contributions
ve-0.081
rie-0.067
iana-0.063
ged-0.061
makers-0.061
Token whole
Token ID2187
Feature activation+2.59e-01

Pos logit contributions
 cal+0.033
 swe+0.031
 alright+0.031
 ok+0.029
 lets+0.029
Neg logit contributions
ve-0.046
rie-0.037
iana-0.035
makers-0.034
ged-0.034
Token said
Token ID531
Feature activation+4.71e-02

Pos logit contributions
 cal+0.012
 swe+0.012
 alright+0.012
 ok+0.011
 lets+0.011
Neg logit contributions
ve-0.014
rie-0.011
iana-0.011
ged-0.011
makers-0.010
Token,
Token ID11
Feature activation-1.06e-02

Pos logit contributions
ve+0.069
iana+0.052
makers+0.051
rie+0.051
ged+0.050
Neg logit contributions
 cal-0.071
 alright-0.066
 swe-0.066
 lets-0.065
 seasons-0.064
Token "
Token ID366
Feature activation-1.66e-02

Pos logit contributions
 cal+0.015
 swe+0.014
 alright+0.014
 ok+0.013
 lets+0.013
Neg logit contributions
ve-0.017
rie-0.013
iana-0.013
ged-0.013
makers-0.012
TokenThe
Token ID464
Feature activation-1.67e-02

Pos logit contributions
 cal+0.024
 swe+0.022
 alright+0.022
 ok+0.021
 calmed+0.021
Neg logit contributions
ve-0.027
rie-0.022
iana-0.021
makers-0.020
ged-0.020
Token cle
Token ID1190
Feature activation-8.76e-03

Pos logit contributions
 cal+0.023
 swe+0.021
 alright+0.021
 ok+0.020
 lets+0.020
Neg logit contributions
ve-0.028
rie-0.023
iana-0.022
makers-0.021
ged-0.021
Tokenve
Token ID303
Feature activation+4.69e-01

Pos logit contributions
 cal+0.016
 alright+0.016
 ok+0.016
 shall+0.015
 okay+0.015
Neg logit contributions
rie-0.007
ve-0.007
ged-0.007
ky-0.006
iana-0.006
Tokenrest
Token ID2118
Feature activation-1.35e-02

Pos logit contributions
ve+0.780
rie+0.656
iana+0.620
eg+0.608
ula+0.599
Neg logit contributions
 cal-0.611
 alright-0.582
 lets-0.563
 ok-0.556
 shall-0.555
Token sheep
Token ID15900
Feature activation-8.75e-02

Pos logit contributions
 cal+0.020
 swe+0.019
 alright+0.019
 ok+0.018
 lets+0.018
Neg logit contributions
ve-0.021
rie-0.016
iana-0.016
makers-0.015
ged-0.015
Token ever
Token ID1683
Feature activation+1.36e-01

Pos logit contributions
 cal+0.123
 alright+0.113
 swe+0.113
 ok+0.109
 lets+0.108
Neg logit contributions
ve-0.145
rie-0.116
iana-0.110
makers-0.106
ged-0.106
Token!"
Token ID2474
Feature activation+1.10e-01

Pos logit contributions
ve+0.239
rie+0.201
iana+0.198
ula+0.193
eg+0.190
Neg logit contributions
 cal-0.176
 alright-0.169
 ok-0.160
 swe-0.159
 shall-0.157
Token<|endoftext|>
Token ID50256
Feature activation-4.16e-02

Pos logit contributions
ve+0.206
rie+0.167
iana+0.160
makers+0.155
ged+0.154
Neg logit contributions
 cal-0.135
 swe-0.121
 alright-0.119
 lets-0.117
 calmed-0.116
Token prints
Token ID20842
Feature activation+6.45e-02

Pos logit contributions
 cal+0.187
 swe+0.167
 alright+0.166
 shall+0.160
 ok+0.158
Neg logit contributions
ve-0.372
rie-0.307
ged-0.296
iana-0.291
makers-0.284
Token I
Token ID314
Feature activation-3.11e-02

Pos logit contributions
ve+0.110
rie+0.088
iana+0.084
ged+0.081
makers+0.081
Neg logit contributions
 cal-0.088
 alright-0.081
 swe-0.081
 ok-0.078
 lets-0.077
Tokenâ
Token ID22940
Feature activation+1.13e-02

Pos logit contributions
 cal+0.038
 shall+0.037
 lets+0.036
 alright+0.035
 calmed+0.034
Neg logit contributions
ve-0.054
rie-0.046
ging-0.043
iana-0.043
 spine-0.043
Token
Token ID26391
Feature activation+1.63e-02

Pos logit contributions
ve+0.018
rie+0.014
iana+0.014
ged+0.013
makers+0.013
Neg logit contributions
 cal-0.017
 alright-0.015
 swe-0.015
 ok-0.015
 lets-0.015
Token
Token ID8151
Feature activation-1.99e-01

Pos logit contributions
ve+0.027
rie+0.021
iana+0.020
makers+0.019
ged+0.019
Neg logit contributions
 cal-0.024
 swe-0.023
 alright-0.022
 ok-0.021
 calmed-0.021
Tokenve
Token ID303
Feature activation+4.69e-01

Pos logit contributions
 alright+0.273
 ok+0.268
 cal+0.265
 okay+0.256
 shall+0.256
Neg logit contributions
ve-0.289
rie-0.256
iana-0.236
 ante-0.227
lves-0.224
Token made
Token ID925
Feature activation+4.09e-02

Pos logit contributions
ve+0.815
rie+0.705
 spine+0.688
ula+0.679
inky+0.672
Neg logit contributions
 cal-0.551
 shall-0.535
 calmed-0.517
 alright-0.517
 lets-0.517
Token!
Token ID0
Feature activation+1.33e-01

Pos logit contributions
ve+0.068
rie+0.056
iana+0.053
ged+0.051
makers+0.050
Neg logit contributions
 cal-0.057
 alright-0.054
 swe-0.052
 ok-0.052
 shall-0.051
Tokenâ
Token ID22940
Feature activation+1.14e-02

Pos logit contributions
ve+0.188
rie+0.146
makers+0.135
iana+0.135
oys+0.132
Neg logit contributions
 cal-0.213
 swe-0.206
 alright-0.196
 calmed-0.192
 ok-0.185
Token
Token ID26391
Feature activation+1.61e-02

Pos logit contributions
ve+0.019
rie+0.015
iana+0.015
ged+0.014
makers+0.014
Neg logit contributions
 cal-0.016
 alright-0.015
 swe-0.014
 ok-0.014
 lets-0.014
Token Everyone
Token ID11075
Feature activation+4.42e-02

Pos logit contributions
ve+0.026
rie+0.020
iana+0.019
makers+0.018
ged+0.018
Neg logit contributions
 cal-0.024
 swe-0.023
 alright-0.022
 ok-0.021
 calmed-0.021
Token got
Token ID1392
Feature activation+1.01e-01

Pos logit contributions
ve+0.025
rie+0.020
iana+0.019
makers+0.019
ula+0.019
Neg logit contributions
 cal-0.017
 calmed-0.016
 shall-0.015
 lets-0.015
 alright-0.015
Token to
Token ID284
Feature activation-3.18e-02

Pos logit contributions
ve+0.120
iana+0.079
ged+0.072
rie+0.068
makers+0.066
Neg logit contributions
 cal-0.191
 lets-0.187
 alright-0.181
 seasons-0.181
 swe-0.175
Token the
Token ID262
Feature activation-1.81e-02

Pos logit contributions
 cal+0.030
 swe+0.027
 pic+0.025
 lets+0.025
 alright+0.024
Neg logit contributions
ve-0.073
rie-0.061
ged-0.057
makers-0.057
iana-0.056
Token orange
Token ID10912
Feature activation-7.85e-03

Pos logit contributions
 cal+0.019
 swe+0.018
 flour+0.016
 snow+0.016
 shall+0.016
Neg logit contributions
ve-0.040
ged-0.033
rie-0.032
iana-0.031
makers-0.030
Token gro
Token ID7128
Feature activation-1.28e-01

Pos logit contributions
 cal+0.008
 shall+0.008
 swe+0.007
 seasons+0.007
 flour+0.007
Neg logit contributions
ve-0.016
rie-0.013
makers-0.013
iana-0.012
ged-0.012
Tokenve
Token ID303
Feature activation+4.69e-01

Pos logit contributions
 cal+0.159
 alright+0.150
 ok+0.147
 swe+0.142
 exercises+0.136
Neg logit contributions
ve-0.199
ged-0.175
rie-0.171
iana-0.166
 pupil-0.165
Token,
Token ID11
Feature activation+3.96e-04

Pos logit contributions
ve+0.819
rie+0.659
iana+0.630
makers+0.607
ged+0.604
Neg logit contributions
 cal-0.614
 alright-0.569
 swe-0.562
 lets-0.541
 ok-0.540
Token Jane
Token ID12091
Feature activation-9.16e-02

Pos logit contributions
ve+0.001
iana+0.001
rie+0.001
makers+0.000
ged+0.000
Neg logit contributions
 cal-0.001
 swe-0.000
 alright-0.000
 cleans-0.000
 lets-0.000
Token and
Token ID290
Feature activation-8.41e-03

Pos logit contributions
 cal+0.124
 alright+0.117
 swe+0.116
 ok+0.112
 lets+0.106
Neg logit contributions
ve-0.159
rie-0.127
iana-0.122
ged-0.117
sel-0.113
Token her
Token ID607
Feature activation+4.20e-02

Pos logit contributions
 cal+0.013
 alright+0.012
 swe+0.012
 ok+0.011
 lets+0.011
Neg logit contributions
ve-0.013
rie-0.010
iana-0.010
ged-0.009
makers-0.009
Token mom
Token ID1995
Feature activation-5.00e-02

Pos logit contributions
ve+0.069
rie+0.054
iana+0.053
sed+0.050
eg+0.049
Neg logit contributions
 cal-0.061
 alright-0.057
 lets-0.055
 ok-0.053
 calmed-0.052
Token of
Token ID286
Feature activation-6.75e-02

Pos logit contributions
ve+0.096
rie+0.077
ged+0.074
iana+0.073
makers+0.073
Neg logit contributions
 cal-0.065
 alright-0.061
 swe-0.060
 lets-0.058
 ok-0.058
Token the
Token ID262
Feature activation-1.40e-02

Pos logit contributions
 cal+0.094
 alright+0.089
 swe+0.085
 lets+0.085
 ok+0.084
Neg logit contributions
ve-0.105
rie-0.084
iana-0.082
makers-0.081
ged-0.079
Token colours
Token ID18915
Feature activation+3.81e-03

Pos logit contributions
 cal+0.016
 lets+0.015
 alright+0.015
 ok+0.014
 swe+0.014
Neg logit contributions
ve-0.026
rie-0.021
makers-0.021
iana-0.020
ged-0.019
Token and
Token ID290
Feature activation-7.89e-03

Pos logit contributions
ve+0.006
rie+0.005
ged+0.004
sel+0.004
iana+0.004
Neg logit contributions
 cal-0.005
 alright-0.005
 swe-0.005
 lets-0.005
 ok-0.005
Token mar
Token ID1667
Feature activation+2.61e-01

Pos logit contributions
 cal+0.011
 alright+0.010
 ok+0.010
 lets+0.010
 swe+0.010
Neg logit contributions
ve-0.012
rie-0.010
iana-0.009
makers-0.009
ged-0.009
Tokenve
Token ID303
Feature activation+4.68e-01

Pos logit contributions
ve+0.391
rie+0.310
iana+0.293
ged+0.280
makers+0.280
Neg logit contributions
 cal-0.399
 swe-0.372
 alright-0.371
 ok-0.356
 lets-0.353
Tokenlled
Token ID3353
Feature activation+1.67e-01

Pos logit contributions
ve+0.696
rie+0.542
iana+0.525
eg+0.494
ula+0.479
Neg logit contributions
 cal-0.728
 alright-0.688
 swe-0.672
 calmed-0.666
 ok-0.658
Token at
Token ID379
Feature activation-2.98e-02

Pos logit contributions
ve+0.266
rie+0.204
ged+0.199
makers+0.193
iana+0.192
Neg logit contributions
 cal-0.227
 lets-0.218
 cleans-0.214
 alright-0.214
 ok-0.210
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
 cal+0.045
 alright+0.041
 swe+0.041
 lets+0.040
 shall+0.039
Neg logit contributions
ve-0.042
iana-0.035
rie-0.033
ged-0.033
makers-0.031
Token way
Token ID835
Feature activation-2.05e-02

Pos logit contributions
 cal+0.016
 lets+0.015
 alright+0.015
 swe+0.014
 ok+0.014
Neg logit contributions
ve-0.025
rie-0.020
makers-0.020
iana-0.019
oys-0.017
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
 cal+0.028
 swe+0.025
 alright+0.025
 lets+0.025
 ok+0.024
Neg logit contributions
ve-0.035
rie-0.028
iana-0.027
ged-0.026
makers-0.026
Token!
Token ID0
Feature activation+1.31e-01

Pos logit contributions
 cal+0.012
 calmed+0.012
 nods+0.011
 wipes+0.011
 cleans+0.011
Neg logit contributions
ve-0.011
rie-0.008
iana-0.008
makers-0.008
 ante-0.007
Token I
Token ID314
Feature activation-3.23e-02

Pos logit contributions
ve+0.138
rie+0.124
da+0.110
oys+0.107
iana+0.100
Neg logit contributions
 swe-0.228
 cal-0.223
 pic-0.212
 calmed-0.210
 alright-0.208
Tokenâ
Token ID22940
Feature activation+9.75e-03

Pos logit contributions
 cal+0.044
 alright+0.041
 swe+0.040
 ok+0.039
 lets+0.039
Neg logit contributions
ve-0.055
rie-0.045
iana-0.043
ged-0.041
makers-0.041
Token
Token ID26391
Feature activation+1.53e-02

Pos logit contributions
ve+0.015
rie+0.012
iana+0.012
makers+0.011
ged+0.011
Neg logit contributions
 cal-0.015
 swe-0.014
 alright-0.014
 ok-0.013
 lets-0.013
Token
Token ID8151
Feature activation-2.00e-01

Pos logit contributions
ve+0.025
rie+0.018
makers+0.018
iana+0.017
eg+0.016
Neg logit contributions
 swe-0.023
 cal-0.023
 alright-0.021
 calmed-0.020
 ok-0.020
Tokenve
Token ID303
Feature activation+4.68e-01

Pos logit contributions
 alright+0.244
 ok+0.239
 cal+0.234
 shall+0.230
 okay+0.230
Neg logit contributions
ve-0.319
rie-0.286
iana-0.267
 ante-0.260
ged-0.258
Token been
Token ID587
Feature activation+1.09e-01

Pos logit contributions
ve+0.840
rie+0.693
iana+0.672
ula+0.644
eg+0.635
Neg logit contributions
 cal-0.589
 alright-0.534
 lets-0.530
 calmed-0.525
 ok-0.509
Token staring
Token ID16143
Feature activation-3.26e-03

Pos logit contributions
ve+0.182
rie+0.140
iana+0.134
makers+0.131
ged+0.129
Neg logit contributions
 cal-0.153
 swe-0.144
 alright-0.140
 lets-0.136
 calmed-0.136
Token at
Token ID379
Feature activation-3.06e-02

Pos logit contributions
 cal+0.004
 swe+0.004
 alright+0.004
 calmed+0.004
 lets+0.004
Neg logit contributions
ve-0.006
rie-0.004
iana-0.004
ged-0.004
makers-0.004
Token it
Token ID340
Feature activation-3.73e-02

Pos logit contributions
 cal+0.041
 swe+0.037
 alright+0.037
 ok+0.036
 lets+0.035
Neg logit contributions
ve-0.053
rie-0.043
iana-0.041
makers-0.040
ged-0.039
Token for
Token ID329
Feature activation+2.43e-01

Pos logit contributions
 cal+0.053
 swe+0.048
 alright+0.048
 calmed+0.046
 ok+0.046
Neg logit contributions
ve-0.062
rie-0.049
iana-0.047
makers-0.045
ged-0.044
Token she
Token ID673
Feature activation-3.36e-03

Pos logit contributions
ve+0.007
iana+0.005
ged+0.005
rie+0.005
makers+0.004
Neg logit contributions
 cal-0.009
 swe-0.008
 alright-0.008
 ok-0.008
 lets-0.008
Token said
Token ID531
Feature activation+5.75e-02

Pos logit contributions
 cal+0.004
 swe+0.004
 alright+0.004
 ok+0.004
 lets+0.004
Neg logit contributions
ve-0.006
rie-0.005
iana-0.005
makers-0.004
ged-0.004
Token,
Token ID11
Feature activation+5.36e-03

Pos logit contributions
ve+0.078
ged+0.058
iana+0.057
makers+0.054
rie+0.052
Neg logit contributions
 cal-0.099
 alright-0.090
 swe-0.089
 seasons-0.088
 pic-0.086
Token "
Token ID366
Feature activation-3.72e-04

Pos logit contributions
ve+0.008
rie+0.006
ged+0.006
iana+0.006
makers+0.006
Neg logit contributions
 cal-0.009
 swe-0.008
 alright-0.008
 pic-0.008
 seasons-0.007
TokenRe
Token ID3041
Feature activation+1.05e-01

Pos logit contributions
 cal+0.000
 alright+0.000
 shall+0.000
 swe+0.000
 lets+0.000
Neg logit contributions
ve-0.001
rie-0.001
iana-0.001
eg-0.001
ula-0.001
Tokenve
Token ID303
Feature activation+4.67e-01

Pos logit contributions
ve+0.185
rie+0.143
iana+0.139
makers+0.134
ged+0.129
Neg logit contributions
 cal-0.145
 swe-0.135
 alright-0.134
 lets-0.129
 ok-0.129
Tokenal
Token ID282
Feature activation+1.25e-01

Pos logit contributions
ve+0.859
rie+0.724
 spine+0.693
ged+0.684
ula+0.684
Neg logit contributions
 cal-0.530
 lets-0.488
 alright-0.485
 shall-0.467
 calmed-0.463
Token!"
Token ID2474
Feature activation+1.23e-01

Pos logit contributions
ve+0.229
rie+0.189
iana+0.179
makers+0.175
ged+0.174
Neg logit contributions
 cal-0.153
 alright-0.141
 swe-0.141
 lets-0.136
 ok-0.135
Token Tim
Token ID5045
Feature activation-4.97e-02

Pos logit contributions
ve+0.212
rie+0.169
iana+0.164
ged+0.160
makers+0.156
Neg logit contributions
 cal-0.169
 alright-0.155
 swe-0.154
 ok-0.147
 lets-0.144
Token jumped
Token ID11687
Feature activation+8.99e-03

Pos logit contributions
 cal+0.064
 swe+0.059
 alright+0.059
 ok+0.056
 lets+0.056
Neg logit contributions
ve-0.089
rie-0.072
iana-0.069
makers-0.066
ged-0.066
Token out
Token ID503
Feature activation+6.13e-02

Pos logit contributions
ve+0.018
rie+0.015
iana+0.015
makers+0.014
ged+0.014
Neg logit contributions
 cal-0.010
 swe-0.009
 alright-0.009
 ok-0.008
 lets-0.008
Token
Token ID241
Feature activation+2.73e-02

Pos logit contributions
 cal+0.086
 alright+0.078
 ok+0.077
 swe+0.074
 lets+0.071
Neg logit contributions
ve-0.114
ucer-0.097
rie-0.096
iana-0.092
ged-0.090
TokenI
Token ID40
Feature activation-5.35e-02

Pos logit contributions
ve+0.050
rie+0.044
iana+0.042
eg+0.039
sel+0.039
Neg logit contributions
 cal-0.031
 alright-0.029
 swe-0.028
 ok-0.028
 lets-0.028
Tokenâ
Token ID22940
Feature activation+8.21e-03

Pos logit contributions
 cal+0.076
 alright+0.070
 shall+0.069
 ok+0.068
 lets+0.068
Neg logit contributions
ve-0.086
rie-0.072
iana-0.068
makers-0.067
ged-0.066
Token
Token ID26391
Feature activation+1.35e-02

Pos logit contributions
ve+0.013
rie+0.010
iana+0.009
makers+0.009
ged+0.009
Neg logit contributions
 cal-0.013
 swe-0.012
 alright-0.012
 ok-0.011
 lets-0.011
Token
Token ID8151
Feature activation-1.99e-01

Pos logit contributions
ve+0.021
rie+0.015
iana+0.014
ana+0.014
makers+0.014
Neg logit contributions
 cal-0.022
 swe-0.021
 alright-0.020
 seasons-0.019
 cleans-0.019
Tokenve
Token ID303
Feature activation+4.66e-01

Pos logit contributions
 alright+0.238
 ok+0.234
 cal+0.231
 okay+0.221
 shall+0.217
Neg logit contributions
ve-0.329
rie-0.302
iana-0.280
 ante-0.278
lves-0.269
Token sent
Token ID1908
Feature activation+5.51e-02

Pos logit contributions
ve+0.765
rie+0.688
 spine+0.664
iana+0.648
ula+0.647
Neg logit contributions
 cal-0.570
 alright-0.535
 shall-0.534
 calmed-0.531
 lets-0.528
Token you
Token ID345
Feature activation+2.09e-02

Pos logit contributions
ve+0.098
rie+0.082
iana+0.077
ged+0.074
makers+0.074
Neg logit contributions
 cal-0.071
 alright-0.066
 swe-0.065
 ok-0.064
 lets-0.062
Token this
Token ID428
Feature activation+1.16e-01

Pos logit contributions
ve+0.036
rie+0.029
iana+0.028
makers+0.027
ged+0.027
Neg logit contributions
 cal-0.028
 alright-0.026
 swe-0.026
 ok-0.025
 lets-0.024
Token squash
Token ID34613
Feature activation-1.59e-01

Pos logit contributions
ve+0.219
rie+0.176
iana+0.173
makers+0.165
ged+0.164
Neg logit contributions
 cal-0.139
 swe-0.126
 alright-0.125
 ok-0.119
 lets-0.119
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 cal+0.227
 swe+0.212
 alright+0.211
 ok+0.204
 lets+0.202
Neg logit contributions
ve-0.260
rie-0.208
iana-0.196
ged-0.190
makers-0.189
Token looked
Token ID3114
Feature activation-4.22e-02

Pos logit contributions
 cal+0.040
 alright+0.038
 swe+0.037
 seasons+0.037
 ok+0.037
Neg logit contributions
ve-0.047
ged-0.038
rie-0.037
iana-0.036
sel-0.035
Token it
Token ID340
Feature activation-4.58e-02

Pos logit contributions
 cal+0.056
 swe+0.052
 alright+0.052
 ok+0.049
 lets+0.049
Neg logit contributions
ve-0.074
rie-0.058
iana-0.056
ged-0.055
makers-0.054
Token over
Token ID625
Feature activation-7.61e-02

Pos logit contributions
 cal+0.062
 swe+0.057
 alright+0.056
 ok+0.054
 lets+0.054
Neg logit contributions
ve-0.079
rie-0.063
iana-0.060
ged-0.059
makers-0.058
Token,
Token ID11
Feature activation-3.53e-03

Pos logit contributions
 cal+0.094
 alright+0.085
 swe+0.085
 ok+0.081
 lets+0.080
Neg logit contributions
ve-0.142
rie-0.116
iana-0.111
makers-0.107
ged-0.106
Token mar
Token ID1667
Feature activation+2.57e-01

Pos logit contributions
 cal+0.005
 alright+0.004
 swe+0.004
 ok+0.004
 calmed+0.004
Neg logit contributions
ve-0.006
rie-0.005
iana-0.005
makers-0.005
eg-0.005
Tokenve
Token ID303
Feature activation+4.66e-01

Pos logit contributions
ve+0.213
rie+0.170
iana+0.140
makers+0.137
oys+0.128
Neg logit contributions
 cal-0.516
 alright-0.504
 ok-0.485
 shall-0.484
 calmed-0.478
Tokenlling
Token ID2680
Feature activation+9.27e-02

Pos logit contributions
ve+0.787
rie+0.633
iana+0.605
makers+0.575
eg+0.564
Neg logit contributions
 cal-0.637
 alright-0.596
 swe-0.580
 ok-0.567
 calmed-0.564
Token at
Token ID379
Feature activation-3.55e-02

Pos logit contributions
ve+0.163
rie+0.131
iana+0.126
makers+0.121
ged+0.121
Neg logit contributions
 cal-0.122
 swe-0.112
 alright-0.112
 ok-0.107
 lets-0.106
Token how
Token ID703
Feature activation-1.39e-01

Pos logit contributions
 cal+0.038
 alright+0.035
 swe+0.034
 ok+0.034
 lets+0.033
Neg logit contributions
ve-0.072
rie-0.059
iana-0.054
zel-0.054
makers-0.054
Token perfect
Token ID2818
Feature activation+8.37e-02

Pos logit contributions
 cal+0.197
 alright+0.188
 swe+0.188
 lets+0.184
 seasons+0.181
Neg logit contributions
ve-0.227
makers-0.164
rie-0.163
ged-0.159
iana-0.152
Token it
Token ID340
Feature activation-4.48e-02

Pos logit contributions
ve+0.138
rie+0.110
iana+0.107
ged+0.106
makers+0.103
Neg logit contributions
 cal-0.115
 alright-0.106
 swe-0.105
 ok-0.101
 lets-0.098
Token prove
Token ID5879
Feature activation+4.01e-02

Pos logit contributions
 cal+0.249
 alright+0.230
 swe+0.230
 ok+0.221
 lets+0.218
Neg logit contributions
ve-0.318
rie-0.255
iana-0.244
makers-0.235
ged-0.234
Token I
Token ID314
Feature activation-3.74e-02

Pos logit contributions
ve+0.058
rie+0.043
makers+0.043
iana+0.042
oys+0.040
Neg logit contributions
 cal-0.062
 swe-0.058
 alright-0.056
 calmed-0.056
 cleans-0.055
Token'm
Token ID1101
Feature activation+1.04e-01

Pos logit contributions
 cal+0.044
 swe+0.041
 alright+0.039
 ok+0.036
 calmed+0.035
Neg logit contributions
ve-0.073
rie-0.058
iana-0.056
makers-0.055
ged-0.053
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
ve+0.176
rie+0.132
makers+0.132
iana+0.128
ula+0.125
Neg logit contributions
 cal-0.135
 swe-0.130
 nods-0.128
 seasons-0.127
 shall-0.126
Token cle
Token ID1190
Feature activation-7.40e-03

Pos logit contributions
 cal+0.027
 calmed+0.025
 swe+0.025
 alright+0.024
 nods+0.024
Neg logit contributions
ve-0.028
rie-0.022
makers-0.021
iana-0.021
 ante-0.020
Tokenve
Token ID303
Feature activation+4.66e-01

Pos logit contributions
 cal+0.011
 alright+0.011
 ok+0.011
 lets+0.010
 shall+0.010
Neg logit contributions
ve-0.010
rie-0.009
iana-0.008
ged-0.008
makers-0.007
Tokenrest
Token ID2118
Feature activation-1.55e-02

Pos logit contributions
ve+0.815
rie+0.655
iana+0.628
makers+0.603
ged+0.602
Neg logit contributions
 cal-0.616
 alright-0.567
 swe-0.567
 ok-0.543
 lets-0.538
Token!
Token ID0
Feature activation+1.28e-01

Pos logit contributions
 cal+0.023
 swe+0.022
 alright+0.021
 ok+0.020
 calmed+0.020
Neg logit contributions
ve-0.025
rie-0.019
iana-0.019
makers-0.018
ged-0.018
Token I
Token ID314
Feature activation-3.69e-02

Pos logit contributions
ve+0.166
rie+0.143
oys+0.128
makers+0.125
eg+0.120
Neg logit contributions
 swe-0.201
 cal-0.201
 calmed-0.187
 flour-0.185
 alright-0.185
Token bet
Token ID731
Feature activation+3.98e-02

Pos logit contributions
 cal+0.045
 alright+0.041
 swe+0.041
 ok+0.039
 lets+0.038
Neg logit contributions
ve-0.069
rie-0.056
iana-0.054
makers-0.052
ged-0.052
Token I
Token ID314
Feature activation-3.68e-02

Pos logit contributions
ve+0.053
rie+0.038
makers+0.036
iana+0.035
ged+0.034
Neg logit contributions
 cal-0.067
 swe-0.063
 alright-0.060
 calmed-0.059
 pic-0.058
Token stone
Token ID7815
Feature activation+4.33e-03

Pos logit contributions
ve+0.013
iana+0.010
rie+0.010
ged+0.009
sel+0.009
Neg logit contributions
 cal-0.013
 alright-0.012
 swe-0.012
 ok-0.011
 calmed-0.011
Token I
Token ID314
Feature activation-3.75e-02

Pos logit contributions
ve+0.007
rie+0.006
iana+0.005
makers+0.005
ged+0.005
Neg logit contributions
 cal-0.006
 swe-0.006
 alright-0.006
 ok-0.006
 calmed-0.005
Tokenâ
Token ID22940
Feature activation+7.82e-03

Pos logit contributions
 cal+0.047
 shall+0.046
 lets+0.045
 alright+0.045
 calmed+0.043
Neg logit contributions
ve-0.061
rie-0.054
iana-0.051
ula-0.050
 ante-0.049
Token
Token ID26391
Feature activation+1.26e-02

Pos logit contributions
ve+0.012
rie+0.010
iana+0.009
makers+0.009
ged+0.009
Neg logit contributions
 cal-0.012
 alright-0.011
 swe-0.011
 ok-0.011
 lets-0.010
Token
Token ID8151
Feature activation-2.02e-01

Pos logit contributions
ve+0.022
rie+0.017
iana+0.016
makers+0.016
ged+0.015
Neg logit contributions
 cal-0.018
 swe-0.017
 alright-0.016
 ok-0.015
 calmed-0.015
Tokenve
Token ID303
Feature activation+4.65e-01

Pos logit contributions
 alright+0.253
 cal+0.253
 ok+0.247
 shall+0.235
 okay+0.233
Neg logit contributions
ve-0.322
rie-0.283
iana-0.265
 ante-0.252
ged-0.250
Token ever
Token ID1683
Feature activation+1.39e-01

Pos logit contributions
ve+0.708
rie+0.627
sed+0.595
eg+0.588
ula+0.586
Neg logit contributions
 shall-0.593
 lets-0.581
 cal-0.576
 alright-0.573
 calmed-0.553
Token seen
Token ID1775
Feature activation+9.62e-02

Pos logit contributions
ve+0.253
ula+0.222
iana+0.218
rie+0.214
inky+0.206
Neg logit contributions
 cal-0.167
 alright-0.158
 lets-0.155
 shall-0.154
 ok-0.149
Tokenâ
Token ID22940
Feature activation+8.13e-03

Pos logit contributions
ve+0.137
rie+0.097
iana+0.094
makers+0.094
ged+0.090
Neg logit contributions
 cal-0.160
 swe-0.150
 calmed-0.146
 alright-0.144
 cleans-0.141
Token
Token ID26391
Feature activation+1.29e-02

Pos logit contributions
ve+0.014
rie+0.011
iana+0.010
ged+0.010
makers+0.010
Neg logit contributions
 cal-0.011
 alright-0.011
 swe-0.010
 ok-0.010
 lets-0.010
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
ve+0.023
rie+0.018
iana+0.017
makers+0.016
ged+0.016
Neg logit contributions
 cal-0.018
 swe-0.017
 alright-0.016
 ok-0.015
 calmed-0.015
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
ve+0.031
rie+0.027
lves+0.025
iana+0.024
ged+0.024
Neg logit contributions
 alright-0.025
 cal-0.025
 ok-0.025
 okay-0.023
 shall-0.023
Token You
Token ID921
Feature activation+8.17e-02

Pos logit contributions
 cal+0.022
 swe+0.021
 alright+0.020
 calmed+0.020
 hurricanes+0.020
Neg logit contributions
ve-0.017
rie-0.013
iana-0.012
makers-0.011
oys-0.011
Tokenâ
Token ID22940
Feature activation+6.61e-03

Pos logit contributions
ve+0.139
rie+0.110
iana+0.106
makers+0.101
ged+0.099
Neg logit contributions
 cal-0.112
 swe-0.106
 alright-0.103
 ok-0.098
 lets-0.098
Token
Token ID26391
Feature activation+1.27e-02

Pos logit contributions
ve+0.011
rie+0.008
iana+0.008
ged+0.008
makers+0.008
Neg logit contributions
 cal-0.009
 alright-0.009
 swe-0.008
 ok-0.008
 lets-0.008
Token
Token ID8151
Feature activation-2.01e-01

Pos logit contributions
ve+0.022
rie+0.017
iana+0.016
makers+0.016
ged+0.015
Neg logit contributions
 cal-0.018
 swe-0.017
 alright-0.016
 calmed-0.016
 ok-0.015
Tokenve
Token ID303
Feature activation+4.65e-01

Pos logit contributions
 alright+0.242
 cal+0.236
 shall+0.236
 ok+0.234
 okay+0.222
Neg logit contributions
ve-0.328
rie-0.288
 ante-0.272
iana-0.270
sel-0.265
Token done
Token ID1760
Feature activation+3.79e-02

Pos logit contributions
ve+0.879
ula+0.772
 spine+0.752
rie+0.743
iana+0.738
Neg logit contributions
 cal-0.502
 calmed-0.479
 lets-0.454
 alright-0.448
 shall-0.447
Token a
Token ID257
Feature activation+3.18e-02

Pos logit contributions
ve+0.067
rie+0.056
iana+0.053
ged+0.051
makers+0.050
Neg logit contributions
 cal-0.049
 alright-0.046
 swe-0.045
 ok-0.044
 lets-0.042
Token great
Token ID1049
Feature activation+7.65e-03

Pos logit contributions
ve+0.062
rie+0.054
ged+0.054
iana+0.052
makers+0.050
Neg logit contributions
 cal-0.039
 swe-0.034
 alright-0.033
 ok-0.032
 shall-0.031
Token job
Token ID1693
Feature activation+1.07e-01

Pos logit contributions
ve+0.013
rie+0.011
ged+0.010
sel+0.010
iana+0.010
Neg logit contributions
 cal-0.010
 swe-0.009
 ok-0.009
 alright-0.009
 exercises-0.009
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
ve+0.169
rie+0.138
iana+0.129
ged+0.124
makers+0.120
Neg logit contributions
 cal-0.156
 alright-0.147
 swe-0.146
 ok-0.142
 lets-0.140
Token
Token ID241
Feature activation+2.82e-02

Pos logit contributions
 cal+0.091
 alright+0.078
 ok+0.077
 swe+0.074
 lets+0.074
Neg logit contributions
ve-0.118
ucer-0.098
rie-0.097
ged-0.095
iana-0.092
TokenI
Token ID40
Feature activation-5.24e-02

Pos logit contributions
ve+0.052
rie+0.045
iana+0.043
eg+0.040
ged+0.039
Neg logit contributions
 cal-0.032
 alright-0.029
 lets-0.029
 calmed-0.029
 shall-0.029
Tokenâ
Token ID22940
Feature activation+6.50e-03

Pos logit contributions
 cal+0.076
 alright+0.070
 shall+0.068
 lets+0.068
 calmed+0.067
Neg logit contributions
ve-0.083
rie-0.068
iana-0.065
makers-0.063
ged-0.063
Token
Token ID26391
Feature activation+1.23e-02

Pos logit contributions
ve+0.010
rie+0.008
iana+0.007
makers+0.007
ged+0.007
Neg logit contributions
 cal-0.010
 swe-0.010
 alright-0.010
 ok-0.009
 lets-0.009
Token
Token ID8151
Feature activation-2.01e-01

Pos logit contributions
ve+0.020
rie+0.015
iana+0.014
makers+0.014
da+0.013
Neg logit contributions
 cal-0.019
 swe-0.019
 alright-0.017
 ok-0.017
 seasons-0.016
Tokenve
Token ID303
Feature activation+4.65e-01

Pos logit contributions
 alright+0.240
 ok+0.236
 okay+0.228
 cal+0.227
 shall+0.218
Neg logit contributions
ve-0.320
rie-0.298
iana-0.276
 ante-0.272
sel-0.270
Token been
Token ID587
Feature activation+1.06e-01

Pos logit contributions
ve+0.786
ula+0.693
rie+0.685
iana+0.658
eg+0.652
Neg logit contributions
 cal-0.552
 lets-0.523
 shall-0.521
 alright-0.521
 calmed-0.516
Token waiting
Token ID4953
Feature activation-2.69e-02

Pos logit contributions
ve+0.183
rie+0.161
iana+0.152
ged+0.146
eg+0.144
Neg logit contributions
 cal-0.136
 alright-0.134
 ok-0.128
 swe-0.119
 okay-0.118
Token for
Token ID329
Feature activation+2.41e-01

Pos logit contributions
 cal+0.052
 alright+0.049
 swe+0.049
 ok+0.048
 lets+0.047
Neg logit contributions
ve-0.031
rie-0.022
iana-0.020
ged-0.019
makers-0.019
Token somebody
Token ID8276
Feature activation-2.50e-03

Pos logit contributions
ve+0.443
rie+0.375
iana+0.352
ged+0.337
makers+0.337
Neg logit contributions
 cal-0.299
 alright-0.280
 swe-0.275
 ok-0.268
 lets-0.255
Token to
Token ID284
Feature activation-3.22e-02

Pos logit contributions
 cal+0.003
 swe+0.003
 alright+0.003
 calmed+0.003
 seasons+0.003
Neg logit contributions
ve-0.004
rie-0.004
iana-0.003
makers-0.003
ged-0.003
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
 cal+0.051
 alright+0.048
 swe+0.048
 ok+0.046
 lets+0.045
Neg logit contributions
ve-0.062
rie-0.050
iana-0.048
ged-0.046
makers-0.045
Token I
Token ID314
Feature activation-3.55e-02

Pos logit contributions
 cal+0.019
 swe+0.018
 calmed+0.017
 alright+0.017
 hurricanes+0.016
Neg logit contributions
ve-0.016
rie-0.013
iana-0.012
oys-0.012
makers-0.012
Tokenâ
Token ID22940
Feature activation+7.16e-03

Pos logit contributions
 cal+0.043
 shall+0.041
 alright+0.040
 calmed+0.039
 ok+0.038
Neg logit contributions
ve-0.062
rie-0.054
ula-0.052
iana-0.052
oys-0.051
Token
Token ID26391
Feature activation+1.33e-02

Pos logit contributions
ve+0.012
rie+0.009
iana+0.009
makers+0.008
ged+0.008
Neg logit contributions
 cal-0.010
 alright-0.010
 swe-0.010
 ok-0.009
 lets-0.009
Token
Token ID8151
Feature activation-2.02e-01

Pos logit contributions
ve+0.025
rie+0.020
iana+0.019
makers+0.018
ged+0.018
Neg logit contributions
 cal-0.017
 swe-0.016
 alright-0.015
 ok-0.014
 calmed-0.014
Tokenve
Token ID303
Feature activation+4.65e-01

Pos logit contributions
 alright+0.236
 ok+0.230
 shall+0.228
 cal+0.228
 okay+0.220
Neg logit contributions
ve-0.327
rie-0.295
iana-0.279
 ante-0.277
sel-0.275
Token been
Token ID587
Feature activation+1.06e-01

Pos logit contributions
ve+0.830
ula+0.754
 spine+0.750
sed+0.720
rie+0.718
Neg logit contributions
 cal-0.502
 calmed-0.495
 shall-0.476
 lets-0.474
 alright-0.451
Token counting
Token ID14143
Feature activation+1.15e-03

Pos logit contributions
ve+0.185
rie+0.161
iana+0.153
oys+0.148
ged+0.146
Neg logit contributions
 cal-0.134
 alright-0.129
 ok-0.124
 swe-0.119
 okay-0.114
Token the
Token ID262
Feature activation-1.92e-02

Pos logit contributions
ve+0.002
rie+0.002
iana+0.002
makers+0.002
ged+0.002
Neg logit contributions
 cal-0.001
 swe-0.001
 alright-0.001
 ok-0.001
 lets-0.001
Token days
Token ID1528
Feature activation-3.67e-02

Pos logit contributions
 swe+0.024
 cal+0.023
 alright+0.022
 shall+0.022
 lets+0.021
Neg logit contributions
ve-0.036
rie-0.029
ged-0.028
makers-0.028
lves-0.028
Token until
Token ID1566
Feature activation+1.34e-01

Pos logit contributions
 cal+0.050
 swe+0.046
 alright+0.046
 ok+0.044
 calmed+0.044
Neg logit contributions
ve-0.063
rie-0.050
iana-0.048
makers-0.046
ged-0.046
Token were
Token ID547
Feature activation+6.77e-02

Pos logit contributions
ve+0.099
rie+0.073
iana+0.071
ged+0.069
makers+0.066
Neg logit contributions
 cal-0.103
 alright-0.097
 swe-0.097
 ok-0.095
 lets-0.090
Token a
Token ID257
Feature activation+3.22e-02

Pos logit contributions
ve+0.103
iana+0.077
rie+0.075
makers+0.072
ged+0.070
Neg logit contributions
 cal-0.100
 alright-0.092
 lets-0.091
 cleans-0.091
 nods-0.091
Token deep
Token ID2769
Feature activation+1.45e-02

Pos logit contributions
ve+0.057
rie+0.046
iana+0.044
ged+0.043
makers+0.043
Neg logit contributions
 cal-0.042
 swe-0.039
 alright-0.039
 ok-0.037
 lets-0.036
Token,
Token ID11
Feature activation+1.29e-03

Pos logit contributions
ve+0.024
rie+0.020
iana+0.019
ged+0.018
makers+0.018
Neg logit contributions
 cal-0.020
 alright-0.019
 ok-0.018
 swe-0.018
 lets-0.018
Token vel
Token ID11555
Feature activation+1.41e-02

Pos logit contributions
ve+0.002
rie+0.001
iana+0.001
makers+0.001
ged+0.001
Neg logit contributions
 cal-0.002
 alright-0.002
 cleans-0.002
 lets-0.002
 calmed-0.002
Tokenve
Token ID303
Feature activation+4.64e-01

Pos logit contributions
rie+0.005
oys+0.004
ged+0.003
 Jed+0.003
ve+0.003
Neg logit contributions
 cal-0.030
adow-0.029
 shall-0.029
 butter-0.029
adows-0.029
Tokenty
Token ID774
Feature activation+2.94e-02

Pos logit contributions
ve+0.814
rie+0.675
iana+0.636
makers+0.608
eg+0.605
Neg logit contributions
 cal-0.583
 alright-0.539
 swe-0.532
 lets-0.516
 ok-0.512
Token purple
Token ID14032
Feature activation+1.67e-02

Pos logit contributions
ve+0.054
rie+0.044
iana+0.043
ged+0.041
makers+0.041
Neg logit contributions
 cal-0.034
 swe-0.032
 alright-0.031
 lets-0.030
 ok-0.030
Token.
Token ID13
Feature activation-7.85e-03

Pos logit contributions
ve+0.031
rie+0.025
iana+0.024
ged+0.023
makers+0.023
Neg logit contributions
 cal-0.021
 alright-0.019
 swe-0.019
 ok-0.018
 lets-0.018
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
 cal+0.015
 alright+0.015
 ok+0.014
 swe+0.014
 calmed+0.014
Neg logit contributions
ve-0.007
rie-0.006
iana-0.006
da-0.005
 Bub-0.005
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
ve+0.018
rie+0.013
iana+0.013
makers+0.013
ged+0.012
Neg logit contributions
 cal-0.019
 swe-0.018
 alright-0.018
 ok-0.017
 lets-0.016
Token the
Token ID262
Feature activation-2.12e-02

Pos logit contributions
ve+0.122
rie+0.095
makers+0.092
iana+0.090
ged+0.090
Neg logit contributions
 cal-0.102
 alright-0.099
 ok-0.097
 swe-0.095
 lets-0.092
Token village
Token ID7404
Feature activation-6.13e-02

Pos logit contributions
 cal+0.025
 swe+0.023
 alright+0.023
 ok+0.021
 lets+0.021
Neg logit contributions
ve-0.041
rie-0.033
iana-0.032
ged-0.031
makers-0.031
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
 cal+0.077
 swe+0.071
 alright+0.071
 ok+0.068
 lets+0.067
Neg logit contributions
ve-0.111
rie-0.090
iana-0.086
makers-0.083
ged-0.083
Token She
Token ID1375
Feature activation-2.48e-02

Pos logit contributions
 cal+0.021
 alright+0.019
 swe+0.019
 ok+0.018
 lets+0.018
Neg logit contributions
ve-0.021
rie-0.017
iana-0.016
ged-0.015
makers-0.015
Token mar
Token ID1667
Feature activation+2.54e-01

Pos logit contributions
 cal+0.031
 swe+0.028
 alright+0.028
 calmed+0.027
 lets+0.027
Neg logit contributions
ve-0.045
rie-0.037
iana-0.036
makers-0.035
ged-0.034
Tokenve
Token ID303
Feature activation+4.63e-01

Pos logit contributions
ve+0.152
rie+0.148
 spine+0.145
airy+0.118
 ante+0.114
Neg logit contributions
 shall-0.490
 alright-0.484
 cal-0.480
 calmed-0.478
 flour-0.466
Tokenlled
Token ID3353
Feature activation+1.62e-01

Pos logit contributions
ve+0.783
ula+0.651
iana+0.640
eg+0.632
rie+0.628
Neg logit contributions
 calmed-0.565
 cal-0.534
 lets-0.519
 shall-0.507
 alright-0.497
Token at
Token ID379
Feature activation-3.73e-02

Pos logit contributions
ve+0.282
rie+0.223
ged+0.215
iana+0.213
makers+0.207
Neg logit contributions
 cal-0.211
 alright-0.197
 swe-0.197
 ok-0.191
 lets-0.191
Token the
Token ID262
Feature activation-2.02e-02

Pos logit contributions
 cal+0.043
 swe+0.040
 alright+0.039
 ok+0.037
 lets+0.037
Neg logit contributions
ve-0.072
rie-0.059
iana-0.057
makers-0.055
ged-0.055
Token bright
Token ID6016
Feature activation-1.88e-02

Pos logit contributions
 cal+0.028
 swe+0.027
 alright+0.026
 seasons+0.024
 ok+0.024
Neg logit contributions
ve-0.034
rie-0.027
iana-0.026
ged-0.026
makers-0.024
Token,
Token ID11
Feature activation-4.14e-03

Pos logit contributions
 cal+0.022
 swe+0.021
 alright+0.020
 lets+0.019
 calmed+0.019
Neg logit contributions
ve-0.036
rie-0.029
iana-0.028
makers-0.027
ged-0.027
Token g
Token ID308
Feature activation-5.46e-02

Pos logit contributions
ve+0.167
rie+0.133
iana+0.130
makers+0.123
ged+0.122
Neg logit contributions
 cal-0.141
 swe-0.130
 alright-0.130
 lets-0.127
 ok-0.125
Tokento
Token ID1462
Feature activation-5.37e-02

Pos logit contributions
 cal+0.067
 alright+0.065
 swe+0.064
 ok+0.063
 shall+0.062
Neg logit contributions
ve-0.092
rie-0.076
makers-0.072
iana-0.072
ucer-0.070
Token be
Token ID307
Feature activation+1.25e-01

Pos logit contributions
 cal+0.069
 alright+0.068
 ok+0.068
 shall+0.064
 lets+0.064
Neg logit contributions
ve-0.080
rie-0.073
iana-0.070
lves-0.069
 spine-0.066
Token the
Token ID262
Feature activation-2.07e-02

Pos logit contributions
ve+0.222
rie+0.178
iana+0.171
makers+0.165
ged+0.165
Neg logit contributions
 cal-0.163
 swe-0.149
 alright-0.148
 lets-0.142
 ok-0.141
Token cle
Token ID1190
Feature activation-9.83e-03

Pos logit contributions
 cal+0.024
 swe+0.022
 alright+0.022
 ok+0.021
 lets+0.020
Neg logit contributions
ve-0.041
rie-0.035
ged-0.034
iana-0.033
makers-0.032
Tokenve
Token ID303
Feature activation+4.63e-01

Pos logit contributions
 cal+0.020
 alright+0.020
 ok+0.020
 shall+0.020
 okay+0.019
Neg logit contributions
rie-0.006
ve-0.005
ky-0.004
ged-0.004
lves-0.004
Tokenrest
Token ID2118
Feature activation-1.86e-02

Pos logit contributions
ve+0.783
rie+0.642
iana+0.610
ged+0.581
eg+0.578
Neg logit contributions
 cal-0.601
 alright-0.577
 lets-0.553
 ok-0.553
 swe-0.547
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
 cal+0.027
 swe+0.025
 shall+0.025
 alright+0.025
 lets+0.024
Neg logit contributions
ve-0.029
rie-0.023
iana-0.022
ged-0.021
makers-0.021
Token But
Token ID887
Feature activation+4.64e-02

Pos logit contributions
 cal+0.017
 alright+0.016
 swe+0.015
 ok+0.015
 shall+0.015
Neg logit contributions
ve-0.025
rie-0.021
iana-0.020
ged-0.019
makers-0.019
Token in
Token ID287
Feature activation-3.80e-02

Pos logit contributions
ve+0.088
rie+0.073
iana+0.068
makers+0.067
ged+0.065
Neg logit contributions
 cal-0.055
 alright-0.052
 ok-0.050
 swe-0.050
 shall-0.047
Token the
Token ID262
Feature activation-2.05e-02

Pos logit contributions
 cal+0.042
 alright+0.037
 swe+0.037
 ok+0.036
 lets+0.036
Neg logit contributions
ve-0.080
rie-0.065
makers-0.061
iana-0.060
ged-0.059
Token proudly
Token ID23017
Feature activation+4.64e-02

Pos logit contributions
ve+0.106
rie+0.086
iana+0.084
makers+0.082
ged+0.079
Neg logit contributions
 cal-0.065
 swe-0.060
 alright-0.059
 ok-0.057
 lets-0.056
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
ve+0.084
rie+0.068
iana+0.066
makers+0.064
ged+0.062
Neg logit contributions
 cal-0.058
 swe-0.054
 alright-0.053
 ok-0.051
 lets-0.051
Token and
Token ID290
Feature activation-1.90e-02

Pos logit contributions
 cal+0.016
 alright+0.015
 swe+0.015
 ok+0.014
 lets+0.014
Neg logit contributions
ve-0.017
rie-0.013
iana-0.013
ged-0.012
makers-0.012
Token everyone
Token ID2506
Feature activation+1.54e-02

Pos logit contributions
 cal+0.025
 swe+0.023
 alright+0.023
 ok+0.022
 lets+0.022
Neg logit contributions
ve-0.034
rie-0.027
iana-0.026
makers-0.025
ged-0.025
Token mar
Token ID1667
Feature activation+2.54e-01

Pos logit contributions
ve+0.027
rie+0.021
iana+0.020
ged+0.020
makers+0.020
Neg logit contributions
 cal-0.021
 alright-0.019
 swe-0.019
 ok-0.019
 lets-0.018
Tokenve
Token ID303
Feature activation+4.63e-01

Pos logit contributions
ve+0.296
rie+0.250
 spine+0.231
makers+0.228
iana+0.228
Neg logit contributions
 cal-0.410
 alright-0.389
 swe-0.382
 shall-0.380
 calmed-0.378
Tokenlled
Token ID3353
Feature activation+1.61e-01

Pos logit contributions
ve+0.780
ula+0.692
sed+0.674
 spine+0.663
eg+0.654
Neg logit contributions
 calmed-0.560
 cal-0.546
 alright-0.522
 lets-0.507
 shall-0.505
Token at
Token ID379
Feature activation-3.77e-02

Pos logit contributions
ve+0.287
rie+0.231
iana+0.220
ged+0.218
makers+0.214
Neg logit contributions
 cal-0.202
 alright-0.186
 swe-0.186
 lets-0.180
 ok-0.179
Token his
Token ID465
Feature activation+4.21e-02

Pos logit contributions
 cal+0.046
 alright+0.042
 swe+0.042
 ok+0.040
 lets+0.039
Neg logit contributions
ve-0.070
rie-0.057
iana-0.056
ged-0.054
makers-0.053
Token kindness
Token ID23887
Feature activation-8.67e-02

Pos logit contributions
ve+0.074
rie+0.060
iana+0.058
makers+0.056
ged+0.055
Neg logit contributions
 cal-0.055
 swe-0.051
 alright-0.051
 ok-0.049
 lets-0.048
Token and
Token ID290
Feature activation-1.76e-02

Pos logit contributions
 cal+0.107
 swe+0.099
 alright+0.098
 ok+0.094
 lets+0.093
Neg logit contributions
ve-0.159
rie-0.129
iana-0.124
makers-0.120
ged-0.119
Token
Token ID198
Feature activation-5.82e-03

Pos logit contributions
 cal+0.048
 swe+0.045
 alright+0.045
 ok+0.043
 lets+0.042
Neg logit contributions
ve-0.060
rie-0.048
iana-0.046
ged-0.045
makers-0.044
Token
Token ID198
Feature activation-6.28e-03

Pos logit contributions
 cal+0.008
 swe+0.008
 alright+0.007
 ok+0.007
 lets+0.007
Neg logit contributions
ve-0.010
rie-0.008
iana-0.007
makers-0.007
ged-0.007
TokenFe
Token ID14304
Feature activation+5.48e-02

Pos logit contributions
 cal+0.008
 swe+0.008
 alright+0.008
 ok+0.007
 lets+0.007
Neg logit contributions
ve-0.011
rie-0.009
iana-0.009
makers-0.008
ged-0.008
Tokeneling
Token ID10809
Feature activation-2.47e-02

Pos logit contributions
ve+0.110
rie+0.096
oys+0.090
sed+0.090
eg+0.090
Neg logit contributions
 cal-0.059
 alright-0.050
 swe-0.048
 lets-0.047
 calmed-0.045
Token much
Token ID881
Feature activation-5.32e-02

Pos logit contributions
 cal+0.032
 swe+0.031
 pic+0.031
 seasons+0.030
 alright+0.030
Neg logit contributions
ve-0.041
rie-0.032
ged-0.032
makers-0.031
iana-0.031
Token cal
Token ID2386
Feature activation-3.69e-01

Pos logit contributions
 pic+0.058
 shall+0.057
 hurricanes+0.056
 seasons+0.055
 Fog+0.054
Neg logit contributions
ve-0.088
ging-0.074
rie-0.074
ged-0.072
ky-0.068
Tokenmer
Token ID647
Feature activation+2.60e-01

Pos logit contributions
 cal+0.867
 swe+0.828
 alright+0.828
 ok+0.808
 lets+0.804
Neg logit contributions
ve-0.269
rie-0.141
iana-0.120
makers-0.101
ged-0.099
Token,
Token ID11
Feature activation-2.02e-02

Pos logit contributions
ve+0.473
rie+0.383
iana+0.368
ged+0.350
makers+0.350
Neg logit contributions
 cal-0.323
 alright-0.297
 swe-0.294
 ok-0.285
 lets-0.281
Token Amy
Token ID14235
Feature activation-1.76e-01

Pos logit contributions
 cal+0.026
 swe+0.024
 alright+0.024
 ok+0.023
 lets+0.023
Neg logit contributions
ve-0.035
rie-0.028
iana-0.027
makers-0.026
ged-0.026
Token held
Token ID2714
Feature activation-3.65e-02

Pos logit contributions
 cal+0.233
 swe+0.221
 alright+0.221
 ok+0.210
 hurricanes+0.208
Neg logit contributions
ve-0.298
rie-0.238
iana-0.227
ged-0.222
makers-0.216
Token the
Token ID262
Feature activation-3.70e-02

Pos logit contributions
 cal+0.054
 alright+0.051
 swe+0.051
 calmed+0.049
 seasons+0.049
Neg logit contributions
ve-0.057
ged-0.045
iana-0.045
rie-0.044
ging-0.041
Token because
Token ID780
Feature activation+5.78e-02

Pos logit contributions
 cal+0.023
 swe+0.021
 alright+0.021
 lets+0.020
 calmed+0.020
Neg logit contributions
ve-0.026
rie-0.020
iana-0.019
ged-0.019
makers-0.019
Token soon
Token ID2582
Feature activation-1.25e-03

Pos logit contributions
ve+0.084
rie+0.063
iana+0.062
makers+0.061
ged+0.057
Neg logit contributions
 cal-0.089
 swe-0.081
 alright-0.079
 calmed-0.079
 nods-0.078
Token the
Token ID262
Feature activation-3.36e-02

Pos logit contributions
 cal+0.002
 swe+0.002
 alright+0.002
 ok+0.001
 lets+0.001
Neg logit contributions
ve-0.002
rie-0.002
iana-0.002
ged-0.002
makers-0.001
Token fish
Token ID5916
Feature activation-9.67e-02

Pos logit contributions
 cal+0.044
 alright+0.041
 swe+0.040
 lets+0.039
 calmed+0.039
Neg logit contributions
ve-0.057
rie-0.045
iana-0.043
makers-0.042
ged-0.041
Token was
Token ID373
Feature activation-7.31e-03

Pos logit contributions
 cal+0.137
 swe+0.130
 alright+0.127
 ok+0.122
 lets+0.120
Neg logit contributions
ve-0.160
rie-0.123
iana-0.119
ged-0.116
makers-0.115
Token cal
Token ID2386
Feature activation-3.68e-01

Pos logit contributions
 cal+0.011
 swe+0.010
 lets+0.010
 nods+0.010
 shall+0.010
Neg logit contributions
ve-0.011
makers-0.008
ged-0.008
rie-0.008
ging-0.008
Tokenmer
Token ID647
Feature activation+2.64e-01

Pos logit contributions
 cal+0.850
 alright+0.804
 swe+0.804
 lets+0.782
 ok+0.782
Neg logit contributions
ve-0.285
rie-0.153
iana-0.144
ged-0.114
makers-0.114
Token and
Token ID290
Feature activation-2.60e-02

Pos logit contributions
ve+0.472
rie+0.382
iana+0.367
makers+0.349
ged+0.349
Neg logit contributions
 cal-0.339
 alright-0.312
 swe-0.310
 ok-0.299
 lets-0.295
Token more
Token ID517
Feature activation-8.74e-02

Pos logit contributions
birds+0.027
 pic+0.027
adows+0.027
 seasons+0.027
rises+0.027
Neg logit contributions
ve-0.042
rie-0.033
makers-0.032
ged-0.031
chie-0.030
Token relaxed
Token ID18397
Feature activation-2.16e-01

Pos logit contributions
 cal+0.121
 shall+0.118
 cleans+0.116
 swe+0.116
 lets+0.116
Neg logit contributions
ve-0.133
rie-0.099
makers-0.097
ging-0.097
ged-0.096
Token.
Token ID13
Feature activation-3.15e-02

Pos logit contributions
 cal+0.297
 swe+0.275
 alright+0.273
 lets+0.264
 ok+0.263
Neg logit contributions
ve-0.361
rie-0.284
iana-0.271
ged-0.269
makers-0.269
Tokenicker
Token ID15799
Feature activation+2.04e-03

Pos logit contributions
ve+0.145
rie+0.127
makers+0.124
iana+0.122
 ante+0.121
Neg logit contributions
 cal-0.083
 alright-0.079
 ok-0.075
 shall-0.072
 nods-0.070
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
ve+0.003
rie+0.003
iana+0.003
ged+0.003
makers+0.003
Neg logit contributions
 cal-0.003
 alright-0.003
 swe-0.003
 ok-0.002
 lets-0.002
Token soon
Token ID2582
Feature activation+2.14e-03

Pos logit contributions
 cal+0.018
 alright+0.017
 swe+0.017
 ok+0.016
 shall+0.016
Neg logit contributions
ve-0.023
rie-0.019
ged-0.018
iana-0.018
makers-0.017
Token felt
Token ID2936
Feature activation+6.29e-02

Pos logit contributions
ve+0.004
rie+0.003
iana+0.003
ged+0.003
makers+0.003
Neg logit contributions
 cal-0.003
 alright-0.003
 swe-0.002
 ok-0.002
 lets-0.002
Token much
Token ID881
Feature activation-4.25e-02

Pos logit contributions
ve+0.096
rie+0.073
ged+0.072
makers+0.072
iana+0.070
Neg logit contributions
 swe-0.080
 cal-0.080
 lets-0.078
 seasons-0.078
 pic-0.078
Token cal
Token ID2386
Feature activation-3.66e-01

Pos logit contributions
 hurricanes+0.052
 pic+0.051
 seasons+0.049
adoes+0.049
adows+0.048
Neg logit contributions
ve-0.064
sel-0.051
rie-0.051
ged-0.049
ging-0.049
Tokenmer
Token ID647
Feature activation+2.63e-01

Pos logit contributions
 cal+0.814
 swe+0.757
 alright+0.753
 calmed+0.742
 lets+0.741
Neg logit contributions
ve-0.315
iana-0.185
rie-0.181
eg-0.144
ged-0.142
Token.
Token ID13
Feature activation-2.26e-02

Pos logit contributions
ve+0.484
rie+0.396
iana+0.383
eg+0.359
ged+0.356
Neg logit contributions
 cal-0.321
 alright-0.295
 swe-0.289
 ok-0.283
 lets-0.276
Token Her
Token ID2332
Feature activation-2.29e-02

Pos logit contributions
 cal+0.033
 swe+0.032
 alright+0.031
 ok+0.030
 calmed+0.029
Neg logit contributions
ve-0.035
rie-0.028
iana-0.027
makers-0.025
ged-0.025
Token brother
Token ID3956
Feature activation-1.77e-02

Pos logit contributions
 cal+0.031
 alright+0.030
 swe+0.029
 lets+0.028
 calmed+0.028
Neg logit contributions
ve-0.038
rie-0.029
iana-0.027
sed-0.027
makers-0.027
Token eventually
Token ID4191
Feature activation-1.76e-02

Pos logit contributions
 cal+0.024
 swe+0.022
 alright+0.022
 ok+0.021
 lets+0.021
Neg logit contributions
ve-0.030
rie-0.024
iana-0.023
ged-0.022
makers-0.022
Token around
Token ID1088
Feature activation-4.14e-02

Pos logit contributions
 cal+0.031
 alright+0.029
 swe+0.029
 ok+0.028
 lets+0.027
Neg logit contributions
ve-0.047
rie-0.039
iana-0.037
makers-0.036
ged-0.036
Token him
Token ID683
Feature activation+2.78e-02

Pos logit contributions
 cal+0.052
 swe+0.049
 alright+0.048
 ok+0.046
 calmed+0.045
Neg logit contributions
ve-0.076
rie-0.062
iana-0.058
makers-0.057
ucer-0.056
Token and
Token ID290
Feature activation-2.15e-02

Pos logit contributions
ve+0.045
rie+0.035
ged+0.033
iana+0.033
makers+0.032
Neg logit contributions
 cal-0.040
 swe-0.037
 alright-0.037
 lets-0.036
 pic-0.035
Token he
Token ID339
Feature activation+1.99e-02

Pos logit contributions
 cal+0.028
 alright+0.026
 swe+0.026
 ok+0.025
 lets+0.024
Neg logit contributions
ve-0.037
rie-0.030
ged-0.029
iana-0.029
makers-0.027
Token felt
Token ID2936
Feature activation+6.43e-02

Pos logit contributions
ve+0.034
rie+0.027
ged+0.026
iana+0.025
sel+0.024
Neg logit contributions
 cal-0.026
 alright-0.025
 swe-0.025
 ok-0.024
 lets-0.023
Token cal
Token ID2386
Feature activation-3.66e-01

Pos logit contributions
ve+0.102
rie+0.075
ged+0.075
iana+0.074
makers+0.073
Neg logit contributions
 cal-0.084
 swe-0.083
 seasons-0.081
 lets-0.080
 hurricanes-0.079
Tokenmer
Token ID647
Feature activation+2.64e-01

Pos logit contributions
 cal+0.800
 alright+0.743
 swe+0.739
 calmed+0.733
 lets+0.716
Neg logit contributions
ve-0.324
iana-0.208
rie-0.197
eg-0.164
oys-0.159
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
ve+0.487
rie+0.401
iana+0.387
ged+0.360
eg+0.358
Neg logit contributions
 cal-0.320
 alright-0.297
 swe-0.291
 ok-0.286
 lets-0.274
Token His
Token ID2399
Feature activation+2.88e-02

Pos logit contributions
 cal+0.037
 swe+0.035
 alright+0.034
 ok+0.032
 lets+0.032
Neg logit contributions
ve-0.037
rie-0.029
iana-0.027
ged-0.026
ula-0.026
Token mom
Token ID1995
Feature activation-5.56e-02

Pos logit contributions
ve+0.043
sed+0.033
rie+0.032
 pupil+0.031
eg+0.031
Neg logit contributions
 cal-0.042
 alright-0.039
 lets-0.039
 hurricanes-0.038
 shall-0.037
Token knew
Token ID2993
Feature activation+1.44e-01

Pos logit contributions
 cal+0.075
 swe+0.072
 alright+0.070
 pic+0.068
 seasons+0.067
Neg logit contributions
ve-0.095
rie-0.075
ged-0.070
iana-0.070
makers-0.067
Token leaves
Token ID5667
Feature activation-1.63e-01

Pos logit contributions
 cal+0.087
 alright+0.082
 ok+0.079
 calmed+0.076
 swe+0.076
Neg logit contributions
ve-0.148
rie-0.113
makers-0.112
ged-0.109
iana-0.108
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
 cal+0.244
 alright+0.229
 swe+0.229
 ok+0.222
 pic+0.215
Neg logit contributions
ve-0.257
ged-0.194
rie-0.185
makers-0.185
iana-0.184
Token he
Token ID339
Feature activation+2.37e-02

Pos logit contributions
 cal+0.023
 alright+0.021
 swe+0.020
 ok+0.020
 lets+0.019
Neg logit contributions
ve-0.032
rie-0.025
iana-0.024
makers-0.024
ged-0.024
Token felt
Token ID2936
Feature activation+6.56e-02

Pos logit contributions
ve+0.040
rie+0.031
iana+0.029
ged+0.029
makers+0.028
Neg logit contributions
 cal-0.033
 swe-0.031
 alright-0.031
 ok-0.030
 lets-0.029
Token much
Token ID881
Feature activation-4.30e-02

Pos logit contributions
ve+0.093
makers+0.065
rie+0.065
iana+0.065
ged+0.064
Neg logit contributions
 swe-0.095
 cal-0.095
 alright-0.092
 lets-0.091
 pic-0.090
Token cal
Token ID2386
Feature activation-3.66e-01

Pos logit contributions
 hurricanes+0.048
 pic+0.046
 seasons+0.044
rises+0.044
adoes+0.044
Neg logit contributions
ve-0.071
rie-0.056
makers-0.055
sel-0.054
ged-0.053
Tokenmer
Token ID647
Feature activation+2.68e-01

Pos logit contributions
 cal+0.775
 calmed+0.706
 alright+0.705
 swe+0.704
 lets+0.702
Neg logit contributions
ve-0.341
iana-0.216
rie-0.203
eg-0.176
oys-0.169
Token.
Token ID13
Feature activation-2.49e-02

Pos logit contributions
ve+0.503
rie+0.419
iana+0.403
eg+0.378
ucer+0.376
Neg logit contributions
 cal-0.311
 alright-0.286
 ok-0.276
 swe-0.275
 lets-0.266
Token He
Token ID679
Feature activation+9.23e-03

Pos logit contributions
 cal+0.037
 swe+0.035
 alright+0.034
 ok+0.033
 calmed+0.032
Neg logit contributions
ve-0.039
rie-0.031
iana-0.030
makers-0.028
ged-0.028
Token hopped
Token ID45230
Feature activation-8.36e-02

Pos logit contributions
ve+0.015
rie+0.011
ged+0.011
iana+0.011
sel+0.010
Neg logit contributions
 alright-0.013
 cal-0.013
 swe-0.012
 ok-0.012
 hurricanes-0.012
Token up
Token ID510
Feature activation-9.03e-02

Pos logit contributions
 cal+0.087
 swe+0.075
 alright+0.075
 ok+0.071
 calmed+0.070
Neg logit contributions
ve-0.169
rie-0.148
iana-0.140
makers-0.134
eg-0.133
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
 cal+0.113
 swe+0.105
 alright+0.104
 lets+0.100
 ok+0.099
Neg logit contributions
ve-0.154
rie-0.123
iana-0.118
ged-0.114
makers-0.113
Token The
Token ID383
Feature activation+2.02e-02

Pos logit contributions
 cal+0.027
 swe+0.026
 alright+0.025
 lets+0.024
 ok+0.024
Neg logit contributions
ve-0.026
rie-0.022
iana-0.020
makers-0.019
ula-0.019
Token water
Token ID1660
Feature activation-8.25e-02

Pos logit contributions
ve+0.032
rie+0.026
iana+0.025
 ante+0.024
makers+0.024
Neg logit contributions
 cal-0.027
 alright-0.027
 swe-0.026
 ok-0.026
 lets-0.026
Token was
Token ID373
Feature activation-3.77e-03

Pos logit contributions
 cal+0.118
 alright+0.112
 swe+0.112
 ok+0.108
 lets+0.107
Neg logit contributions
ve-0.131
rie-0.098
ged-0.098
iana-0.097
makers-0.092
Token much
Token ID881
Feature activation-4.71e-02

Pos logit contributions
 cal+0.005
 lets+0.005
 nods+0.005
 swe+0.005
 cleans+0.005
Neg logit contributions
ve-0.006
ged-0.004
makers-0.004
rie-0.004
ging-0.004
Token cal
Token ID2386
Feature activation-3.65e-01

Pos logit contributions
adows+0.066
adow+0.064
 pic+0.058
 Dinner+0.058
Sun+0.057
Neg logit contributions
ve-0.059
 for-0.051
ging-0.047
 whole-0.047
 few-0.046
Tokenmer
Token ID647
Feature activation+2.62e-01

Pos logit contributions
 cal+0.802
 swe+0.745
 alright+0.744
 calmed+0.733
 lets+0.727
Neg logit contributions
ve-0.319
iana-0.192
rie-0.177
ged-0.153
eg-0.149
Token now
Token ID783
Feature activation+6.29e-02

Pos logit contributions
ve+0.475
rie+0.386
iana+0.371
ged+0.353
makers+0.350
Neg logit contributions
 cal-0.327
 alright-0.302
 swe-0.299
 ok-0.289
 lets-0.283
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
ve+0.102
rie+0.078
ged+0.075
iana+0.075
makers+0.074
Neg logit contributions
 cal-0.090
 swe-0.086
 alright-0.083
 ok-0.080
 lets-0.080
Token Everyone
Token ID11075
Feature activation+3.12e-02

Pos logit contributions
 cal+0.028
 swe+0.027
 alright+0.026
 ok+0.025
 lets+0.025
Neg logit contributions
ve-0.031
rie-0.025
iana-0.024
makers-0.023
ged-0.022
Token was
Token ID373
Feature activation-2.98e-03

Pos logit contributions
ve+0.050
rie+0.039
ged+0.037
iana+0.037
makers+0.037
Neg logit contributions
 cal-0.045
 swe-0.044
 alright-0.043
 ok-0.042
 pic-0.040
Token just
Token ID655
Feature activation-6.60e-02

Pos logit contributions
ve+0.119
rie+0.096
iana+0.092
makers+0.088
ged+0.088
Neg logit contributions
 cal-0.089
 alright-0.082
 swe-0.082
 ok-0.079
 lets-0.078
Token curious
Token ID11040
Feature activation-1.24e-01

Pos logit contributions
 cal+0.092
 swe+0.086
 alright+0.085
 lets+0.081
 ok+0.081
Neg logit contributions
ve-0.110
rie-0.086
iana-0.082
makers-0.081
ged-0.080
Token kids
Token ID3988
Feature activation+7.24e-03

Pos logit contributions
 cal+0.208
 alright+0.190
 lets+0.190
Fly+0.190
 cleans+0.188
Neg logit contributions
ve-0.147
makers-0.116
rie-0.111
ged-0.104
oys-0.102
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
ve+0.012
makers+0.009
rie+0.009
ged+0.009
iana+0.008
Neg logit contributions
 cal-0.011
 swe-0.010
 alright-0.009
 pic-0.009
 seasons-0.009
Token He
Token ID679
Feature activation+1.67e-02

Pos logit contributions
 cal+0.017
 swe+0.017
 alright+0.017
 ok+0.016
 calmed+0.016
Neg logit contributions
ve-0.016
rie-0.013
iana-0.012
makers-0.011
eg-0.011
Token cal
Token ID2386
Feature activation-3.64e-01

Pos logit contributions
ve+0.030
rie+0.025
iana+0.023
makers+0.022
ged+0.022
Neg logit contributions
 cal-0.020
 alright-0.020
 swe-0.019
 ok-0.019
 shall-0.018
Tokenms
Token ID907
Feature activation+1.75e-02

Pos logit contributions
 cal+0.546
 calmed+0.500
 lets+0.483
 alright+0.477
 swe+0.475
Neg logit contributions
ve-0.567
iana-0.440
rie-0.437
eg-0.398
ula-0.394
Token down
Token ID866
Feature activation-4.59e-02

Pos logit contributions
ve+0.034
rie+0.029
iana+0.027
makers+0.026
sel+0.026
Neg logit contributions
 cal-0.019
 alright-0.017
 swe-0.017
 lets-0.017
 ok-0.017
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
 cal+0.058
 alright+0.054
 swe+0.053
 ok+0.051
 lets+0.050
Neg logit contributions
ve-0.083
rie-0.067
iana-0.064
makers-0.062
ged-0.062
Token explains
Token ID6688
Feature activation-1.02e-01

Pos logit contributions
 alright+0.017
 ok+0.016
rises+0.015
 pic+0.015
 swe+0.015
Neg logit contributions
ve-0.020
rie-0.017
ged-0.016
iana-0.015
makers-0.015
Token to
Token ID284
Feature activation-4.36e-02

Pos logit contributions
 cal+0.154
 alright+0.150
 lets+0.147
 pic+0.147
 shall+0.146
Neg logit contributions
ve-0.134
iana-0.100
ged-0.099
makers-0.099
rie-0.098
Token,
Token ID11
Feature activation-9.57e-03

Pos logit contributions
 cal+0.111
 pic+0.106
 seasons+0.104
 lets+0.103
 hurricanes+0.102
Neg logit contributions
ve-0.103
ged-0.075
rie-0.074
sel-0.070
iana-0.069
Token she
Token ID673
Feature activation-1.10e-02

Pos logit contributions
 cal+0.016
 swe+0.016
 snowball+0.016
 cleans+0.015
 pic+0.015
Neg logit contributions
ve-0.011
iana-0.008
ged-0.007
 center-0.007
 centre-0.007
Token felt
Token ID2936
Feature activation+6.62e-02

Pos logit contributions
 cal+0.017
 swe+0.017
 hurricanes+0.017
 alright+0.016
 flour+0.016
Neg logit contributions
ve-0.016
rie-0.011
ged-0.010
iana-0.010
sel-0.010
Token herself
Token ID5223
Feature activation+1.22e-01

Pos logit contributions
ve+0.094
ged+0.070
ging+0.066
iana+0.065
makers+0.065
Neg logit contributions
 swe-0.090
 pic-0.089
 seasons-0.089
birds-0.088
rises-0.088
Token getting
Token ID1972
Feature activation+6.92e-02

Pos logit contributions
ve+0.194
ged+0.136
rie+0.131
 spine+0.128
makers+0.128
Neg logit contributions
 cal-0.174
 pic-0.164
 lets-0.164
 swe-0.163
 seasons-0.160
Token cal
Token ID2386
Feature activation-3.64e-01

Pos logit contributions
ve+0.091
ged+0.069
ging+0.067
ula+0.061
 missing+0.060
Neg logit contributions
 pic-0.106
 seasons-0.102
 nods-0.101
adoes-0.099
 lets-0.098
Tokenmer
Token ID647
Feature activation+2.67e-01

Pos logit contributions
 cal+0.841
 alright+0.814
 swe+0.811
 ok+0.800
 shall+0.788
Neg logit contributions
ve-0.276
rie-0.162
iana-0.123
makers-0.117
ged-0.113
Token and
Token ID290
Feature activation-2.01e-02

Pos logit contributions
ve+0.459
rie+0.367
iana+0.352
makers+0.345
ged+0.342
Neg logit contributions
 cal-0.359
 swe-0.331
 alright-0.329
 lets-0.314
 ok-0.314
Token cal
Token ID2386
Feature activation-3.63e-01

Pos logit contributions
birds+0.025
adoes+0.023
 pic+0.023
rises+0.022
 seasons+0.022
Neg logit contributions
ve-0.029
 centre-0.022
 for-0.022
ged-0.021
 center-0.021
Tokenmer
Token ID647
Feature activation+2.68e-01

Pos logit contributions
 cal+0.841
 swe+0.803
 alright+0.803
 ok+0.783
 lets+0.780
Neg logit contributions
ve-0.277
rie-0.152
iana-0.130
makers-0.112
ged-0.110
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
ve+0.483
rie+0.392
iana+0.376
makers+0.360
ged+0.360
Neg logit contributions
 cal-0.338
 alright-0.311
 swe-0.310
 ok-0.297
 lets-0.293
Token herself
Token ID5223
Feature activation+1.22e-01

Pos logit contributions
ve+0.094
ged+0.070
ging+0.066
iana+0.065
makers+0.065
Neg logit contributions
 swe-0.090
 pic-0.089
 seasons-0.089
birds-0.088
rises-0.088
Token getting
Token ID1972
Feature activation+6.92e-02

Pos logit contributions
ve+0.194
ged+0.136
rie+0.131
 spine+0.128
makers+0.128
Neg logit contributions
 cal-0.174
 pic-0.164
 lets-0.164
 swe-0.163
 seasons-0.160
Token cal
Token ID2386
Feature activation-3.64e-01

Pos logit contributions
ve+0.091
ged+0.069
ging+0.067
ula+0.061
 missing+0.060
Neg logit contributions
 pic-0.106
 seasons-0.102
 nods-0.101
adoes-0.099
 lets-0.098
Tokenmer
Token ID647
Feature activation+2.67e-01

Pos logit contributions
 cal+0.841
 alright+0.814
 swe+0.811
 ok+0.800
 shall+0.788
Neg logit contributions
ve-0.276
rie-0.162
iana-0.123
makers-0.117
ged-0.113
Token and
Token ID290
Feature activation-2.01e-02

Pos logit contributions
ve+0.459
rie+0.367
iana+0.352
makers+0.345
ged+0.342
Neg logit contributions
 cal-0.359
 swe-0.331
 alright-0.329
 lets-0.314
 ok-0.314
Token cal
Token ID2386
Feature activation-3.63e-01

Pos logit contributions
birds+0.025
adoes+0.023
 pic+0.023
rises+0.022
 seasons+0.022
Neg logit contributions
ve-0.029
 centre-0.022
 for-0.022
ged-0.021
 center-0.021
Tokenmer
Token ID647
Feature activation+2.68e-01

Pos logit contributions
 cal+0.841
 swe+0.803
 alright+0.803
 ok+0.783
 lets+0.780
Neg logit contributions
ve-0.277
rie-0.152
iana-0.130
makers-0.112
ged-0.110
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
ve+0.483
rie+0.392
iana+0.376
makers+0.360
ged+0.360
Neg logit contributions
 cal-0.338
 alright-0.311
 swe-0.310
 ok-0.297
 lets-0.293
Token She
Token ID1375
Feature activation-2.64e-02

Pos logit contributions
 swe+0.034
 cal+0.033
 alright+0.031
 calmed+0.030
 lets+0.029
Neg logit contributions
ve-0.027
rie-0.020
iana-0.020
 centre-0.018
makers-0.018
Token kept
Token ID4030
Feature activation-7.66e-02

Pos logit contributions
 hurricanes+0.039
 alright+0.039
 swe+0.039
 flour+0.038
 cal+0.038
Neg logit contributions
ve-0.038
rie-0.028
ged-0.025
iana-0.025
da-0.025
Token taking
Token ID2263
Feature activation+4.77e-02

Pos logit contributions
 cal+0.113
 seasons+0.113
 shall+0.110
 swe+0.109
igators+0.108
Neg logit contributions
ve-0.113
ged-0.091
ging-0.087
iana-0.074
ga-0.071
Token happier
Token ID23030
Feature activation-1.18e-01

Pos logit contributions
ve+0.107
rie+0.085
iana+0.081
makers+0.079
ged+0.078
Neg logit contributions
 cal-0.089
 swe-0.085
 alright-0.083
 lets-0.080
 ok-0.079
Token and
Token ID290
Feature activation-2.52e-02

Pos logit contributions
 cal+0.157
 alright+0.146
 swe+0.141
 ok+0.139
 calmed+0.137
Neg logit contributions
ve-0.210
rie-0.174
iana-0.163
eg-0.157
makers-0.156
Token the
Token ID262
Feature activation-2.72e-02

Pos logit contributions
 cal+0.031
 swe+0.030
 alright+0.030
 flour+0.029
 seasons+0.029
Neg logit contributions
ve-0.041
rie-0.034
iana-0.032
makers-0.031
ged-0.031
Token house
Token ID2156
Feature activation-1.05e-02

Pos logit contributions
 cal+0.038
 alright+0.035
 swe+0.035
 lets+0.035
 calmed+0.034
Neg logit contributions
ve-0.044
rie-0.035
iana-0.034
makers-0.033
sel-0.030
Token felt
Token ID2936
Feature activation+6.63e-02

Pos logit contributions
 cal+0.014
 swe+0.014
 alright+0.014
 ok+0.013
 pic+0.013
Neg logit contributions
ve-0.017
rie-0.013
iana-0.013
makers-0.012
ged-0.012
Token cal
Token ID2386
Feature activation-3.63e-01

Pos logit contributions
ve+0.110
rie+0.087
iana+0.083
makers+0.083
ged+0.080
Neg logit contributions
 cal-0.084
 swe-0.081
 lets-0.078
 alright-0.077
 seasons-0.077
Tokenmer
Token ID647
Feature activation+2.65e-01

Pos logit contributions
 cal+0.809
 alright+0.757
 swe+0.754
 calmed+0.742
 lets+0.733
Neg logit contributions
ve-0.303
iana-0.177
rie-0.172
eg-0.142
ged-0.135
Token.
Token ID13
Feature activation-2.63e-02

Pos logit contributions
ve+0.486
rie+0.398
iana+0.383
ged+0.361
eg+0.358
Neg logit contributions
 cal-0.326
 alright-0.303
 swe-0.295
 ok-0.291
 lets-0.280
Token Life
Token ID5155
Feature activation-8.07e-02

Pos logit contributions
 cal+0.035
 alright+0.032
 swe+0.032
 ok+0.030
 lets+0.030
Neg logit contributions
ve-0.046
rie-0.037
iana-0.036
ged-0.034
makers-0.034
Token was
Token ID373
Feature activation-3.30e-03

Pos logit contributions
 cal+0.107
 swe+0.100
 alright+0.099
 ok+0.095
 lets+0.094
Neg logit contributions
ve-0.140
rie-0.112
iana-0.107
makers-0.104
ged-0.103
Token so
Token ID523
Feature activation+3.09e-02

Pos logit contributions
 cal+0.004
 swe+0.004
 lets+0.004
 nods+0.004
 wipes+0.004
Neg logit contributions
ve-0.005
rie-0.004
makers-0.004
iana-0.004
ged-0.004
Token
Token ID198
Feature activation+4.72e-03

Pos logit contributions
ve+0.009
rie+0.007
iana+0.007
makers+0.007
ged+0.007
Neg logit contributions
 cal-0.006
 alright-0.006
 swe-0.006
 ok-0.005
 lets-0.005
TokenDad
Token ID46270
Feature activation-2.13e-02

Pos logit contributions
ve+0.008
rie+0.007
iana+0.007
makers+0.006
ged+0.006
Neg logit contributions
 cal-0.006
 alright-0.006
 swe-0.006
 ok-0.005
 lets-0.005
Token sigh
Token ID19680
Feature activation-9.32e-02

Pos logit contributions
 alright+0.026
 ok+0.026
 pic+0.025
 swe+0.025
 okay+0.024
Neg logit contributions
ve-0.038
rie-0.032
iana-0.031
ged-0.028
sel-0.028
Tokens
Token ID82
Feature activation+1.95e-01

Pos logit contributions
 cal+0.167
 swe+0.158
 alright+0.156
 lets+0.151
 ok+0.150
Neg logit contributions
ve-0.120
rie-0.088
iana-0.084
makers-0.078
ged-0.077
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
ve+0.278
ged+0.208
makers+0.196
rie+0.196
ging+0.194
Neg logit contributions
 pic-0.315
 cal-0.300
 shall-0.294
 lets-0.291
 seasons-0.289
Token cal
Token ID2386
Feature activation-3.62e-01

Pos logit contributions
 alright+0.014
 ok+0.014
adows+0.013
rises+0.013
 seasons+0.013
Neg logit contributions
ve-0.016
rie-0.014
iana-0.013
ged-0.012
 cash-0.012
Tokenms
Token ID907
Feature activation+2.20e-02

Pos logit contributions
 cal+0.661
 swe+0.613
 alright+0.605
 calmed+0.603
 lets+0.597
Neg logit contributions
ve-0.453
rie-0.322
iana-0.313
makers-0.280
ged-0.277
Token down
Token ID866
Feature activation-4.42e-02

Pos logit contributions
ve+0.042
rie+0.034
iana+0.033
makers+0.032
ged+0.031
Neg logit contributions
 cal-0.026
 alright-0.024
 swe-0.023
 lets-0.022
 ok-0.022
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 cal+0.055
 swe+0.050
 alright+0.050
 ok+0.048
 lets+0.047
Neg logit contributions
ve-0.082
rie-0.066
iana-0.064
makers-0.061
ged-0.061
Token He
Token ID679
Feature activation+9.37e-03

Pos logit contributions
 cal+0.019
 swe+0.019
 alright+0.018
 ok+0.017
 shall+0.017
Neg logit contributions
ve-0.017
rie-0.014
iana-0.013
oys-0.012
ged-0.012
Token sees
Token ID7224
Feature activation+1.36e-01

Pos logit contributions
ve+0.016
rie+0.013
iana+0.012
ged+0.011
sel+0.011
Neg logit contributions
 alright-0.012
 ok-0.012
 swe-0.011
 cal-0.011
 hurricanes-0.011
Token down
Token ID866
Feature activation-3.82e-02

Pos logit contributions
 cal+0.309
 alright+0.296
 ok+0.290
 lets+0.287
 shall+0.282
Neg logit contributions
ve-0.335
rie-0.268
ged-0.265
iana-0.250
ging-0.244
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
 cal+0.048
 alright+0.044
 swe+0.044
 ok+0.042
 lets+0.042
Neg logit contributions
ve-0.069
rie-0.056
iana-0.054
ged-0.052
makers-0.052
Token She
Token ID1375
Feature activation-2.52e-02

Pos logit contributions
 cal+0.022
 swe+0.021
 alright+0.021
 ok+0.019
 shall+0.019
Neg logit contributions
ve-0.016
rie-0.013
iana-0.012
sed-0.012
makers-0.011
Token felt
Token ID2936
Feature activation+6.83e-02

Pos logit contributions
 cal+0.038
 alright+0.037
 swe+0.036
 ok+0.036
 hurricanes+0.034
Neg logit contributions
ve-0.038
rie-0.029
ged-0.027
iana-0.026
da-0.024
Token much
Token ID881
Feature activation-4.05e-02

Pos logit contributions
ve+0.107
rie+0.082
ged+0.079
iana+0.079
makers+0.078
Neg logit contributions
 cal-0.092
 swe-0.091
 alright-0.088
 lets-0.087
 pic-0.085
Token cal
Token ID2386
Feature activation-3.62e-01

Pos logit contributions
adoes+0.050
 hurricanes+0.050
birds+0.050
 seasons+0.049
 pic+0.048
Neg logit contributions
ve-0.059
ged-0.045
ging-0.045
rie-0.045
 whole-0.043
Tokenmer
Token ID647
Feature activation+2.66e-01

Pos logit contributions
 cal+0.739
 calmed+0.674
 alright+0.673
 swe+0.673
 lets+0.657
Neg logit contributions
ve-0.376
iana-0.253
rie-0.235
eg-0.218
oys-0.210
Token now
Token ID783
Feature activation+7.13e-02

Pos logit contributions
ve+0.490
rie+0.400
iana+0.385
ged+0.363
makers+0.361
Neg logit contributions
 cal-0.327
 alright-0.302
 swe-0.297
 ok-0.289
 lets-0.282
Token,
Token ID11
Feature activation-1.99e-03

Pos logit contributions
ve+0.125
rie+0.100
iana+0.096
makers+0.093
ged+0.092
Neg logit contributions
 cal-0.094
 swe-0.087
 alright-0.087
 ok-0.083
 lets-0.082
Token knowing
Token ID6970
Feature activation+2.30e-01

Pos logit contributions
 cal+0.003
 swe+0.003
 alright+0.003
 seasons+0.003
 cleans+0.003
Neg logit contributions
ve-0.003
rie-0.002
ged-0.002
iana-0.002
makers-0.002
Token that
Token ID326
Feature activation+1.07e-01

Pos logit contributions
ve+0.285
iana+0.206
ged+0.199
makers+0.190
rie+0.189
Neg logit contributions
 cal-0.387
 swe-0.368
 alright-0.357
 shall-0.353
 nods-0.352
Token see
Token ID766
Feature activation+3.97e-02

Pos logit contributions
 cal+0.022
 swe+0.021
 alright+0.021
 calmed+0.021
 seasons+0.020
Neg logit contributions
ve-0.018
makers-0.013
rie-0.013
 spine-0.012
ucer-0.012
Token the
Token ID262
Feature activation-2.78e-02

Pos logit contributions
ve+0.040
 for+0.035
 imaginary+0.027
makers+0.026
ged+0.025
Neg logit contributions
 calmed-0.068
 cal-0.062
 slows-0.061
 nods-0.061
brush-0.060
Token sun
Token ID4252
Feature activation-1.33e-01

Pos logit contributions
 calmed+0.034
 cal+0.033
 cleans+0.032
 nods+0.032
 lets+0.032
Neg logit contributions
ve-0.045
 whole-0.034
makers-0.033
rie-0.033
iana-0.033
Token together
Token ID1978
Feature activation-2.83e-02

Pos logit contributions
 cal+0.209
 alright+0.194
 ok+0.192
 calmed+0.190
 wipes+0.184
Neg logit contributions
ve-0.193
rie-0.133
makers-0.131
ucer-0.127
 whole-0.125
Token,
Token ID11
Feature activation-9.43e-03

Pos logit contributions
 cal+0.045
 calmed+0.042
 swe+0.041
 cleans+0.041
 alright+0.041
Neg logit contributions
ve-0.040
makers-0.027
rie-0.027
sel-0.026
ged-0.026
Token alright
Token ID23036
Feature activation-3.62e-01

Pos logit contributions
 cal+0.014
 calmed+0.013
lim+0.013
 cleans+0.013
 swe+0.013
Neg logit contributions
ve-0.011
 for-0.008
makers-0.008
 spine-0.008
rie-0.008
Token?"
Token ID1701
Feature activation-1.38e-01

Pos logit contributions
 cal+0.624
 calmed+0.594
 cleans+0.576
 swe+0.562
 pic+0.556
Neg logit contributions
ve-0.436
makers-0.334
oys-0.301
 for-0.284
rie-0.278
Token
Token ID198
Feature activation-2.55e-03

Pos logit contributions
 cal+0.199
 alright+0.188
 swe+0.187
 ok+0.179
 lets+0.173
Neg logit contributions
ve-0.221
rie-0.174
iana-0.169
ged-0.163
makers-0.160
Token
Token ID198
Feature activation-3.38e-03

Pos logit contributions
 cal+0.004
 alright+0.004
 swe+0.004
 ok+0.004
 lets+0.003
Neg logit contributions
ve-0.004
rie-0.003
iana-0.003
makers-0.003
ged-0.003
TokenSo
Token ID2396
Feature activation-1.76e-01

Pos logit contributions
 cal+0.005
 swe+0.005
 alright+0.005
 ok+0.004
 lets+0.004
Neg logit contributions
ve-0.006
rie-0.004
iana-0.004
makers-0.004
ged-0.004
Token,
Token ID11
Feature activation-2.01e-02

Pos logit contributions
 cal+0.211
 swe+0.194
 alright+0.194
 ok+0.186
 lets+0.184
Neg logit contributions
ve-0.332
rie-0.270
iana-0.258
makers-0.252
ged-0.252
Token cookie
Token ID19751
Feature activation-1.29e-01

Pos logit contributions
ve+0.047
rie+0.036
iana+0.035
ucer+0.035
eg+0.034
Neg logit contributions
 cal-0.042
 alright-0.041
 ok-0.039
 hurricanes-0.039
 seasons-0.038
Token.
Token ID13
Feature activation-2.63e-03

Pos logit contributions
 cal+0.156
 swe+0.143
 alright+0.140
 lets+0.134
 ok+0.134
Neg logit contributions
ve-0.243
rie-0.199
iana-0.191
makers-0.183
ged-0.182
Token
Token ID198
Feature activation+1.36e-02

Pos logit contributions
 cal+0.004
 alright+0.004
 swe+0.004
 ok+0.004
adows+0.004
Neg logit contributions
ve-0.004
rie-0.003
iana-0.003
makers-0.003
ula-0.003
Token
Token ID198
Feature activation+1.40e-02

Pos logit contributions
ve+0.028
rie+0.024
iana+0.023
sel+0.023
chie+0.022
Neg logit contributions
 cal-0.014
 lets-0.012
 swe-0.012
 calmed-0.011
 alright-0.011
TokenAnna
Token ID31160
Feature activation-5.57e-03

Pos logit contributions
ve+0.028
rie+0.024
iana+0.024
sel+0.022
ged+0.022
Neg logit contributions
 cal-0.015
 swe-0.013
 lets-0.013
 alright-0.012
 cleans-0.012
Token cal
Token ID2386
Feature activation-3.62e-01

Pos logit contributions
 alright+0.008
 ok+0.008
 hurricanes+0.008
 swe+0.008
adows+0.008
Neg logit contributions
ve-0.008
rie-0.006
iana-0.006
eg-0.006
 ante-0.006
Tokenms
Token ID907
Feature activation+9.92e-03

Pos logit contributions
 cal+0.465
 calmed+0.409
 swe+0.402
 lets+0.400
 alright+0.390
Neg logit contributions
ve-0.652
rie-0.520
iana-0.516
ged-0.471
makers-0.471
Token down
Token ID866
Feature activation-4.74e-02

Pos logit contributions
ve+0.018
rie+0.014
iana+0.014
ged+0.014
makers+0.013
Neg logit contributions
 cal-0.013
 alright-0.012
 swe-0.012
 ok-0.011
 pic-0.011
Token and
Token ID290
Feature activation-8.50e-04

Pos logit contributions
 cal+0.064
 alright+0.063
 swe+0.061
 ok+0.060
 shall+0.060
Neg logit contributions
ve-0.078
rie-0.062
iana-0.058
ged-0.058
makers-0.057
Token forg
Token ID5784
Feature activation-4.21e-02

Pos logit contributions
 alright+0.001
 ok+0.001
adows+0.001
 seasons+0.001
 shall+0.001
Neg logit contributions
ve-0.001
 cash-0.001
rie-0.001
ged-0.001
iana-0.001
Tokenives
Token ID1083
Feature activation+4.77e-02

Pos logit contributions
 cal+0.067
 swe+0.061
 alright+0.061
 lets+0.060
 calmed+0.059
Neg logit contributions
ve-0.065
iana-0.049
rie-0.048
makers-0.045
ged-0.045
Token.
Token ID13
Feature activation-1.33e-03

Pos logit contributions
ve+0.139
rie+0.112
iana+0.108
ged+0.104
makers+0.103
Neg logit contributions
 cal-0.096
 swe-0.088
 alright-0.087
 ok-0.083
 lets-0.083
Token She
Token ID1375
Feature activation-1.96e-02

Pos logit contributions
 cal+0.002
 alright+0.002
 swe+0.002
 ok+0.002
 shall+0.002
Neg logit contributions
ve-0.002
rie-0.002
iana-0.001
ged-0.001
makers-0.001
Token stops
Token ID9911
Feature activation-1.36e-01

Pos logit contributions
 alright+0.025
 ok+0.024
 cal+0.024
 swe+0.023
 okay+0.022
Neg logit contributions
ve-0.033
rie-0.028
iana-0.026
eg-0.024
ged-0.024
Token them
Token ID606
Feature activation+2.06e-02

Pos logit contributions
 cal+0.184
 swe+0.169
 alright+0.169
 ok+0.161
 lets+0.160
Neg logit contributions
ve-0.237
rie-0.191
iana-0.183
makers-0.174
ged-0.172
Token and
Token ID290
Feature activation+1.18e-03

Pos logit contributions
ve+0.030
rie+0.023
ged+0.022
makers+0.021
iana+0.020
Neg logit contributions
 cal-0.032
 swe-0.031
 alright-0.031
 ok-0.030
 seasons-0.029
Token cal
Token ID2386
Feature activation-3.60e-01

Pos logit contributions
ve+0.002
rie+0.002
iana+0.002
ged+0.002
 ante+0.001
Neg logit contributions
 alright-0.002
 ok-0.001
 cal-0.001
 swe-0.001
 okay-0.001
Tokenms
Token ID907
Feature activation+1.05e-02

Pos logit contributions
 cal+0.544
 swe+0.495
 calmed+0.485
 alright+0.483
 lets+0.481
Neg logit contributions
ve-0.562
rie-0.433
iana-0.428
makers-0.387
ged-0.384
Token them
Token ID606
Feature activation+1.89e-02

Pos logit contributions
ve+0.020
rie+0.016
iana+0.015
makers+0.015
ged+0.015
Neg logit contributions
 cal-0.013
 swe-0.012
 alright-0.012
 ok-0.011
 lets-0.011
Token down
Token ID866
Feature activation-4.67e-02

Pos logit contributions
ve+0.027
ged+0.020
makers+0.020
rie+0.020
ging+0.020
Neg logit contributions
 cal-0.029
 alright-0.028
 pic-0.028
 ok-0.027
 shall-0.027
Token.
Token ID13
Feature activation-3.29e-03

Pos logit contributions
 cal+0.066
 alright+0.062
 swe+0.061
 ok+0.059
 lets+0.059
Neg logit contributions
ve-0.077
rie-0.060
makers-0.057
iana-0.056
ged-0.056
Token She
Token ID1375
Feature activation-2.11e-02

Pos logit contributions
 cal+0.005
 alright+0.005
 swe+0.005
 ok+0.004
 shall+0.004
Neg logit contributions
ve-0.005
rie-0.004
iana-0.004
ged-0.004
makers-0.004
Token.
Token ID13
Feature activation-3.77e-03

Pos logit contributions
ve+0.023
ged+0.018
sel+0.016
ging+0.015
ula+0.015
Neg logit contributions
 pic-0.036
 cal-0.036
 hurricanes-0.033
 seasons-0.033
 alright-0.033
Token
Token ID198
Feature activation+1.53e-02

Pos logit contributions
 cal+0.006
 alright+0.006
 swe+0.006
 ok+0.005
 seasons+0.005
Neg logit contributions
ve-0.005
rie-0.004
ged-0.004
iana-0.004
eg-0.004
Token
Token ID198
Feature activation+1.55e-02

Pos logit contributions
ve+0.031
rie+0.027
iana+0.025
sel+0.024
makers+0.024
Neg logit contributions
 cal-0.016
 lets-0.014
 alright-0.014
 calmed-0.014
 swe-0.014
TokenThe
Token ID464
Feature activation-6.17e-03

Pos logit contributions
ve+0.030
rie+0.025
iana+0.025
makers+0.023
ged+0.023
Neg logit contributions
 cal-0.018
 lets-0.016
 alright-0.015
 calmed-0.015
 swe-0.015
Token kitten
Token ID42143
Feature activation-9.06e-03

Pos logit contributions
 cal+0.008
 swe+0.007
 alright+0.007
 ok+0.007
 lets+0.007
Neg logit contributions
ve-0.011
rie-0.009
iana-0.009
ged-0.008
makers-0.008
Token cal
Token ID2386
Feature activation-3.59e-01

Pos logit contributions
 alright+0.012
 ok+0.012
 hurricanes+0.011
adows+0.011
rises+0.011
Neg logit contributions
ve-0.014
sed-0.011
rie-0.010
iana-0.010
ged-0.009
Tokenms
Token ID907
Feature activation+1.55e-02

Pos logit contributions
 cal+0.488
 calmed+0.441
 lets+0.429
 nods+0.420
 swe+0.417
Neg logit contributions
ve-0.612
iana-0.492
rie-0.482
ula-0.453
eg-0.440
Token down
Token ID866
Feature activation-4.07e-02

Pos logit contributions
ve+0.029
rie+0.024
iana+0.023
ged+0.022
makers+0.022
Neg logit contributions
 cal-0.018
 swe-0.017
 alright-0.017
 ok-0.016
 lets-0.016
Token and
Token ID290
Feature activation-1.77e-03

Pos logit contributions
 cal+0.054
 alright+0.051
 swe+0.051
 ok+0.049
 shall+0.048
Neg logit contributions
ve-0.070
rie-0.056
iana-0.053
ged-0.053
makers-0.052
Token pur
Token ID1308
Feature activation-1.90e-01

Pos logit contributions
 alright+0.002
 ok+0.002
adows+0.002
 shall+0.002
 okay+0.002
Neg logit contributions
ve-0.003
ged-0.002
sed-0.002
rie-0.002
 cash-0.002
Tokenrs
Token ID3808
Feature activation+6.61e-02

Pos logit contributions
 cal+0.275
 swe+0.243
 calmed+0.239
 alright+0.238
 lets+0.236
Neg logit contributions
ve-0.322
iana-0.252
rie-0.249
makers-0.223
ucer-0.223
Token rh
Token ID9529
Feature activation+1.86e-01

Pos logit contributions
ve+0.059
rie+0.050
iana+0.049
ged+0.048
ula+0.047
Neg logit contributions
 cal-0.051
 swe-0.045
 flour-0.044
 snow-0.044
 lets-0.043
Tokenino
Token ID2879
Feature activation+2.29e-01

Pos logit contributions
ve+0.239
iana+0.210
rie+0.202
 spine+0.185
ula+0.181
Neg logit contributions
 cal-0.301
 lets-0.276
 cleans-0.272
 shall-0.266
 nods-0.265
Token eats
Token ID25365
Feature activation-3.61e-02

Pos logit contributions
ve+0.390
rie+0.296
sel+0.280
ged+0.276
iana+0.276
Neg logit contributions
 alright-0.303
 ok-0.288
 cal-0.286
 swe-0.282
 hurricanes-0.273
Token it
Token ID340
Feature activation-3.29e-02

Pos logit contributions
 cal+0.048
 alright+0.046
 ok+0.044
 swe+0.044
 lets+0.042
Neg logit contributions
ve-0.060
rie-0.047
ged-0.045
iana-0.045
makers-0.044
Token and
Token ID290
Feature activation+3.26e-03

Pos logit contributions
 cal+0.048
 alright+0.047
 ok+0.045
 hurricanes+0.044
 swe+0.044
Neg logit contributions
ve-0.049
rie-0.036
ged-0.035
makers-0.034
sel-0.033
Token cal
Token ID2386
Feature activation-3.59e-01

Pos logit contributions
ve+0.006
rie+0.005
iana+0.005
ged+0.005
makers+0.005
Neg logit contributions
 alright-0.004
 cal-0.004
 ok-0.003
 swe-0.003
 shall-0.003
Tokenms
Token ID907
Feature activation+1.52e-02

Pos logit contributions
 cal+0.595
 calmed+0.534
 lets+0.530
 swe+0.529
 alright+0.513
Neg logit contributions
ve-0.515
iana-0.399
rie-0.390
ula-0.345
oys-0.335
Token down
Token ID866
Feature activation-4.22e-02

Pos logit contributions
ve+0.028
rie+0.022
iana+0.022
ged+0.021
makers+0.021
Neg logit contributions
 cal-0.019
 swe-0.017
 alright-0.017
 ok-0.016
 lets-0.016
Token.
Token ID13
Feature activation+8.74e-05

Pos logit contributions
 cal+0.057
 alright+0.054
 swe+0.053
 ok+0.052
 shall+0.050
Neg logit contributions
ve-0.071
rie-0.057
makers-0.053
iana-0.053
ged-0.053
Token
Token ID198
Feature activation+1.69e-02

Pos logit contributions
ve+0.000
rie+0.000
iana+0.000
eg+0.000
makers+0.000
Neg logit contributions
 cal-0.000
 swe-0.000
 alright-0.000
 ok-0.000
 hurricanes-0.000
Token
Token ID198
Feature activation+1.68e-02

Pos logit contributions
ve+0.033
rie+0.027
iana+0.026
ged+0.025
makers+0.025
Neg logit contributions
 cal-0.020
 alright-0.017
 swe-0.017
 lets-0.017
 calmed-0.016
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
 cal+0.246
 swe+0.220
 calmed+0.213
 alright+0.210
 hurricanes+0.206
Neg logit contributions
ve-0.231
iana-0.168
rie-0.156
makers-0.152
ops-0.150
Token there
Token ID612
Feature activation-2.05e-01

Pos logit contributions
 cal+0.017
 swe+0.016
 alright+0.015
 calmed+0.015
 ok+0.015
Neg logit contributions
ve-0.021
rie-0.017
iana-0.016
makers-0.016
ged-0.015
Token.
Token ID13
Feature activation-2.83e-02

Pos logit contributions
 cal+0.307
 swe+0.284
 alright+0.272
 calmed+0.269
 ok+0.261
Neg logit contributions
ve-0.316
iana-0.237
rie-0.232
makers-0.221
sel-0.213
Token Everything
Token ID11391
Feature activation-1.52e-01

Pos logit contributions
 cal+0.044
 swe+0.041
 alright+0.040
 calmed+0.039
 ok+0.037
Neg logit contributions
ve-0.041
rie-0.034
iana-0.031
makers-0.031
oys-0.030
Token is
Token ID318
Feature activation-2.50e-02

Pos logit contributions
 cal+0.217
 alright+0.204
 swe+0.202
 ok+0.199
 calmed+0.189
Neg logit contributions
ve-0.239
rie-0.186
makers-0.175
iana-0.174
ged-0.165
Token alright
Token ID23036
Feature activation-3.58e-01

Pos logit contributions
 cal+0.035
 swe+0.032
 lets+0.031
 alright+0.031
 nods+0.031
Neg logit contributions
ve-0.041
rie-0.031
makers-0.030
iana-0.030
ged-0.028
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
 cal+0.543
 swe+0.490
 calmed+0.478
 cleans+0.472
 lets+0.470
Neg logit contributions
ve-0.560
rie-0.418
makers-0.412
iana-0.408
oys-0.402
Token If
Token ID1002
Feature activation-7.75e-02

Pos logit contributions
 cal+0.033
 swe+0.032
 calmed+0.030
 alright+0.030
 cleans+0.029
Neg logit contributions
ve-0.030
rie-0.024
makers-0.022
iana-0.021
oys-0.021
Token you
Token ID345
Feature activation+2.21e-03

Pos logit contributions
 cal+0.110
 swe+0.102
 alright+0.102
 ok+0.098
 lets+0.097
Neg logit contributions
ve-0.128
rie-0.102
iana-0.097
makers-0.093
ged-0.093
Token go
Token ID467
Feature activation-1.48e-01

Pos logit contributions
ve+0.004
rie+0.003
iana+0.003
makers+0.002
sel+0.002
Neg logit contributions
 cal-0.003
 swe-0.003
 alright-0.003
 flour-0.003
 hurricanes-0.003
Token back
Token ID736
Feature activation+3.78e-02

Pos logit contributions
 cal+0.198
 swe+0.182
 alright+0.182
 ok+0.174
 lets+0.173
Neg logit contributions
ve-0.257
rie-0.205
iana-0.197
makers-0.190
ged-0.189
Token made
Token ID925
Feature activation+3.57e-02

Pos logit contributions
 cal+0.058
 alright+0.054
 swe+0.054
 ok+0.052
 lets+0.051
Neg logit contributions
ve-0.071
rie-0.056
iana-0.053
ged-0.051
makers-0.051
Token the
Token ID262
Feature activation-2.03e-02

Pos logit contributions
ve+0.067
rie+0.055
iana+0.054
ged+0.051
makers+0.051
Neg logit contributions
 cal-0.044
 alright-0.040
 swe-0.040
 ok-0.038
 calmed-0.037
Token little
Token ID1310
Feature activation+3.62e-02

Pos logit contributions
 cal+0.023
 alright+0.020
 swe+0.020
 shall+0.020
 snow+0.020
Neg logit contributions
ve-0.040
ged-0.033
rie-0.032
iana-0.032
makers-0.031
Token monkey
Token ID21657
Feature activation+7.30e-02

Pos logit contributions
ve+0.077
iana+0.061
rie+0.059
ged+0.059
ula+0.059
Neg logit contributions
 cal-0.039
 swe-0.037
 snow-0.034
 nods-0.033
 seasons-0.032
Token feel
Token ID1254
Feature activation+5.96e-02

Pos logit contributions
ve+0.126
rie+0.098
iana+0.093
makers+0.091
ged+0.091
Neg logit contributions
 cal-0.098
 swe-0.091
 alright-0.091
 lets-0.087
 ok-0.087
Token cal
Token ID2386
Feature activation-3.58e-01

Pos logit contributions
ve+0.084
ged+0.063
makers+0.060
ging+0.058
rie+0.057
Neg logit contributions
 swe-0.084
 pic-0.082
 seasons-0.082
rises-0.082
 flour-0.081
Tokenmer
Token ID647
Feature activation+2.68e-01

Pos logit contributions
 cal+0.737
 calmed+0.670
 alright+0.666
 swe+0.659
 lets+0.649
Neg logit contributions
ve-0.361
iana-0.255
rie-0.226
eg-0.206
oys-0.196
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
ve+0.514
rie+0.429
iana+0.416
eg+0.387
ucer+0.385
Neg logit contributions
 cal-0.305
 alright-0.281
 ok-0.271
 swe-0.271
 calmed-0.259
Token sensing
Token ID34244
Feature activation+2.84e-02

Pos logit contributions
 alright+0.017
 flour+0.017
 pic+0.016
rises+0.016
 swe+0.016
Neg logit contributions
ve-0.017
ged-0.013
rie-0.013
makers-0.012
 ante-0.012
Token a
Token ID257
Feature activation+3.07e-02

Pos logit contributions
ve+0.048
rie+0.039
iana+0.037
makers+0.036
ged+0.036
Neg logit contributions
 cal-0.038
 alright-0.036
 swe-0.036
 ok-0.034
 lets-0.034
Token warm
Token ID5814
Feature activation-1.43e-01

Pos logit contributions
ve+0.053
rie+0.045
iana+0.043
ged+0.041
makers+0.041
Neg logit contributions
 cal-0.042
 swe-0.038
 alright-0.038
 ok-0.036
 calmed-0.036
Token was
Token ID373
Feature activation+4.63e-03

Pos logit contributions
 cal+0.009
 swe+0.008
 alright+0.008
 lets+0.008
 ok+0.008
Neg logit contributions
ve-0.012
rie-0.009
iana-0.009
makers-0.009
ged-0.009
Token finished
Token ID5201
Feature activation+9.35e-03

Pos logit contributions
ve+0.007
rie+0.005
ged+0.004
iana+0.004
makers+0.004
Neg logit contributions
 cal-0.007
 swe-0.007
 seasons-0.007
 pic-0.007
 lets-0.007
Token she
Token ID673
Feature activation-6.19e-03

Pos logit contributions
ve+0.015
iana+0.011
ged+0.011
rie+0.011
makers+0.010
Neg logit contributions
 cal-0.014
 alright-0.013
 swe-0.013
 lets-0.013
 ok-0.012
Token felt
Token ID2936
Feature activation+7.03e-02

Pos logit contributions
 cal+0.009
 swe+0.008
 alright+0.008
 ok+0.008
 hurricanes+0.008
Neg logit contributions
ve-0.010
rie-0.008
iana-0.007
ged-0.007
sel-0.007
Token much
Token ID881
Feature activation-3.82e-02

Pos logit contributions
ve+0.113
rie+0.084
ged+0.084
iana+0.083
makers+0.080
Neg logit contributions
 cal-0.091
 swe-0.090
 seasons-0.089
 pic-0.089
 lets-0.087
Token cal
Token ID2386
Feature activation-3.58e-01

Pos logit contributions
 hurricanes+0.049
 seasons+0.049
adoes+0.047
birds+0.047
igators+0.047
Neg logit contributions
ve-0.058
ged-0.044
ging-0.044
rie-0.043
sel-0.042
Tokenmer
Token ID647
Feature activation+2.71e-01

Pos logit contributions
 cal+0.783
 alright+0.728
 swe+0.727
 calmed+0.714
 lets+0.710
Neg logit contributions
ve-0.319
iana-0.188
rie-0.182
ged-0.152
eg-0.149
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
ve+0.487
rie+0.395
iana+0.380
ged+0.360
makers+0.359
Neg logit contributions
 cal-0.343
 alright-0.316
 swe-0.313
 ok-0.303
 lets-0.298
Token The
Token ID383
Feature activation+2.93e-02

Pos logit contributions
 cal+0.020
 alright+0.019
 swe+0.019
 ok+0.018
 calmed+0.018
Neg logit contributions
ve-0.017
rie-0.014
iana-0.014
makers-0.012
ged-0.012
Token girl
Token ID2576
Feature activation+9.61e-02

Pos logit contributions
ve+0.047
rie+0.036
iana+0.036
makers+0.033
ged+0.033
Neg logit contributions
 cal-0.044
 alright-0.040
 swe-0.040
 lets-0.039
 ok-0.039
Token was
Token ID373
Feature activation+5.09e-03

Pos logit contributions
ve+0.146
rie+0.107
ged+0.100
iana+0.099
sel+0.091
Neg logit contributions
 cal-0.149
 alright-0.147
 swe-0.145
 ok-0.141
 hurricanes-0.140
Token tissue
Token ID10712
Feature activation-8.62e-02

Pos logit contributions
ve+0.053
rie+0.044
iana+0.043
makers+0.043
 ante+0.040
Neg logit contributions
 cal-0.034
 swe-0.032
 alright-0.030
 shall-0.030
 flour-0.030
Token and
Token ID290
Feature activation-1.85e-02

Pos logit contributions
 cal+0.101
 swe+0.090
 alright+0.089
 calmed+0.086
 ok+0.085
Neg logit contributions
ve-0.168
rie-0.137
iana-0.132
makers-0.127
ged-0.125
Token rubbed
Token ID31862
Feature activation-1.63e-01

Pos logit contributions
 cal+0.025
 alright+0.024
 seasons+0.023
 ok+0.023
 swe+0.022
Neg logit contributions
ve-0.030
rie-0.024
ged-0.023
iana-0.023
sel-0.022
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
 cal+0.185
 swe+0.170
 alright+0.168
 ok+0.160
 lets+0.157
Neg logit contributions
ve-0.321
rie-0.264
iana-0.254
makers-0.247
ged-0.240
Token scrib
Token ID44661
Feature activation+1.94e-01

Pos logit contributions
 swe+0.023
 cal+0.023
 flour+0.022
 lets+0.021
 alright+0.021
Neg logit contributions
ve-0.037
makers-0.030
rie-0.029
lves-0.029
iana-0.029
Tokenbles
Token ID7689
Feature activation+2.63e-01

Pos logit contributions
ve+0.269
rie+0.207
iana+0.196
makers+0.190
ged+0.180
Neg logit contributions
 cal-0.317
 alright-0.303
 swe-0.295
 ok-0.294
 calmed-0.283
Token away
Token ID1497
Feature activation+1.07e-01

Pos logit contributions
ve+0.429
rie+0.331
ged+0.317
iana+0.313
sel+0.311
Neg logit contributions
 cal-0.370
 alright-0.350
 pic-0.336
 swe-0.335
 ok-0.334
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
ve+0.195
rie+0.157
iana+0.151
makers+0.145
ged+0.145
Neg logit contributions
 cal-0.137
 swe-0.126
 alright-0.125
 ok-0.119
 lets-0.117
Token
Token ID220
Feature activation-3.53e-02

Pos logit contributions
 cal+0.027
 alright+0.025
 swe+0.025
 ok+0.024
 lets+0.023
Neg logit contributions
ve-0.035
rie-0.028
iana-0.027
ged-0.026
makers-0.026
Token
Token ID198
Feature activation-3.50e-03

Pos logit contributions
 cal+0.047
 swe+0.043
 alright+0.043
 ok+0.041
 lets+0.041
Neg logit contributions
ve-0.062
rie-0.050
iana-0.048
makers-0.046
ged-0.046
Token
Token ID198
Feature activation-3.34e-03

Pos logit contributions
 cal+0.003
 alright+0.003
 swe+0.003
 shall+0.003
 lets+0.003
Neg logit contributions
ve-0.007
rie-0.006
iana-0.006
sel-0.006
sed-0.006
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
 cal+0.129
 alright+0.118
 pic+0.118
 swe+0.116
 hurricanes+0.113
Neg logit contributions
ve-0.129
rie-0.101
ged-0.098
iana-0.094
makers-0.091
Token He
Token ID679
Feature activation+1.65e-02

Pos logit contributions
 alright+0.006
 cal+0.006
 swe+0.006
 ok+0.006
rises+0.006
Neg logit contributions
ve-0.003
rie-0.002
iana-0.002
ula-0.002
eg-0.002
Token saw
Token ID2497
Feature activation+1.21e-01

Pos logit contributions
ve+0.029
rie+0.023
iana+0.022
makers+0.022
ged+0.021
Neg logit contributions
 cal-0.022
 swe-0.020
 alright-0.020
 calmed-0.020
 lets-0.020
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
ve+0.243
rie+0.191
iana+0.180
makers+0.178
chie+0.175
Neg logit contributions
 cal-0.149
 swe-0.137
 lets-0.130
 calmed-0.126
 alright-0.125
Token new
Token ID649
Feature activation-1.04e-02

Pos logit contributions
 swe+0.023
 cal+0.022
 flour+0.022
 butter+0.021
 rub+0.021
Neg logit contributions
ve-0.024
ged-0.020
rie-0.018
makers-0.018
ley-0.018
Token er
Token ID1931
Feature activation+2.79e-01

Pos logit contributions
 swe+0.014
 rub+0.013
 seasons+0.013
 flour+0.013
 cal+0.012
Neg logit contributions
ve-0.017
ged-0.014
oor-0.014
eg-0.013
ging-0.013
Tokenaser
Token ID6005
Feature activation+1.53e-01

Pos logit contributions
 spine+0.434
ve+0.429
sed+0.406
makers+0.380
Ar+0.378
Neg logit contributions
brush-0.282
 cal-0.277
birds-0.267
 shall-0.261
 butter-0.261
Token in
Token ID287
Feature activation-4.14e-02

Pos logit contributions
ve+0.253
rie+0.196
ged+0.189
iana+0.188
makers+0.185
Neg logit contributions
 cal-0.217
 alright-0.202
 swe-0.200
 ok-0.192
 calmed-0.187
Token her
Token ID607
Feature activation+4.43e-02

Pos logit contributions
 cal+0.037
 swe+0.032
 lets+0.031
 flour+0.030
 pic+0.029
Neg logit contributions
ve-0.101
rie-0.080
makers-0.077
iana-0.075
lves-0.075
Token hand
Token ID1021
Feature activation+3.38e-02

Pos logit contributions
ve+0.102
makers+0.087
ged+0.079
rie+0.078
ging+0.078
Neg logit contributions
 cal-0.036
 ways-0.036
 swe-0.036
 lets-0.035
 wipes-0.034
Token.
Token ID13
Feature activation-2.34e-03

Pos logit contributions
ve+0.054
rie+0.043
ged+0.041
iana+0.041
makers+0.039
Neg logit contributions
 cal-0.049
 alright-0.045
 swe-0.045
 ok-0.043
 pic-0.042
Token He
Token ID679
Feature activation+2.22e-02

Pos logit contributions
 cal+0.008
 swe+0.007
 alright+0.007
 lets+0.007
 ok+0.007
Neg logit contributions
ve-0.011
rie-0.008
iana-0.008
makers-0.008
ged-0.008
Token grabs
Token ID22378
Feature activation+1.07e-01

Pos logit contributions
ve+0.043
rie+0.035
iana+0.034
makers+0.033
ged+0.033
Neg logit contributions
 cal-0.025
 swe-0.023
 alright-0.023
 ok-0.022
 lets-0.021
Token the
Token ID262
Feature activation-5.85e-03

Pos logit contributions
ve+0.242
rie+0.195
iana+0.191
makers+0.190
ucer+0.186
Neg logit contributions
 cal-0.106
 swe-0.100
 lets-0.089
 flour-0.086
 alright-0.079
Token pe
Token ID613
Feature activation+2.79e-02

Pos logit contributions
 swe+0.005
 cal+0.005
 lets+0.005
 flour+0.004
 snow+0.004
Neg logit contributions
ve-0.012
rie-0.012
ilda-0.011
iana-0.011
makers-0.011
Tokenb
Token ID65
Feature activation+1.81e-01

Pos logit contributions
ve+0.035
rie+0.028
iana+0.026
makers+0.025
ged+0.023
Neg logit contributions
 cal-0.049
 swe-0.046
 alright-0.045
 lets-0.045
 ok-0.044
Tokenbles
Token ID7689
Feature activation+2.65e-01

Pos logit contributions
ve+0.270
rie+0.251
iana+0.219
makers+0.219
lves+0.213
Neg logit contributions
 cal-0.240
 shall-0.234
 lets-0.232
 alright-0.230
 ok-0.228
Token from
Token ID422
Feature activation+6.29e-02

Pos logit contributions
ve+0.475
rie+0.391
iana+0.371
makers+0.355
ged+0.352
Neg logit contributions
 cal-0.340
 swe-0.313
 alright-0.310
 ok-0.297
 lets-0.296
Token Anna
Token ID11735
Feature activation-1.39e-02

Pos logit contributions
ve+0.146
rie+0.117
ucer+0.113
iana+0.113
makers+0.112
Neg logit contributions
 cal-0.061
 swe-0.056
 lets-0.049
 thunder-0.046
 calmed-0.046
Token's
Token ID338
Feature activation-5.43e-02

Pos logit contributions
 cal+0.017
 alright+0.016
 swe+0.016
 ok+0.015
 lets+0.015
Neg logit contributions
ve-0.026
rie-0.021
iana-0.020
makers-0.019
ged-0.019
Token hand
Token ID1021
Feature activation+4.75e-02

Pos logit contributions
 cal+0.068
 swe+0.064
 alright+0.061
 lets+0.060
 ok+0.060
Neg logit contributions
ve-0.100
rie-0.082
iana-0.077
makers-0.076
ged-0.075
Token and
Token ID290
Feature activation-1.99e-03

Pos logit contributions
ve+0.077
rie+0.061
ged+0.058
iana+0.057
makers+0.056
Neg logit contributions
 cal-0.069
 alright-0.063
 swe-0.063
 ok-0.060
 pic-0.060
Token sh
Token ID427
Feature activation-1.27e-01

Pos logit contributions
ve+0.051
rie+0.044
iana+0.041
makers+0.040
ged+0.038
Neg logit contributions
 cal-0.037
 swe-0.034
 alright-0.034
 ok-0.032
 calmed-0.032
Tokeniver
Token ID1428
Feature activation-1.52e-01

Pos logit contributions
 cal+0.106
 alright+0.094
 ok+0.086
 calmed+0.085
 swe+0.085
Neg logit contributions
ve-0.278
rie-0.239
iana-0.233
ged-0.225
makers-0.224
Token go
Token ID467
Feature activation-1.33e-01

Pos logit contributions
 cal+0.223
 swe+0.208
 alright+0.207
 lets+0.207
 pic+0.201
Neg logit contributions
ve-0.231
rie-0.182
ged-0.176
makers-0.172
iana-0.172
Token down
Token ID866
Feature activation-3.68e-02

Pos logit contributions
 cal+0.167
 alright+0.154
 swe+0.153
 ok+0.148
 lets+0.147
Neg logit contributions
ve-0.242
rie-0.194
iana-0.186
makers-0.181
ged-0.180
Token his
Token ID465
Feature activation+4.45e-02

Pos logit contributions
 cal+0.046
 swe+0.043
 alright+0.043
 ok+0.041
 lets+0.040
Neg logit contributions
ve-0.067
rie-0.054
iana-0.052
makers-0.050
ged-0.050
Token spine
Token ID19656
Feature activation+3.00e-01

Pos logit contributions
ve+0.068
rie+0.055
iana+0.054
sel+0.053
ged+0.051
Neg logit contributions
 swe-0.066
 cal-0.065
 snow-0.062
 seasons-0.061
 shade-0.060
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
ve+0.596
rie+0.497
iana+0.483
makers+0.463
eg+0.448
Neg logit contributions
 cal-0.320
 alright-0.294
 swe-0.293
 ok-0.277
 calmed-0.276
Token He
Token ID679
Feature activation+1.38e-02

Pos logit contributions
 cal+0.027
 swe+0.024
 alright+0.024
 lets+0.023
 cleans+0.023
Neg logit contributions
ve-0.019
rie-0.014
iana-0.014
eg-0.013
ged-0.013
Token looked
Token ID3114
Feature activation-4.11e-02

Pos logit contributions
ve+0.021
rie+0.016
ged+0.015
iana+0.015
ley+0.014
Neg logit contributions
 cal-0.021
 alright-0.020
 swe-0.020
 ok-0.019
 lets-0.019
Token around
Token ID1088
Feature activation-3.42e-02

Pos logit contributions
 cal+0.053
 swe+0.049
 alright+0.049
 lets+0.048
 ok+0.047
Neg logit contributions
ve-0.072
rie-0.057
iana-0.054
ged-0.054
makers-0.052
Token but
Token ID475
Feature activation-3.77e-04

Pos logit contributions
 cal+0.038
 alright+0.034
 swe+0.034
 ok+0.032
 calmed+0.031
Neg logit contributions
ve-0.069
rie-0.059
iana-0.056
sel-0.053
makers-0.053
Token He
Token ID679
Feature activation+1.63e-02

Pos logit contributions
 cal+0.017
 swe+0.016
 alright+0.015
 lets+0.015
 calmed+0.015
Neg logit contributions
ve-0.019
rie-0.016
iana-0.015
makers-0.014
ged-0.014
Token tug
Token ID27762
Feature activation+1.02e-01

Pos logit contributions
ve+0.029
rie+0.023
iana+0.022
makers+0.021
ged+0.021
Neg logit contributions
 cal-0.021
 swe-0.020
 alright-0.020
 ok-0.019
 lets-0.019
Tokenged
Token ID2004
Feature activation+3.66e-01

Pos logit contributions
ve+0.082
rie+0.063
inky+0.058
iana+0.058
 ante+0.053
Neg logit contributions
 calmed-0.207
 cal-0.207
 alright-0.204
 lets-0.198
 ok-0.198
Token and
Token ID290
Feature activation-9.43e-03

Pos logit contributions
ve+0.641
ged+0.499
rie+0.493
iana+0.473
ging+0.465
Neg logit contributions
 cal-0.492
 swe-0.464
 alright-0.446
 seasons-0.440
 pic-0.436
Token tug
Token ID27762
Feature activation+1.02e-01

Pos logit contributions
 cal+0.011
 alright+0.011
 calmed+0.010
 swe+0.010
 ok+0.010
Neg logit contributions
ve-0.018
rie-0.015
iana-0.014
makers-0.014
eg-0.013
Tokenged
Token ID2004
Feature activation+3.66e-01

Pos logit contributions
ve+0.075
rie+0.058
iana+0.050
 ante+0.044
eg+0.043
Neg logit contributions
 alright-0.219
 cal-0.218
 calmed-0.214
 ok-0.213
 swe-0.208
Token,
Token ID11
Feature activation-8.57e-04

Pos logit contributions
ve+0.599
ged+0.458
rie+0.440
ging+0.426
iana+0.420
Neg logit contributions
 cal-0.539
 swe-0.509
 pic-0.490
 seasons-0.488
 alright-0.481
Token but
Token ID475
Feature activation+3.14e-03

Pos logit contributions
 cal+0.001
 alright+0.001
 swe+0.001
 ok+0.001
 calmed+0.001
Neg logit contributions
ve-0.002
rie-0.001
iana-0.001
makers-0.001
ged-0.001
Token he
Token ID339
Feature activation+2.82e-02

Pos logit contributions
ve+0.005
rie+0.004
iana+0.004
ged+0.004
makers+0.004
Neg logit contributions
 cal-0.004
 swe-0.004
 alright-0.004
 lets-0.004
 ok-0.004
Token couldn
Token ID3521
Feature activation+7.95e-02

Pos logit contributions
ve+0.044
rie+0.034
iana+0.033
makers+0.031
ged+0.031
Neg logit contributions
 cal-0.043
 swe-0.040
 alright-0.040
 ok-0.038
 lets-0.038
Token't
Token ID470
Feature activation+1.60e-01

Pos logit contributions
ve+0.091
rie+0.064
iana+0.059
makers+0.055
ged+0.055
Neg logit contributions
 cal-0.153
 swe-0.145
 alright-0.145
 ok-0.141
 lets-0.140
Token's
Token ID338
Feature activation-6.24e-02

Pos logit contributions
ve+0.099
rie+0.079
iana+0.074
makers+0.071
ged+0.070
Neg logit contributions
 cal-0.083
 alright-0.075
 swe-0.074
 calmed-0.073
 ok-0.072
Token house
Token ID2156
Feature activation-1.50e-02

Pos logit contributions
 alright+0.092
 cal+0.092
 calmed+0.089
 ok+0.089
 lets+0.085
Neg logit contributions
ve-0.084
iana-0.067
rie-0.066
 ante-0.061
 cash-0.061
Token and
Token ID290
Feature activation-1.30e-04

Pos logit contributions
 cal+0.020
 alright+0.019
 swe+0.019
 ok+0.018
 lets+0.018
Neg logit contributions
ve-0.025
rie-0.020
iana-0.019
makers-0.019
ged-0.018
Token bring
Token ID2222
Feature activation-1.00e-02

Pos logit contributions
 cal+0.000
 alright+0.000
 swe+0.000
 ok+0.000
 calmed+0.000
Neg logit contributions
ve-0.000
rie-0.000
iana-0.000
makers-0.000
ged-0.000
Token her
Token ID607
Feature activation+4.77e-02

Pos logit contributions
 cal+0.013
 swe+0.012
 alright+0.012
 ok+0.011
 lets+0.011
Neg logit contributions
ve-0.018
rie-0.015
iana-0.014
ged-0.014
makers-0.014
Token your
Token ID534
Feature activation+6.29e-02

Pos logit contributions
ve+0.062
rie+0.046
ucer+0.043
iana+0.042
 whole+0.042
Neg logit contributions
 cal-0.074
 alright-0.071
 calmed-0.069
 ok-0.068
 lets-0.067
Token dictionary
Token ID22155
Feature activation-3.71e-02

Pos logit contributions
ve+0.093
rie+0.070
iana+0.068
makers+0.064
sed+0.064
Neg logit contributions
 cal-0.096
 alright-0.090
 calmed-0.087
 ok-0.086
 lets-0.085
Token.
Token ID13
Feature activation-4.72e-03

Pos logit contributions
 cal+0.052
 alright+0.048
 swe+0.046
 ok+0.046
 calmed+0.046
Neg logit contributions
ve-0.061
rie-0.049
iana-0.046
makers-0.045
ged-0.044
Token Maybe
Token ID6674
Feature activation-9.47e-02

Pos logit contributions
 cal+0.008
 calmed+0.007
 swe+0.007
 alright+0.007
 hurricanes+0.007
Neg logit contributions
ve-0.006
rie-0.005
oys-0.005
makers-0.004
iana-0.004
Token you
Token ID345
Feature activation+1.60e-02

Pos logit contributions
 cal+0.152
 alright+0.136
 calmed+0.134
 ok+0.130
 swe+0.130
Neg logit contributions
ve-0.134
rie-0.105
iana-0.103
oys-0.095
makers-0.094
Token can
Token ID460
Feature activation-1.14e-01

Pos logit contributions
ve+0.034
rie+0.028
iana+0.026
makers+0.026
sel+0.025
Neg logit contributions
 cal-0.016
 swe-0.015
 alright-0.014
 ok-0.013
 calmed-0.013
Token She
Token ID1375
Feature activation-2.12e-02

Pos logit contributions
 cal+0.017
 alright+0.017
 ok+0.016
 hurricanes+0.015
 swe+0.015
Neg logit contributions
ve-0.008
rie-0.005
iana-0.005
 center-0.005
da-0.005
Token felt
Token ID2936
Feature activation+7.06e-02

Pos logit contributions
 cal+0.021
 swe+0.021
 lets+0.020
 calmed+0.020
 alright+0.019
Neg logit contributions
ve-0.041
rie-0.035
makers-0.033
iana-0.033
ging-0.032
Token very
Token ID845
Feature activation+1.02e-01

Pos logit contributions
ve+0.120
rie+0.095
iana+0.091
makers+0.088
ged+0.087
Neg logit contributions
 cal-0.097
 swe-0.090
 alright-0.089
 lets-0.085
 ok-0.085
Token lucky
Token ID9670
Feature activation+7.23e-02

Pos logit contributions
ve+0.189
rie+0.157
iana+0.149
ged+0.144
makers+0.144
Neg logit contributions
 cal-0.125
 alright-0.115
 swe-0.115
 ok-0.110
 lets-0.107
Token to
Token ID284
Feature activation-3.27e-02

Pos logit contributions
ve+0.115
rie+0.092
iana+0.088
makers+0.086
ged+0.085
Neg logit contributions
 cal-0.108
 alright-0.097
 swe-0.096
 lets-0.094
 ok-0.093
Token find
Token ID1064
Feature activation+1.95e-01

Pos logit contributions
 cal+0.035
 lets+0.034
 shall+0.033
 swe+0.032
 alright+0.032
Neg logit contributions
ve-0.065
rie-0.053
lves-0.051
ged-0.051
ucer-0.050
Token them
Token ID606
Feature activation+1.89e-02

Pos logit contributions
ve+0.324
rie+0.261
iana+0.249
ged+0.239
makers+0.238
Neg logit contributions
 cal-0.267
 alright-0.245
 swe-0.244
 ok-0.236
 lets-0.234
Token.
Token ID13
Feature activation-7.31e-03

Pos logit contributions
ve+0.029
rie+0.022
iana+0.021
makers+0.020
ged+0.020
Neg logit contributions
 cal-0.030
 calmed-0.026
 lets-0.026
 nods-0.026
 swe-0.026
Token She
Token ID1375
Feature activation-2.06e-02

Pos logit contributions
 cal+0.016
 alright+0.015
 swe+0.014
 ok+0.014
 colder+0.014
Neg logit contributions
ve-0.007
 These-0.004
da-0.004
 This-0.004
iana-0.003
Token wanted
Token ID2227
Feature activation+2.93e-03

Pos logit contributions
 cal+0.028
 alright+0.025
 swe+0.025
 ok+0.024
 lets+0.024
Neg logit contributions
ve-0.036
rie-0.029
iana-0.028
ged-0.027
makers-0.027
Token to
Token ID284
Feature activation-3.26e-02

Pos logit contributions
ve+0.005
rie+0.004
iana+0.004
makers+0.004
ged+0.003
Neg logit contributions
 cal-0.004
 alright-0.004
 swe-0.004
 ok-0.004
 lets-0.004
Token The
Token ID383
Feature activation+2.54e-02

Pos logit contributions
 cal+0.033
 alright+0.032
 swe+0.032
 ok+0.030
 hurricanes+0.029
Neg logit contributions
ve-0.026
rie-0.020
iana-0.019
eg-0.018
 ante-0.017
Token frog
Token ID21264
Feature activation-2.16e-02

Pos logit contributions
ve+0.044
rie+0.035
iana+0.034
makers+0.032
ged+0.032
Neg logit contributions
 cal-0.034
 alright-0.032
 swe-0.031
 ok-0.031
 lets-0.030
Token cro
Token ID6763
Feature activation+5.23e-02

Pos logit contributions
 cal+0.031
 swe+0.029
 alright+0.029
 ok+0.028
 lets+0.027
Neg logit contributions
ve-0.036
rie-0.028
iana-0.027
ged-0.026
makers-0.026
Tokenaked
Token ID4335
Feature activation+2.52e-02

Pos logit contributions
ve+0.094
rie+0.076
iana+0.074
makers+0.071
ged+0.069
Neg logit contributions
 cal-0.066
 alright-0.060
 swe-0.059
 lets-0.057
 calmed-0.057
Token excited
Token ID6568
Feature activation-7.68e-02

Pos logit contributions
ve+0.042
rie+0.032
ged+0.032
iana+0.031
makers+0.030
Neg logit contributions
 cal-0.035
 alright-0.033
 swe-0.033
 lets-0.033
 seasons-0.032
Tokenly
Token ID306
Feature activation+1.93e-01

Pos logit contributions
 cal+0.136
 swe+0.129
 alright+0.128
 ok+0.124
 calmed+0.123
Neg logit contributions
ve-0.101
rie-0.074
iana-0.070
makers-0.065
ged-0.065
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
ve+0.347
rie+0.282
iana+0.271
makers+0.260
ged+0.258
Neg logit contributions
 cal-0.245
 swe-0.225
 alright-0.225
 ok-0.215
 lets-0.213
Token put
Token ID1234
Feature activation-1.06e-02

Pos logit contributions
 cal+0.017
 alright+0.016
 swe+0.016
 ok+0.015
 lets+0.015
Neg logit contributions
ve-0.025
rie-0.020
iana-0.019
ged-0.019
makers-0.019
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
 cal+0.010
 swe+0.009
 alright+0.009
 lets+0.009
 ok+0.008
Neg logit contributions
ve-0.023
rie-0.020
iana-0.018
makers-0.018
oys-0.018
Token bracelet
Token ID42893
Feature activation-1.11e-02

Pos logit contributions
 cal+0.022
 swe+0.022
 alright+0.021
 ok+0.020
 shall+0.019
Neg logit contributions
ve-0.047
rie-0.041
makers-0.038
iana-0.038
eg-0.037
Token around
Token ID1088
Feature activation-3.75e-02

Pos logit contributions
 cal+0.015
 alright+0.014
 swe+0.014
 lets+0.013
 calmed+0.013
Neg logit contributions
ve-0.019
rie-0.015
iana-0.015
ged-0.014
makers-0.014
Token,
Token ID11
Feature activation+3.29e-03

Pos logit contributions
 cal+0.080
 alright+0.074
 swe+0.073
 ok+0.069
 lets+0.069
Neg logit contributions
ve-0.129
rie-0.107
iana-0.102
eg-0.099
ged-0.099
Token she
Token ID673
Feature activation-1.73e-03

Pos logit contributions
ve+0.006
rie+0.005
iana+0.005
makers+0.004
ged+0.004
Neg logit contributions
 cal-0.004
 swe-0.004
 alright-0.004
 ok-0.004
 lets-0.004
Token wanted
Token ID2227
Feature activation+4.85e-03

Pos logit contributions
 cal+0.002
 swe+0.002
 alright+0.002
 ok+0.002
 lets+0.002
Neg logit contributions
ve-0.003
rie-0.002
iana-0.002
ged-0.002
makers-0.002
Token to
Token ID284
Feature activation-3.07e-02

Pos logit contributions
ve+0.007
iana+0.005
rie+0.005
makers+0.005
ged+0.005
Neg logit contributions
 cal-0.007
 alright-0.007
 swe-0.007
 cleans-0.007
 nods-0.007
Token call
Token ID869
Feature activation+5.54e-02

Pos logit contributions
 cal+0.047
 swe+0.045
 alright+0.044
 ok+0.042
 flour+0.042
Neg logit contributions
ve-0.046
rie-0.034
iana-0.033
makers-0.032
eg-0.030
Token out
Token ID503
Feature activation+5.98e-02

Pos logit contributions
ve+0.088
rie+0.069
iana+0.067
makers+0.064
ged+0.064
Neg logit contributions
 cal-0.080
 alright-0.073
 swe-0.073
 lets-0.071
 calmed-0.071
Token to
Token ID284
Feature activation-3.05e-02

Pos logit contributions
ve+0.096
rie+0.073
iana+0.071
makers+0.069
ged+0.067
Neg logit contributions
 cal-0.084
 alright-0.078
 swe-0.077
 lets-0.077
 calmed-0.077
Token them
Token ID606
Feature activation+2.01e-02

Pos logit contributions
 cal+0.040
 swe+0.037
 alright+0.037
 ok+0.035
 lets+0.034
Neg logit contributions
ve-0.054
rie-0.043
iana-0.042
makers-0.040
ged-0.040
Token but
Token ID475
Feature activation+6.87e-03

Pos logit contributions
ve+0.033
rie+0.025
iana+0.024
makers+0.023
ged+0.023
Neg logit contributions
 cal-0.029
 swe-0.027
 alright-0.026
 calmed-0.026
 lets-0.026
Token she
Token ID673
Feature activation-7.95e-04

Pos logit contributions
ve+0.012
rie+0.010
iana+0.010
makers+0.009
ged+0.009
Neg logit contributions
 cal-0.009
 alright-0.008
 swe-0.008
 ok-0.008
 lets-0.008
Token was
Token ID373
Feature activation+1.06e-02

Pos logit contributions
 cal+0.001
 alright+0.001
 swe+0.001
 ok+0.001
 calmed+0.001
Neg logit contributions
ve-0.001
rie-0.001
iana-0.001
makers-0.001
ged-0.001
Token quickly
Token ID2952
Feature activation+4.03e-03

Pos logit contributions
 cal+0.000
 swe+0.000
 alright+0.000
 ok+0.000
 lets+0.000
Neg logit contributions
ve-0.000
rie-0.000
iana-0.000
ged-0.000
makers-0.000
Token took
Token ID1718
Feature activation-2.66e-03

Pos logit contributions
ve+0.007
rie+0.006
ged+0.005
iana+0.005
sel+0.005
Neg logit contributions
 cal-0.005
 alright-0.005
 swe-0.005
 ok-0.005
 shall-0.005
Token out
Token ID503
Feature activation+4.85e-02

Pos logit contributions
 cal+0.004
 swe+0.004
 alright+0.004
 lets+0.004
 ok+0.004
Neg logit contributions
ve-0.004
rie-0.003
iana-0.003
ged-0.003
makers-0.003
Token the
Token ID262
Feature activation-2.90e-02

Pos logit contributions
ve+0.070
rie+0.054
iana+0.052
ged+0.048
makers+0.047
Neg logit contributions
 cal-0.074
 alright-0.072
 swe-0.070
 lets-0.069
 ok-0.069
Token pe
Token ID613
Feature activation+1.78e-02

Pos logit contributions
 cal+0.035
 swe+0.031
 alright+0.031
 lets+0.030
 calmed+0.030
Neg logit contributions
ve-0.056
rie-0.044
iana-0.042
makers-0.042
ged-0.040
Tokenb
Token ID65
Feature activation+1.82e-01

Pos logit contributions
ve+0.028
rie+0.022
iana+0.021
makers+0.020
ged+0.020
Neg logit contributions
 cal-0.027
 swe-0.025
 alright-0.025
 ok-0.024
 lets-0.024
Tokenble
Token ID903
Feature activation+2.58e-02

Pos logit contributions
ve+0.284
rie+0.238
iana+0.219
makers+0.217
ged+0.210
Neg logit contributions
 cal-0.256
 alright-0.244
 ok-0.235
 swe-0.233
 shall-0.232
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
ve+0.044
rie+0.035
iana+0.033
ged+0.032
makers+0.032
Neg logit contributions
 cal-0.035
 alright-0.033
 swe-0.032
 lets-0.031
 ok-0.031
Token p
Token ID279
Feature activation-1.16e-01

Pos logit contributions
 cal+0.020
 alright+0.020
 swe+0.019
 seasons+0.019
 ok+0.019
Neg logit contributions
ve-0.025
rie-0.020
ged-0.019
iana-0.019
makers-0.018
Tokenatted
Token ID16898
Feature activation-1.22e-01

Pos logit contributions
 cal+0.124
 swe+0.112
 alright+0.112
 ok+0.106
 lets+0.105
Neg logit contributions
ve-0.233
rie-0.193
iana-0.186
makers-0.180
ged-0.180
Token it
Token ID340
Feature activation-5.23e-02

Pos logit contributions
 cal+0.156
 alright+0.145
 swe+0.145
 lets+0.139
 ok+0.138
Neg logit contributions
ve-0.212
rie-0.169
ged-0.164
iana-0.162
makers-0.159
Token finally
Token ID3443
Feature activation+3.85e-05

Pos logit contributions
 cal+0.041
 alright+0.038
 ok+0.037
 shall+0.036
 nod+0.036
Neg logit contributions
ve-0.077
rie-0.068
ged-0.062
lves-0.061
ucer-0.061
Token have
Token ID423
Feature activation+3.78e-02

Pos logit contributions
ve+0.000
rie+0.000
iana+0.000
makers+0.000
ged+0.000
Neg logit contributions
 cal-0.000
 alright-0.000
 swe-0.000
 ok-0.000
 lets-0.000
Token some
Token ID617
Feature activation-2.65e-02

Pos logit contributions
ve+0.054
sel+0.040
iana+0.039
ged+0.038
makers+0.038
Neg logit contributions
 cal-0.056
 swe-0.052
 nods-0.051
rises-0.051
 alright-0.051
Token tasty
Token ID25103
Feature activation-1.15e-01

Pos logit contributions
 cal+0.035
Fly+0.033
 slows+0.033
 Fog+0.033
rises+0.033
Neg logit contributions
ve-0.043
iana-0.029
 for-0.028
makers-0.028
sel-0.027
Token vegetables
Token ID13701
Feature activation-2.03e-01

Pos logit contributions
 cal+0.159
 calmed+0.146
 alright+0.145
 slows+0.142
 ok+0.140
Neg logit contributions
ve-0.189
iana-0.135
ged-0.129
 for-0.127
sel-0.126
Token to
Token ID284
Feature activation-3.71e-02

Pos logit contributions
 cal+0.257
 swe+0.236
 alright+0.236
 ok+0.225
 lets+0.223
Neg logit contributions
ve-0.365
rie-0.295
iana-0.283
makers-0.273
ged-0.272
Token buy
Token ID2822
Feature activation+1.32e-01

Pos logit contributions
 cal+0.051
 swe+0.047
 alright+0.047
 ok+0.045
 lets+0.045
Neg logit contributions
ve-0.063
rie-0.050
iana-0.048
makers-0.046
ged-0.046
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
ve+0.221
iana+0.167
rie+0.166
makers+0.160
sel+0.158
Neg logit contributions
 cal-0.179
 alright-0.163
 swe-0.162
 lets-0.160
 calmed-0.160
Token They
Token ID1119
Feature activation+2.03e-02

Pos logit contributions
 cal+0.015
 alright+0.014
 swe+0.014
 lets+0.013
 ok+0.013
Neg logit contributions
ve-0.022
rie-0.018
iana-0.017
ged-0.016
makers-0.016
Token ate
Token ID15063
Feature activation-1.19e-02

Pos logit contributions
ve+0.035
rie+0.029
makers+0.027
iana+0.026
ged+0.026
Neg logit contributions
 cal-0.028
 swe-0.025
 alright-0.025
 calmed-0.024
 lets-0.024
Token them
Token ID606
Feature activation+1.12e-02

Pos logit contributions
 cal+0.018
 alright+0.017
 lets+0.017
 ok+0.016
 calmed+0.016
Neg logit contributions
ve-0.017
ged-0.012
iana-0.012
makers-0.012
sel-0.012
Token to
Token ID284
Feature activation-3.78e-02

Pos logit contributions
ve+0.102
rie+0.084
iana+0.082
ged+0.080
makers+0.079
Neg logit contributions
 cal-0.044
 swe-0.043
 alright-0.039
 lets-0.038
 shall-0.037
Token be
Token ID307
Feature activation+1.26e-01

Pos logit contributions
 shall+0.034
 swe+0.034
 cal+0.033
 lets+0.031
 nod+0.031
Neg logit contributions
ve-0.082
rie-0.070
ged-0.069
ging-0.067
makers-0.067
Token a
Token ID257
Feature activation+2.96e-02

Pos logit contributions
ve+0.248
rie+0.200
iana+0.196
ged+0.195
makers+0.186
Neg logit contributions
 cal-0.147
 swe-0.138
 alright-0.134
 ok-0.130
 okay-0.123
Token witch
Token ID16365
Feature activation+4.71e-02

Pos logit contributions
ve+0.054
ged+0.051
ging+0.045
rie+0.044
ops+0.043
Neg logit contributions
 cal-0.036
 swe-0.035
 pic-0.033
 shall-0.032
 lets-0.031
Token!"
Token ID2474
Feature activation+1.16e-01

Pos logit contributions
ve+0.077
rie+0.061
iana+0.058
makers+0.056
ged+0.055
Neg logit contributions
 cal-0.067
 swe-0.062
 alright-0.062
 ok-0.060
 lets-0.059
Token Lily
Token ID20037
Feature activation+4.91e-02

Pos logit contributions
ve+0.246
rie+0.196
makers+0.192
ucer+0.188
chie+0.184
Neg logit contributions
 swe-0.121
 cal-0.120
 lets-0.112
 calmed-0.110
 alright-0.106
Token said
Token ID531
Feature activation+4.99e-02

Pos logit contributions
ve+0.088
rie+0.070
iana+0.068
makers+0.066
ged+0.065
Neg logit contributions
 cal-0.063
 swe-0.059
 alright-0.058
 ok-0.056
 lets-0.055
Token.
Token ID13
Feature activation-1.79e-02

Pos logit contributions
ve+0.074
ged+0.057
rie+0.055
iana+0.053
makers+0.052
Neg logit contributions
 cal-0.080
 pic-0.077
 seasons-0.076
 lets-0.073
 swe-0.073
Token She
Token ID1375
Feature activation-2.45e-02

Pos logit contributions
 cal+0.023
 swe+0.021
 alright+0.021
 ok+0.020
 lets+0.020
Neg logit contributions
ve-0.032
rie-0.026
iana-0.025
makers-0.024
ged-0.024
Token put
Token ID1234
Feature activation-7.56e-03

Pos logit contributions
 cal+0.033
 alright+0.031
 swe+0.031
 ok+0.029
 hurricanes+0.029
Neg logit contributions
ve-0.043
rie-0.034
iana-0.033
ged-0.031
sel-0.030
Token on
Token ID319
Feature activation-4.16e-02

Pos logit contributions
 cal+0.009
 swe+0.008
 alright+0.008
 lets+0.008
 ok+0.008
Neg logit contributions
ve-0.014
rie-0.012
makers-0.011
iana-0.011
ged-0.011
Token go
Token ID467
Feature activation-1.25e-01

Pos logit contributions
 cal+0.033
 alright+0.032
 swe+0.031
 ok+0.030
 hurricanes+0.029
Neg logit contributions
ve-0.042
rie-0.034
iana-0.033
ged-0.030
makers-0.030
Token to
Token ID284
Feature activation-3.97e-02

Pos logit contributions
 cal+0.141
 swe+0.126
 alright+0.124
 lets+0.120
 calmed+0.118
Neg logit contributions
ve-0.245
rie-0.204
iana-0.197
makers-0.187
ged-0.187
Token Lily
Token ID20037
Feature activation+5.26e-02

Pos logit contributions
 cal+0.030
 swe+0.025
 nod+0.024
 thunder+0.023
 pic+0.022
Neg logit contributions
ve-0.100
rie-0.084
sel-0.082
ged-0.079
ucer-0.079
Token's
Token ID338
Feature activation-6.53e-02

Pos logit contributions
ve+0.101
rie+0.084
iana+0.080
makers+0.078
ged+0.077
Neg logit contributions
 cal-0.060
 alright-0.055
 swe-0.055
 ok-0.052
 lets-0.051
Token room
Token ID2119
Feature activation-6.84e-02

Pos logit contributions
 cal+0.088
 alright+0.086
 ok+0.082
 swe+0.080
 calmed+0.079
Neg logit contributions
ve-0.106
rie-0.087
iana-0.085
makers-0.079
eg-0.077
Token.
Token ID13
Feature activation-7.80e-03

Pos logit contributions
 cal+0.082
 swe+0.075
 alright+0.075
 ok+0.071
 calmed+0.070
Neg logit contributions
ve-0.128
rie-0.105
iana-0.101
makers-0.097
ged-0.096
Token Lily
Token ID20037
Feature activation+5.30e-02

Pos logit contributions
 cal+0.011
 swe+0.010
 alright+0.010
 ok+0.009
 lets+0.009
Neg logit contributions
ve-0.014
rie-0.011
iana-0.010
makers-0.010
ged-0.010
Token gives
Token ID3607
Feature activation-1.03e-01

Pos logit contributions
ve+0.073
rie+0.058
iana+0.053
makers+0.048
ged+0.047
Neg logit contributions
 alright-0.084
 ok-0.083
 cal-0.079
 swe-0.077
 okay-0.076
Token Ben
Token ID3932
Feature activation+4.04e-02

Pos logit contributions
 cal+0.154
 alright+0.145
 ok+0.139
 swe+0.139
 calmed+0.137
Neg logit contributions
ve-0.157
rie-0.127
iana-0.123
ged-0.116
makers-0.113
Token a
Token ID257
Feature activation+3.39e-02

Pos logit contributions
ve+0.061
rie+0.049
iana+0.046
ged+0.044
makers+0.044
Neg logit contributions
 cal-0.059
 alright-0.057
 swe-0.054
 ok-0.054
 shall-0.053
Token pair
Token ID5166
Feature activation+1.25e-01

Pos logit contributions
ve+0.056
rie+0.045
iana+0.043
eg+0.041
ucer+0.040
Neg logit contributions
 cal-0.046
 alright-0.045
 ok-0.042
 swe-0.041
 lets-0.041
Token of
Token ID286
Feature activation-7.81e-02

Pos logit contributions
ve+0.221
rie+0.184
oys+0.179
ula+0.173
iana+0.168
Neg logit contributions
 cal-0.217
 alright-0.203
 swe-0.196
 ok-0.190
 nods-0.190
Token his
Token ID465
Feature activation+3.51e-02

Pos logit contributions
 cal+0.094
 swe+0.086
 alright+0.084
 lets+0.082
 ok+0.081
Neg logit contributions
ve-0.152
rie-0.124
iana-0.116
makers-0.115
ged-0.110
Token hat
Token ID6877
Feature activation-3.36e-02

Pos logit contributions
ve+0.061
rie+0.048
iana+0.046
ged+0.044
makers+0.044
Neg logit contributions
 cal-0.048
 alright-0.044
 swe-0.044
 ok-0.042
 lets-0.042
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
 cal+0.040
 swe+0.037
 alright+0.037
 ok+0.035
 lets+0.035
Neg logit contributions
ve-0.062
rie-0.051
iana-0.049
makers-0.047
ged-0.047
Token He
Token ID679
Feature activation+5.13e-03

Pos logit contributions
 cal+0.046
 alright+0.046
 swe+0.044
 ok+0.043
 calmed+0.041
Neg logit contributions
ve-0.016
eg-0.014
rie-0.012
 ante-0.012
 These-0.011
Token went
Token ID1816
Feature activation-1.11e-01

Pos logit contributions
ve+0.009
rie+0.007
iana+0.006
ged+0.006
makers+0.006
Neg logit contributions
 cal-0.007
 alright-0.007
 swe-0.007
 ok-0.007
 lets-0.006
Token to
Token ID284
Feature activation-4.32e-02

Pos logit contributions
 cal+0.098
 swe+0.090
 lets+0.080
 alright+0.079
 calmed+0.075
Neg logit contributions
ve-0.242
rie-0.204
iana-0.200
sel-0.195
makers-0.193
Token a
Token ID257
Feature activation+2.18e-02

Pos logit contributions
 cal+0.055
 swe+0.050
 alright+0.049
 lets+0.048
 ok+0.047
Neg logit contributions
ve-0.080
rie-0.065
iana-0.062
makers-0.060
ged-0.060
Token park
Token ID3952
Feature activation-4.61e-02

Pos logit contributions
ve+0.036
rie+0.029
iana+0.027
makers+0.026
ged+0.026
Neg logit contributions
 cal-0.030
 alright-0.028
 swe-0.028
 ok-0.027
 lets-0.027
Token and
Token ID290
Feature activation-2.13e-02

Pos logit contributions
 cal+0.048
 lets+0.042
 calmed+0.041
 swe+0.041
 shall+0.040
Neg logit contributions
ve-0.095
rie-0.079
iana-0.077
ged-0.073
sel-0.073
Token saw
Token ID2497
Feature activation+1.16e-01

Pos logit contributions
 cal+0.026
 lets+0.024
 calmed+0.024
 swe+0.024
 shall+0.023
Neg logit contributions
ve-0.039
rie-0.031
iana-0.030
makers-0.030
ula-0.029
Token
Token ID198
Feature activation+9.43e-03

Pos logit contributions
ve+0.018
rie+0.016
iana+0.015
ged+0.014
makers+0.014
Neg logit contributions
 cal-0.011
 swe-0.009
 alright-0.009
 lets-0.009
 shall-0.009
TokenJ
Token ID41
Feature activation+1.50e-01

Pos logit contributions
ve+0.017
rie+0.014
iana+0.014
ged+0.013
makers+0.013
Neg logit contributions
 cal-0.012
 swe-0.011
 alright-0.011
 lets-0.010
 shall-0.010
Tokenenny
Token ID11870
Feature activation+2.14e-01

Pos logit contributions
ve+0.189
rie+0.131
makers+0.125
iana+0.125
ged+0.125
Neg logit contributions
 cal-0.262
 alright-0.249
 swe-0.245
 shall-0.244
 lets-0.242
Token was
Token ID373
Feature activation+7.55e-03

Pos logit contributions
ve+0.375
rie+0.302
iana+0.289
makers+0.278
ged+0.278
Neg logit contributions
 cal-0.280
 swe-0.259
 alright-0.259
 ok-0.247
 lets-0.244
Token happy
Token ID3772
Feature activation-1.55e-02

Pos logit contributions
ve+0.012
rie+0.009
iana+0.009
makers+0.009
ged+0.009
Neg logit contributions
 cal-0.011
 swe-0.010
 alright-0.010
 lets-0.010
 ok-0.010
Token and
Token ID290
Feature activation-8.54e-03

Pos logit contributions
 cal+0.018
 alright+0.016
 swe+0.016
 calmed+0.016
 ok+0.015
Neg logit contributions
ve-0.030
rie-0.025
iana-0.024
sel-0.023
eg-0.023
Token decided
Token ID3066
Feature activation-8.75e-02

Pos logit contributions
 cal+0.011
 swe+0.010
 alright+0.010
 ok+0.009
 lets+0.009
Neg logit contributions
ve-0.016
rie-0.013
iana-0.012
makers-0.012
ged-0.012
Token to
Token ID284
Feature activation-3.30e-02

Pos logit contributions
 cal+0.088
 swe+0.080
 alright+0.079
 ok+0.075
 lets+0.075
Neg logit contributions
ve-0.182
rie-0.153
iana-0.146
ged-0.141
makers-0.141
Token write
Token ID3551
Feature activation+1.86e-02

Pos logit contributions
 cal+0.049
 swe+0.046
 alright+0.046
 ok+0.043
 calmed+0.042
Neg logit contributions
ve-0.051
rie-0.041
iana-0.040
makers-0.037
eg-0.036
Token a
Token ID257
Feature activation+3.38e-02

Pos logit contributions
ve+0.031
rie+0.025
iana+0.023
ged+0.023
makers+0.022
Neg logit contributions
 cal-0.026
 swe-0.024
 alright-0.024
 lets-0.023
 calmed-0.023
Token letter
Token ID3850
Feature activation-2.85e-02

Pos logit contributions
ve+0.064
rie+0.053
iana+0.051
ged+0.050
makers+0.050
Neg logit contributions
 cal-0.041
 swe-0.038
 alright-0.037
 shall-0.036
 ok-0.036
Token water
Token ID1660
Feature activation-6.97e-02

Pos logit contributions
 alright+0.024
 cal+0.023
 ok+0.023
adows+0.023
 hurricanes+0.022
Neg logit contributions
ve-0.035
ula-0.026
ged-0.026
 whole-0.025
 for-0.025
Token and
Token ID290
Feature activation-4.50e-06

Pos logit contributions
 cal+0.106
 alright+0.104
 ok+0.100
 swe+0.095
 lets+0.095
Neg logit contributions
ve-0.101
ged-0.077
rie-0.076
iana-0.075
makers-0.072
Token ate
Token ID15063
Feature activation-8.35e-03

Feature not active on prev token;
no predictions were affected.
Token some
Token ID617
Feature activation-2.04e-02

Pos logit contributions
 alright+0.014
 cal+0.014
 ok+0.014
adows+0.013
 calmed+0.013
Neg logit contributions
ve-0.009
ged-0.007
ula-0.007
 for-0.007
makers-0.006
Token more
Token ID517
Feature activation-7.18e-02

Pos logit contributions
 alright+0.026
 ok+0.025
 cal+0.024
 hurricanes+0.023
 calmed+0.023
Neg logit contributions
ve-0.033
ula-0.025
 for-0.024
makers-0.024
ged-0.023
Token cookies
Token ID14746
Feature activation-2.13e-01

Pos logit contributions
 alright+0.111
 ok+0.109
 cal+0.102
 calmed+0.096
lim+0.095
Neg logit contributions
ve-0.100
 for-0.072
ops-0.071
makers-0.070
ula-0.070
Token.
Token ID13
Feature activation-1.23e-03

Pos logit contributions
 cal+0.281
 alright+0.260
 swe+0.257
 ok+0.248
 lets+0.244
Neg logit contributions
ve-0.375
rie-0.300
iana-0.288
makers-0.280
ged-0.278
Token Lily
Token ID20037
Feature activation+6.11e-02

Pos logit contributions
 cal+0.002
 alright+0.002
 swe+0.002
 ok+0.002
 calmed+0.002
Neg logit contributions
ve-0.002
rie-0.001
iana-0.001
makers-0.001
oys-0.001
Token said
Token ID531
Feature activation+6.46e-02

Pos logit contributions
ve+0.088
rie+0.067
iana+0.064
ged+0.059
sel+0.058
Neg logit contributions
 cal-0.093
 alright-0.092
 swe-0.089
 hurricanes-0.089
 ok-0.088
Token,
Token ID11
Feature activation+8.65e-03

Pos logit contributions
ve+0.078
ged+0.057
iana+0.050
makers+0.048
ula+0.047
Neg logit contributions
 cal-0.118
 seasons-0.111
 alright-0.109
 pic-0.108
 hurricanes-0.106
Token "
Token ID366
Feature activation+5.11e-03

Pos logit contributions
ve+0.013
ged+0.010
rie+0.009
iana+0.009
makers+0.009
Neg logit contributions
 cal-0.014
 swe-0.013
 alright-0.013
 seasons-0.012
 pic-0.012
Token told
Token ID1297
Feature activation-1.71e-02

Pos logit contributions
 cal+0.085
 swe+0.079
 alright+0.077
 ok+0.074
 seasons+0.072
Neg logit contributions
ve-0.106
rie-0.085
iana-0.082
ged-0.077
makers-0.077
Token her
Token ID607
Feature activation+4.02e-02

Pos logit contributions
 cal+0.020
 swe+0.019
 alright+0.018
 lets+0.018
 ok+0.018
Neg logit contributions
ve-0.033
rie-0.026
makers-0.025
iana-0.025
ged-0.025
Token she
Token ID673
Feature activation-5.71e-03

Pos logit contributions
ve+0.063
rie+0.049
iana+0.048
makers+0.047
oys+0.043
Neg logit contributions
 cal-0.062
 swe-0.056
 alright-0.056
 lets-0.054
 shall-0.053
Token needed
Token ID2622
Feature activation+1.16e-01

Pos logit contributions
 cal+0.009
 swe+0.008
 alright+0.008
 ok+0.008
 lets+0.008
Neg logit contributions
ve-0.009
rie-0.007
iana-0.007
makers-0.006
ged-0.006
Token to
Token ID284
Feature activation-3.43e-02

Pos logit contributions
ve+0.165
iana+0.121
oys+0.119
rie+0.118
makers+0.117
Neg logit contributions
 cal-0.181
 alright-0.164
 nods-0.163
 lets-0.161
 calmed-0.159
Token hurry
Token ID23290
Feature activation-1.87e-01

Pos logit contributions
 cal+0.053
 nods+0.048
 swe+0.048
 calmed+0.048
 lets+0.048
Neg logit contributions
ve-0.048
iana-0.037
eg-0.034
 ante-0.033
sel-0.033
Token to
Token ID284
Feature activation-3.39e-02

Pos logit contributions
 cal+0.189
 swe+0.179
 alright+0.178
 ok+0.167
 lets+0.155
Neg logit contributions
ve-0.382
rie-0.317
iana-0.302
ged-0.296
eg-0.294
Token take
Token ID1011
Feature activation+3.71e-02

Pos logit contributions
 cal+0.048
 alright+0.045
 swe+0.044
 nods+0.044
 lets+0.044
Neg logit contributions
ve-0.052
iana-0.040
rie-0.039
makers-0.037
 ante-0.037
Token a
Token ID257
Feature activation+3.11e-02

Pos logit contributions
ve+0.062
rie+0.050
iana+0.048
makers+0.046
ged+0.046
Neg logit contributions
 cal-0.052
 alright-0.048
 swe-0.048
 ok-0.046
 lets-0.046
Token bath
Token ID7837
Feature activation+2.91e-02

Pos logit contributions
ve+0.046
rie+0.033
iana+0.032
ula+0.030
makers+0.030
Neg logit contributions
 cal-0.049
 alright-0.047
 nods-0.046
 lets-0.046
 ok-0.045
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
ve+0.052
rie+0.042
iana+0.041
makers+0.039
ged+0.039
Neg logit contributions
 cal-0.037
 swe-0.034
 alright-0.034
 ok-0.033
 lets-0.032
Token but
Token ID475
Feature activation-6.47e-03

Pos logit contributions
 cal+0.000
 swe+0.000
 alright+0.000
 cleans+0.000
 shall+0.000
Neg logit contributions
ve-0.000
rie-0.000
ged-0.000
iana-0.000
makers-0.000
Token she
Token ID673
Feature activation-1.16e-02

Pos logit contributions
 cal+0.010
 swe+0.009
 alright+0.009
 pic+0.009
 shall+0.009
Neg logit contributions
ve-0.009
rie-0.007
iana-0.007
ged-0.007
makers-0.007
Token felt
Token ID2936
Feature activation+6.81e-02

Pos logit contributions
 cal+0.017
 swe+0.017
 alright+0.016
 hurricanes+0.016
 ok+0.016
Neg logit contributions
ve-0.018
rie-0.014
iana-0.013
ged-0.012
makers-0.012
Token a
Token ID257
Feature activation+2.89e-02

Pos logit contributions
ve+0.089
ged+0.073
ging+0.071
makers+0.071
rie+0.069
Neg logit contributions
 seasons-0.095
 pic-0.094
rises-0.093
 hurricanes-0.091
 shall-0.088
Token little
Token ID1310
Feature activation+2.97e-02

Pos logit contributions
ve+0.052
rie+0.044
iana+0.043
makers+0.041
ged+0.039
Neg logit contributions
 cal-0.038
 alright-0.034
 swe-0.034
 calmed-0.033
 ok-0.032
Token better
Token ID1365
Feature activation-1.68e-01

Pos logit contributions
ve+0.052
rie+0.042
makers+0.040
ged+0.040
sel+0.040
Neg logit contributions
 pic-0.033
 hurricanes-0.032
 seasons-0.032
adows-0.031
rises-0.031
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
 cal+0.159
 alright+0.142
 calmed+0.138
 swe+0.136
 ok+0.136
Neg logit contributions
ve-0.354
rie-0.305
iana-0.295
chie-0.281
lves-0.278
Token
Token ID198
Feature activation+6.07e-03

Pos logit contributions
 cal+0.018
 alright+0.017
 swe+0.017
 ok+0.016
 calmed+0.016
Neg logit contributions
ve-0.013
rie-0.011
iana-0.010
oys-0.010
ged-0.009
Token
Token ID198
Feature activation+6.08e-03

Pos logit contributions
ve+0.012
rie+0.010
iana+0.010
sel+0.009
makers+0.009
Neg logit contributions
 cal-0.007
 swe-0.006
 alright-0.006
 lets-0.006
 calmed-0.006
Token"
Token ID1
Feature activation+2.15e-02

Pos logit contributions
ve+0.012
rie+0.010
iana+0.009
sel+0.009
makers+0.009
Neg logit contributions
 cal-0.007
 swe-0.006
 alright-0.006
 lets-0.006
 calmed-0.006
TokenI
Token ID40
Feature activation-6.16e-02

Pos logit contributions
ve+0.033
rie+0.025
iana+0.025
ged+0.024
oys+0.023
Neg logit contributions
 cal-0.032
 alright-0.031
adows-0.030
 ok-0.030
 swe-0.030
Token on
Token ID319
Feature activation-4.04e-02

Pos logit contributions
ve+0.091
rie+0.070
makers+0.068
ged+0.066
iana+0.065
Neg logit contributions
 cal-0.075
 swe-0.070
 alright-0.068
 calmed-0.067
 lets-0.067
Token his
Token ID465
Feature activation+4.70e-02

Pos logit contributions
 cal+0.055
 alright+0.051
 swe+0.050
 calmed+0.049
 lets+0.049
Neg logit contributions
ve-0.066
rie-0.052
iana-0.050
makers-0.050
ged-0.049
Token snacks
Token ID28487
Feature activation-7.12e-02

Pos logit contributions
ve+0.074
rie+0.054
 spine+0.052
eg+0.052
iana+0.051
Neg logit contributions
 cal-0.067
 alright-0.063
 calmed-0.062
 lets-0.061
 cleans-0.060
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
 cal+0.090
 alright+0.083
 swe+0.083
 ok+0.079
 lets+0.078
Neg logit contributions
ve-0.128
rie-0.104
iana-0.100
makers-0.096
ged-0.096
Token He
Token ID679
Feature activation+1.41e-02

Pos logit contributions
 cal+0.014
 swe+0.014
 alright+0.013
 ok+0.013
 calmed+0.012
Neg logit contributions
ve-0.016
rie-0.013
iana-0.012
makers-0.012
ged-0.011
Token slowly
Token ID6364
Feature activation-1.69e-01

Pos logit contributions
ve+0.024
rie+0.019
iana+0.018
ged+0.018
makers+0.017
Neg logit contributions
 cal-0.019
 swe-0.018
 alright-0.018
 ok-0.017
 lets-0.016
Token crept
Token ID47092
Feature activation-1.19e-01

Pos logit contributions
 alright+0.203
 cal+0.201
 ok+0.194
 swe+0.192
 seasons+0.190
Neg logit contributions
ve-0.296
rie-0.251
ged-0.245
iana-0.237
sel-0.232
Token over
Token ID625
Feature activation-7.49e-02

Pos logit contributions
 cal+0.130
 alright+0.117
 swe+0.116
 ok+0.111
 lets+0.107
Neg logit contributions
ve-0.237
rie-0.197
iana-0.189
makers-0.184
ged-0.180
Token to
Token ID284
Feature activation-3.61e-02

Pos logit contributions
 cal+0.097
 alright+0.092
 swe+0.091
 ok+0.088
 lets+0.086
Neg logit contributions
ve-0.130
rie-0.103
iana-0.100
ged-0.099
makers-0.096
Token M
Token ID337
Feature activation-9.25e-03

Pos logit contributions
 cal+0.046
 alright+0.042
 swe+0.042
 ok+0.040
 lets+0.040
Neg logit contributions
ve-0.066
rie-0.054
iana-0.051
makers-0.049
ged-0.049
Tokenomm
Token ID2002
Feature activation+5.40e-02

Pos logit contributions
 cal+0.016
 swe+0.015
 alright+0.014
 lets+0.014
 calmed+0.014
Neg logit contributions
ve-0.014
iana-0.010
rie-0.010
makers-0.009
ula-0.009
Token troubled
Token ID17840
Feature activation+5.98e-02

Pos logit contributions
ve+0.190
rie+0.151
iana+0.147
makers+0.142
ged+0.139
Neg logit contributions
 cal-0.131
 swe-0.120
 alright-0.119
 lets-0.115
 ok-0.113
Token.
Token ID13
Feature activation-7.05e-03

Pos logit contributions
ve+0.092
rie+0.071
makers+0.068
iana+0.068
ged+0.067
Neg logit contributions
 cal-0.090
 swe-0.083
 lets-0.082
 alright-0.082
 calmed-0.080
Token She
Token ID1375
Feature activation-1.80e-02

Pos logit contributions
 cal+0.011
 alright+0.010
 swe+0.010
 cleans+0.010
 lets+0.010
Neg logit contributions
ve-0.010
iana-0.008
rie-0.008
makers-0.007
 ante-0.007
Token was
Token ID373
Feature activation+1.08e-02

Pos logit contributions
 cal+0.025
 swe+0.024
 alright+0.023
 ok+0.023
 lets+0.022
Neg logit contributions
ve-0.030
rie-0.024
iana-0.023
makers-0.022
ged-0.022
Token thinking
Token ID3612
Feature activation+1.58e-01

Pos logit contributions
ve+0.018
rie+0.014
iana+0.013
makers+0.013
ged+0.012
Neg logit contributions
 cal-0.016
 swe-0.015
 alright-0.015
 lets-0.014
 ok-0.014
Token about
Token ID546
Feature activation-1.75e-01

Pos logit contributions
ve+0.311
rie+0.260
iana+0.253
ged+0.246
ula+0.241
Neg logit contributions
 cal-0.165
 lets-0.142
 swe-0.141
 pic-0.138
 alright-0.137
Token how
Token ID703
Feature activation-1.32e-01

Pos logit contributions
 cal+0.292
 calmed+0.270
 nods+0.269
 nodded+0.258
 snowball+0.258
Neg logit contributions
ve-0.189
 er-0.146
iana-0.143
 for-0.139
 pupil-0.138
Token she
Token ID673
Feature activation-2.88e-03

Pos logit contributions
 cal+0.208
 snowball+0.199
 lets+0.197
 pic+0.193
 alright+0.192
Neg logit contributions
ve-0.177
makers-0.124
iana-0.119
rie-0.118
 pupil-0.114
Token wished
Token ID16555
Feature activation+8.10e-02

Pos logit contributions
 cal+0.005
 swe+0.004
 alright+0.004
 ok+0.004
 flour+0.004
Neg logit contributions
ve-0.004
rie-0.003
iana-0.003
ged-0.003
ley-0.003
Token her
Token ID607
Feature activation+4.48e-02

Pos logit contributions
ve+0.128
rie+0.100
iana+0.095
ged+0.091
makers+0.091
Neg logit contributions
 cal-0.121
 alright-0.114
 swe-0.113
 ok-0.110
 lets-0.107
Token life
Token ID1204
Feature activation-7.75e-02

Pos logit contributions
ve+0.067
rie+0.052
iana+0.051
 pupil+0.048
 spine+0.048
Neg logit contributions
 cal-0.069
 alright-0.062
 lets-0.061
 calmed-0.060
 swe-0.059