TOP ACTIVATIONS
MAX = 0.177

drain by cleaning it ." He ran inside and asked his
of Brave Adventure rs ". He felt excited and he walked
trust this ig loo ". He stepped inside carefully and lay
to his other toys . He was very happy . He
bear nodded in agreement . He said , " This is
over and over again . He liked to pretend that the
walk in the park . He ran and ran , chasing
Turn !" it said . He shook and was scared ,
around in the tub . He had so much fun !
hopped around the garden . He explored the flower beds ,
off down the street . He went faster and faster until
excited to explore it . He picked it up and shook
said , " 2016 ". He asked his parents , "
dough in the field . He wanted to choose the dough
and climbed over it . He tried to jump on the
trees . Rover smiled . He was having such a good
, banana , banana ". He said it again and again
toys around the room . He had fun watching them zoom
little fish sh ivered . He needed to find a way
lying in the grass . He decided to take the spear

MIN ACTIVATIONS
MIN = -0.291

to take a journey to outer space . He hopped into
what the rocket discovered in outer space . It was a
the stars and planets in outer space . So one day
special star that comes from outer space ," she explained .
Emma who loved learning about outer space . Everyday , she
izz ed off back into outer space . Earth was an
, and even went to outer space ! It was the
looked like it came from outer space . L
that they were going to outer space . They watched out
. Soon they were in outer space and having fun .
mouse never suffered from the rud eness of other animals again
getting very annoyed with his rud eness and complained that he
She takes the v ase from them and puts
punished for knocking over the v ase . Tim my felt
fold the knob into an orig ami figure . But no
they had never taken the v ase . They wished they
sailed for days , exploring unch arted waters until he came
he couldn 't touch the v ase . Tim my felt
brother accidentally knocked over the v ase and it broke into
and help me fix the v ase . And you have

INTERVAL 0.060 - 0.177
CONTAINS 17.940%

from the pond . Then he hopped away , leaving the
couldn 't . Her dream was gone . And so was
but a baby bird . She gently lifted the egg and
picture and they could see it much better . Tim my
fragile house . The bird looked at the house and saw

INTERVAL -0.057 - 0.060
CONTAINS 63.552%

the swings , the slide and the sees aw . They
realized how important it was to make sure his hiding place
see what was up there , so he crawled onto a
A man heard them and came to see what was
. I didn 't know that . Thank you , Lily

INTERVAL -0.174 - -0.057
CONTAINS 18.459%

asked Jill why she thought golf was not normal .
the water . One day , Tim saw a
. They thanked Lucy for inviting them and promised to play
, " V anish !" The toy car was gone .
and run . They saw a big tent with many colors

INTERVAL -0.291 - -0.174
CONTAINS 0.050%

it could fit in its j ogging pants , it was
the little boy had the y umm iest treat . He
to go back to his ig loo and ask his family
" Oh no , the v ase is broken ! Can
could help get the car unst uck . The
Token drain
Token ID14782
Feature activation-2.10e-02

Pos logit contributions
igsaw+0.210
opl+0.172
umm+0.169
ummy+0.167
Whe+0.160
Neg logit contributions
 could-0.217
 didn-0.215
 decided-0.215
 tried-0.213
 remembered-0.212
Token by
Token ID416
Feature activation-1.40e-02

Pos logit contributions
igsaw+0.047
 difference+0.040
ummy+0.039
opl+0.039
umm+0.038
Neg logit contributions
 remembered-0.033
 tried-0.033
 decided-0.033
 could-0.032
 didn-0.032
Token cleaning
Token ID12724
Feature activation-5.82e-02

Pos logit contributions
igsaw+0.030
 difference+0.025
ummy+0.025
umm+0.024
opl+0.024
Neg logit contributions
 remembered-0.024
 tried-0.024
 decided-0.023
 didn-0.023
 could-0.023
Token it
Token ID340
Feature activation+6.23e-02

Pos logit contributions
igsaw+0.120
 difference+0.099
opl+0.099
umm+0.097
ummy+0.096
Neg logit contributions
 could-0.102
 tried-0.101
 remembered-0.101
 decided-0.101
 didn-0.100
Token."
Token ID526
Feature activation-1.58e-02

Pos logit contributions
 tried+0.096
 could+0.095
 remembered+0.095
 decided+0.094
 didn+0.093
Neg logit contributions
igsaw-0.143
 difference-0.121
ummy-0.120
opl-0.120
umm-0.119
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
igsaw+0.032
 difference+0.027
ummy+0.026
opl+0.026
umm+0.025
Neg logit contributions
 tried-0.029
 remembered-0.029
 decided-0.029
 could-0.028
 didn-0.028
Token ran
Token ID4966
Feature activation+4.99e-03

Pos logit contributions
'd+0.172
 remembered+0.170
 stumbled+0.165
 could+0.165
 managed+0.159
Neg logit contributions
igsaw-0.451
 types-0.400
 difference-0.400
 art-0.393
 batch-0.388
Token inside
Token ID2641
Feature activation+5.20e-03

Pos logit contributions
 remembered+0.009
'd+0.009
 could+0.009
 didn+0.009
 decided+0.009
Neg logit contributions
igsaw-0.010
 difference-0.008
ummy-0.008
opl-0.008
 art-0.008
Token and
Token ID290
Feature activation+2.52e-02

Pos logit contributions
 tried+0.007
 remembered+0.007
 decided+0.007
 could+0.007
 didn+0.007
Neg logit contributions
igsaw-0.013
 difference-0.011
opl-0.011
ummy-0.011
umm-0.011
Token asked
Token ID1965
Feature activation-4.08e-03

Pos logit contributions
'd+0.025
 remembered+0.021
 wouldn+0.021
 decided+0.020
 could+0.020
Neg logit contributions
igsaw-0.066
 art-0.058
 new-0.058
 types-0.057
 batch-0.057
Token his
Token ID465
Feature activation-2.54e-02

Pos logit contributions
igsaw+0.010
opl+0.009
 difference+0.008
umm+0.008
ummy+0.008
Neg logit contributions
 remembered-0.006
 tried-0.006
 could-0.006
 decided-0.006
 didn-0.006
Token of
Token ID286
Feature activation-7.14e-02

Pos logit contributions
igsaw+0.153
 difference+0.128
opl+0.126
ummy+0.125
umm+0.125
Neg logit contributions
 could-0.136
 decided-0.136
 tried-0.135
 remembered-0.134
 didn-0.133
Token Brave
Token ID22813
Feature activation-2.39e-02

Pos logit contributions
igsaw+0.133
umm+0.114
ummy+0.111
 difference+0.108
opl+0.105
Neg logit contributions
 tried-0.141
 decided-0.139
 could-0.138
 didn-0.137
 remembered-0.136
Token Adventure
Token ID9553
Feature activation-8.33e-02

Pos logit contributions
igsaw+0.045
 difference+0.037
ummy+0.037
opl+0.036
umm+0.035
Neg logit contributions
 remembered-0.046
 tried-0.046
 decided-0.045
 could-0.045
 didn-0.045
Tokenrs
Token ID3808
Feature activation+6.54e-03

Pos logit contributions
igsaw+0.149
 difference+0.124
umm+0.120
opl+0.120
ummy+0.118
Neg logit contributions
 could-0.170
 tried-0.166
 decided-0.166
'd-0.165
 remembered-0.165
Token".
Token ID1911
Feature activation-9.04e-03

Pos logit contributions
 decided+0.012
 remembered+0.012
 could+0.012
 tried+0.012
 didn+0.011
Neg logit contributions
igsaw-0.013
 difference-0.011
opl-0.011
umm-0.011
ummy-0.011
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
igsaw+0.017
 difference+0.014
ummy+0.014
opl+0.013
umm+0.013
Neg logit contributions
 remembered-0.017
 tried-0.017
 decided-0.017
 could-0.017
 didn-0.017
Token felt
Token ID2936
Feature activation+5.97e-02

Pos logit contributions
 remembered+0.150
 managed+0.147
 tried+0.147
 decided+0.140
 wouldn+0.140
Neg logit contributions
igsaw-0.480
 difference-0.432
 types-0.429
ummy-0.421
opl-0.419
Token excited
Token ID6568
Feature activation-6.53e-02

Pos logit contributions
 remembered+0.090
 tried+0.088
 decided+0.086
 could+0.086
 managed+0.086
Neg logit contributions
igsaw-0.135
 difference-0.118
ummy-0.114
opl-0.113
 types-0.110
Token and
Token ID290
Feature activation+3.26e-02

Pos logit contributions
igsaw+0.161
 difference+0.140
ummy+0.139
opl+0.136
umm+0.134
Neg logit contributions
 remembered-0.088
 tried-0.087
 decided-0.085
 could-0.084
 managed-0.084
Token he
Token ID339
Feature activation+1.40e-01

Pos logit contributions
 remembered+0.033
'd+0.032
 managed+0.031
 tried+0.031
 wouldn+0.031
Neg logit contributions
igsaw-0.081
 new-0.073
 difference-0.072
 types-0.071
 art-0.071
Token walked
Token ID6807
Feature activation-1.07e-02

Pos logit contributions
pped+0.106
 remembered+0.103
 unable+0.102
 accidentally+0.101
 managed+0.100
Neg logit contributions
igsaw-0.344
 types-0.325
 new-0.318
 bou-0.314
 art-0.310
Token trust
Token ID3774
Feature activation-2.84e-02

Pos logit contributions
 remembered+0.156
 managed+0.153
 decided+0.149
 tried+0.149
 could+0.148
Neg logit contributions
igsaw-0.271
 difference-0.258
 birthday-0.233
 types-0.233
ummy-0.230
Token this
Token ID428
Feature activation-9.61e-02

Pos logit contributions
igsaw+0.059
 difference+0.049
ummy+0.049
opl+0.048
umm+0.047
Neg logit contributions
 remembered-0.049
 tried-0.049
 decided-0.048
 could-0.048
 didn-0.048
Token ig
Token ID45329
Feature activation-1.61e-01

Pos logit contributions
igsaw+0.191
umm+0.156
ummy+0.155
opl+0.154
 difference+0.150
Neg logit contributions
 tried-0.175
 could-0.174
 remembered-0.174
 didn-0.173
'd-0.172
Tokenloo
Token ID29680
Feature activation+4.01e-04

Pos logit contributions
igsaw+0.210
 difference+0.156
ummy+0.143
opl+0.139
umm+0.136
Neg logit contributions
 could-0.403
 remembered-0.398
 tried-0.397
 decided-0.396
 didn-0.396
Token".
Token ID1911
Feature activation-6.37e-03

Pos logit contributions
 remembered+0.001
 tried+0.001
 decided+0.001
 could+0.001
 didn+0.001
Neg logit contributions
igsaw-0.001
 difference-0.001
ummy-0.001
opl-0.001
umm-0.001
Token He
Token ID679
Feature activation+1.76e-01

Pos logit contributions
igsaw+0.012
 difference+0.010
ummy+0.010
opl+0.010
umm+0.010
Neg logit contributions
 remembered-0.012
 tried-0.012
 decided-0.012
 could-0.011
 didn-0.011
Token stepped
Token ID10764
Feature activation-4.14e-02

Pos logit contributions
 remembered+0.167
 managed+0.160
 decided+0.159
'd+0.159
 tried+0.158
Neg logit contributions
igsaw-0.458
 difference-0.414
 types-0.408
ummy-0.394
 new-0.393
Token inside
Token ID2641
Feature activation+7.34e-03

Pos logit contributions
igsaw+0.082
ummy+0.070
 difference+0.067
ay+0.065
 bou+0.065
Neg logit contributions
 remembered-0.074
 didn-0.072
 tried-0.071
'd-0.070
 could-0.070
Token carefully
Token ID7773
Feature activation+3.64e-02

Pos logit contributions
 remembered+0.010
 tried+0.009
 decided+0.009
 managed+0.009
 could+0.009
Neg logit contributions
igsaw-0.018
 difference-0.016
ummy-0.016
opl-0.015
 types-0.015
Token and
Token ID290
Feature activation+2.65e-02

Pos logit contributions
 remembered+0.048
 tried+0.047
 decided+0.047
 could+0.046
 didn+0.046
Neg logit contributions
igsaw-0.090
 difference-0.078
ummy-0.077
opl-0.076
 types-0.074
Token lay
Token ID3830
Feature activation-7.58e-02

Pos logit contributions
'd+0.027
 remembered+0.025
 decided+0.024
 could+0.024
 managed+0.023
Neg logit contributions
igsaw-0.067
 new-0.059
 difference-0.058
 types-0.057
 art-0.057
Token to
Token ID284
Feature activation+2.45e-02

Pos logit contributions
igsaw+0.175
 difference+0.153
opl+0.153
umm+0.149
ummy+0.147
Neg logit contributions
 tried-0.095
 could-0.093
 remembered-0.090
 decided-0.089
 didn-0.089
Token his
Token ID465
Feature activation-3.68e-02

Pos logit contributions
 remembered+0.045
 tried+0.045
 decided+0.044
 could+0.044
 didn+0.044
Neg logit contributions
igsaw-0.049
 difference-0.041
ummy-0.040
opl-0.039
umm-0.038
Token other
Token ID584
Feature activation-6.09e-02

Pos logit contributions
igsaw+0.080
opl+0.074
Whe+0.070
umm+0.069
 festivals+0.068
Neg logit contributions
 tried-0.056
 remembered-0.055
 could-0.054
 dropped-0.053
 had-0.052
Token toys
Token ID14958
Feature activation+2.37e-02

Pos logit contributions
igsaw+0.138
opl+0.118
umm+0.115
 difference+0.111
ummy+0.110
Neg logit contributions
 tried-0.099
 remembered-0.095
 didn-0.094
 could-0.094
 decided-0.093
Token.
Token ID13
Feature activation+6.18e-03

Pos logit contributions
 remembered+0.036
 tried+0.035
 managed+0.034
 decided+0.034
 didn+0.034
Neg logit contributions
igsaw-0.052
ummy-0.045
 difference-0.045
opl-0.043
 types-0.043
Token He
Token ID679
Feature activation+1.76e-01

Pos logit contributions
 could+0.010
 tried+0.010
 decided+0.009
 didn+0.009
 accidentally+0.009
Neg logit contributions
igsaw-0.014
 difference-0.012
umm-0.012
opl-0.012
ummy-0.012
Token was
Token ID373
Feature activation+8.49e-02

Pos logit contributions
 remembered+0.214
 tried+0.209
 decided+0.208
 managed+0.205
 could+0.203
Neg logit contributions
igsaw-0.439
 difference-0.385
ummy-0.377
opl-0.375
 types-0.373
Token very
Token ID845
Feature activation+5.56e-02

Pos logit contributions
 remembered+0.110
 tried+0.109
 decided+0.107
 could+0.105
 didn+0.104
Neg logit contributions
igsaw-0.219
 difference-0.191
ummy-0.188
opl-0.185
umm-0.184
Token happy
Token ID3772
Feature activation-4.65e-02

Pos logit contributions
 remembered+0.094
 tried+0.094
 decided+0.092
 could+0.091
 didn+0.090
Neg logit contributions
igsaw-0.124
 difference-0.106
opl-0.103
umm-0.102
ummy-0.102
Token.
Token ID13
Feature activation+4.54e-03

Pos logit contributions
igsaw+0.104
ummy+0.091
 difference+0.090
opl+0.086
 art+0.085
Neg logit contributions
 remembered-0.072
 tried-0.070
 managed-0.069
'd-0.069
 decided-0.069
Token He
Token ID679
Feature activation+1.71e-01

Pos logit contributions
 could+0.007
 tried+0.007
 decided+0.007
 didn+0.007
 remembered+0.007
Neg logit contributions
igsaw-0.010
 difference-0.009
umm-0.009
 bou-0.009
 art-0.009
Token bear
Token ID6842
Feature activation+9.41e-02

Pos logit contributions
igsaw+0.287
opl+0.250
ummy+0.242
umm+0.242
 difference+0.240
Neg logit contributions
 tried-0.148
 remembered-0.148
 could-0.138
 decided-0.138
 didn-0.138
Token nodded
Token ID14464
Feature activation+3.57e-02

Pos logit contributions
 unable+0.100
'd+0.099
pped+0.097
 wouldn+0.097
 managed+0.096
Neg logit contributions
igsaw-0.221
 types-0.200
ummy-0.197
 difference-0.194
 art-0.192
Token in
Token ID287
Feature activation-8.89e-03

Pos logit contributions
'd+0.066
 managed+0.066
 wouldn+0.064
 forgot+0.063
 stumbled+0.063
Neg logit contributions
 new-0.054
igsaw-0.052
 bou-0.049
 birthday-0.049
ummy-0.048
Token agreement
Token ID4381
Feature activation-6.83e-02

Pos logit contributions
igsaw+0.014
ummy+0.011
 difference+0.011
opl+0.011
umm+0.011
Neg logit contributions
 tried-0.020
 remembered-0.020
 decided-0.020
 didn-0.020
 managed-0.019
Token.
Token ID13
Feature activation+3.31e-03

Pos logit contributions
igsaw+0.125
ummy+0.115
 birthday+0.106
 art+0.105
 new+0.104
Neg logit contributions
'd-0.126
 remembered-0.125
 managed-0.124
 accidentally-0.119
 wouldn-0.119
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 could+0.004
 tried+0.004
 decided+0.004
 didn+0.004
 remembered+0.004
Neg logit contributions
igsaw-0.009
 difference-0.008
opl-0.008
 bou-0.007
umm-0.007
Token said
Token ID531
Feature activation-6.83e-03

Pos logit contributions
 unable+0.199
pped+0.181
 stumbled+0.179
 wouldn+0.179
'd+0.177
Neg logit contributions
igsaw-0.372
 types-0.364
 new-0.351
 batch-0.349
 bou-0.348
Token,
Token ID11
Feature activation+2.74e-02

Pos logit contributions
igsaw+0.016
 difference+0.013
opl+0.013
ummy+0.013
umm+0.013
Neg logit contributions
 remembered-0.011
 tried-0.011
 decided-0.011
 could-0.011
 didn-0.010
Token "
Token ID366
Feature activation+1.85e-02

Pos logit contributions
 remembered+0.058
 managed+0.057
 tried+0.055
 decided+0.055
'd+0.055
Neg logit contributions
igsaw-0.041
ummy-0.035
 new-0.033
 difference-0.033
 birthday-0.032
TokenThis
Token ID1212
Feature activation-5.28e-02

Pos logit contributions
 remembered+0.029
 wouldn+0.028
 tried+0.028
 didn+0.028
 accidentally+0.027
Neg logit contributions
igsaw-0.040
umm-0.035
opl-0.034
 types-0.034
 difference-0.034
Token is
Token ID318
Feature activation+7.21e-02

Pos logit contributions
igsaw+0.098
 difference+0.081
ummy+0.079
opl+0.078
umm+0.076
Neg logit contributions
 remembered-0.104
 tried-0.103
 decided-0.102
 could-0.101
 didn-0.101
Token over
Token ID625
Feature activation-2.80e-02

Pos logit contributions
 remembered+0.030
 decided+0.029
 tried+0.029
 managed+0.029
 could+0.029
Neg logit contributions
igsaw-0.044
 difference-0.040
opl-0.038
ummy-0.038
 types-0.037
Token and
Token ID290
Feature activation+2.48e-02

Pos logit contributions
igsaw+0.061
 difference+0.051
ummy+0.051
opl+0.050
umm+0.049
Neg logit contributions
 remembered-0.046
 tried-0.046
 decided-0.045
 could-0.045
 didn-0.045
Token over
Token ID625
Feature activation-3.00e-02

Pos logit contributions
 remembered+0.046
 tried+0.045
 decided+0.045
 could+0.045
 didn+0.044
Neg logit contributions
igsaw-0.048
 difference-0.040
ummy-0.039
opl-0.039
umm-0.038
Token again
Token ID757
Feature activation+3.97e-03

Pos logit contributions
igsaw+0.068
 difference+0.058
ummy+0.057
opl+0.057
umm+0.055
Neg logit contributions
 remembered-0.047
 tried-0.047
 decided-0.047
 could-0.046
 didn-0.046
Token.
Token ID13
Feature activation+7.50e-03

Pos logit contributions
 remembered+0.006
 tried+0.006
 decided+0.006
 didn+0.006
 managed+0.006
Neg logit contributions
igsaw-0.009
 difference-0.008
ummy-0.008
opl-0.007
umm-0.007
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 could+0.010
 tried+0.010
 had+0.010
 didn+0.009
 stumbled+0.009
Neg logit contributions
igsaw-0.019
 bou-0.017
umm-0.017
 difference-0.016
ummy-0.016
Token liked
Token ID8288
Feature activation+5.81e-03

Pos logit contributions
 unable+0.135
 managed+0.129
pped+0.129
 remembered+0.124
 tried+0.124
Neg logit contributions
igsaw-0.413
 types-0.410
 new-0.403
 difference-0.399
 goals-0.393
Token to
Token ID284
Feature activation+3.10e-02

Pos logit contributions
 remembered+0.010
 tried+0.010
 decided+0.010
 managed+0.009
 could+0.009
Neg logit contributions
igsaw-0.012
 difference-0.010
ummy-0.010
opl-0.010
 types-0.010
Token pretend
Token ID16614
Feature activation-5.36e-03

Pos logit contributions
 remembered+0.060
 could+0.059
 managed+0.058
 tried+0.058
 decided+0.058
Neg logit contributions
igsaw-0.055
 difference-0.047
opl-0.045
 art-0.044
 types-0.043
Token that
Token ID326
Feature activation+8.54e-03

Pos logit contributions
igsaw+0.014
 difference+0.012
ummy+0.012
opl+0.012
umm+0.012
Neg logit contributions
 remembered-0.006
 tried-0.006
 decided-0.006
 could-0.006
 didn-0.006
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
 could+0.014
 decided+0.013
 tried+0.013
 didn+0.013
 had+0.013
Neg logit contributions
igsaw-0.021
ummy-0.018
umm-0.017
opl-0.017
 difference-0.017
Token walk
Token ID2513
Feature activation-5.34e-02

Pos logit contributions
igsaw+0.119
 difference+0.098
ummy+0.094
opl+0.092
umm+0.089
Neg logit contributions
 remembered-0.146
 tried-0.145
 decided-0.144
 could-0.143
 managed-0.142
Token in
Token ID287
Feature activation-1.58e-02

Pos logit contributions
igsaw+0.116
ummy+0.101
 difference+0.100
opl+0.098
 types+0.097
Neg logit contributions
 remembered-0.081
 managed-0.078
 tried-0.078
 didn-0.076
 wouldn-0.075
Token the
Token ID262
Feature activation-1.40e-01

Pos logit contributions
igsaw+0.034
ummy+0.029
umm+0.029
 difference+0.029
opl+0.028
Neg logit contributions
 tried-0.027
 decided-0.026
 remembered-0.026
 could-0.026
 didn-0.025
Token park
Token ID3952
Feature activation+4.78e-03

Pos logit contributions
igsaw+0.270
ummy+0.230
umm+0.222
 types+0.213
 batch+0.206
Neg logit contributions
 decided-0.283
 tried-0.273
 remembered-0.271
 had-0.266
 got-0.264
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
 remembered+0.007
 tried+0.007
 decided+0.007
 could+0.007
 didn+0.007
Neg logit contributions
igsaw-0.011
 difference-0.010
ummy-0.010
opl-0.010
umm-0.009
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 could+0.027
 tried+0.027
 decided+0.027
 remembered+0.027
 didn+0.027
Neg logit contributions
igsaw-0.034
 difference-0.030
ummy-0.028
opl-0.028
umm-0.028
Token ran
Token ID4966
Feature activation+5.93e-03

Pos logit contributions
 remembered+0.132
 managed+0.132
 tried+0.125
 decided+0.122
 wouldn+0.120
Neg logit contributions
igsaw-0.471
 difference-0.434
 types-0.432
opl-0.428
 new-0.424
Token and
Token ID290
Feature activation+2.54e-02

Pos logit contributions
 remembered+0.011
 tried+0.011
 managed+0.011
 could+0.011
'd+0.011
Neg logit contributions
igsaw-0.011
ummy-0.009
 difference-0.009
 new-0.009
opl-0.009
Token ran
Token ID4966
Feature activation+2.99e-03

Pos logit contributions
 remembered+0.022
'd+0.022
 managed+0.020
 wouldn+0.020
 didn+0.020
Neg logit contributions
igsaw-0.063
 new-0.058
 bou-0.057
 festivals-0.057
 types-0.057
Token,
Token ID11
Feature activation+1.42e-02

Pos logit contributions
 remembered+0.006
 managed+0.005
'd+0.005
 didn+0.005
 tried+0.005
Neg logit contributions
igsaw-0.006
 new-0.005
ummy-0.005
 difference-0.005
opl-0.005
Token chasing
Token ID20023
Feature activation-6.81e-02

Pos logit contributions
 remembered+0.022
'd+0.021
 decided+0.021
 managed+0.020
 didn+0.020
Neg logit contributions
igsaw-0.027
 new-0.025
 biggest-0.024
 bou-0.024
 difference-0.023
TokenTurn
Token ID17278
Feature activation-4.51e-02

Pos logit contributions
 wouldn+0.029
 remembered+0.028
 managed+0.027
 could+0.027
 didn+0.027
Neg logit contributions
igsaw-0.035
Whe-0.031
 difference-0.030
 festivals-0.029
opl-0.029
Token!"
Token ID2474
Feature activation-2.80e-02

Pos logit contributions
igsaw+0.099
 difference+0.083
umm+0.083
opl+0.081
ummy+0.080
Neg logit contributions
 could-0.076
 tried-0.076
 decided-0.075
 remembered-0.074
 didn-0.074
Token it
Token ID340
Feature activation+9.23e-02

Pos logit contributions
igsaw+0.061
 types+0.052
 difference+0.051
ummy+0.051
 art+0.050
Neg logit contributions
 remembered-0.043
'd-0.042
 managed-0.040
 wouldn-0.039
 decided-0.038
Token said
Token ID531
Feature activation-1.23e-02

Pos logit contributions
 managed+0.132
'd+0.129
 unable+0.126
 wouldn+0.122
 remembered+0.121
Neg logit contributions
igsaw-0.170
 new-0.170
 types-0.158
 difference-0.155
 bou-0.152
Token.
Token ID13
Feature activation+4.95e-03

Pos logit contributions
igsaw+0.027
 difference+0.023
ummy+0.023
opl+0.023
umm+0.022
Neg logit contributions
 remembered-0.020
 tried-0.020
 decided-0.019
 could-0.019
 didn-0.019
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 remembered+0.010
 decided+0.009
 managed+0.009
'd+0.009
 tried+0.009
Neg logit contributions
igsaw-0.009
ummy-0.008
 difference-0.007
 types-0.007
opl-0.007
Token shook
Token ID14682
Feature activation+2.38e-02

Pos logit contributions
 unable+0.217
pped+0.201
 managed+0.183
Please+0.173
 accidentally+0.171
Neg logit contributions
 new-0.341
 types-0.303
 old-0.296
 Grand-0.295
igsaw-0.290
Token and
Token ID290
Feature activation+3.15e-02

Pos logit contributions
'd+0.045
 managed+0.044
 remembered+0.043
 could+0.042
 accidentally+0.041
Neg logit contributions
igsaw-0.042
 new-0.037
 difference-0.037
opl-0.035
ummy-0.035
Token was
Token ID373
Feature activation+7.99e-02

Pos logit contributions
'd+0.039
 unable+0.035
pped+0.033
 hadn+0.032
 managed+0.032
Neg logit contributions
 new-0.062
 bou-0.056
 Grand-0.054
 Of-0.053
 types-0.053
Token scared
Token ID12008
Feature activation-4.08e-02

Pos logit contributions
'd+0.108
 decided+0.101
 remembered+0.100
 could+0.100
 managed+0.099
Neg logit contributions
igsaw-0.174
 new-0.161
 difference-0.158
opl-0.157
 biggest-0.152
Token,
Token ID11
Feature activation+1.78e-02

Pos logit contributions
igsaw+0.088
ummy+0.077
 difference+0.076
opl+0.072
umm+0.071
Neg logit contributions
 remembered-0.067
 tried-0.065
 managed-0.065
'd-0.064
 decided-0.064
Token around
Token ID1088
Feature activation+9.33e-03

Pos logit contributions
igsaw+0.029
 difference+0.025
ummy+0.025
 types+0.024
 new+0.024
Neg logit contributions
 remembered-0.021
 tried-0.020
 managed-0.020
 didn-0.020
 decided-0.020
Token in
Token ID287
Feature activation-1.34e-02

Pos logit contributions
 remembered+0.013
 tried+0.013
 decided+0.013
 could+0.013
 didn+0.013
Neg logit contributions
igsaw-0.022
 difference-0.019
ummy-0.019
opl-0.019
umm-0.019
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
igsaw+0.029
umm+0.025
ummy+0.025
opl+0.024
 difference+0.024
Neg logit contributions
 tried-0.022
 decided-0.022
 could-0.021
 remembered-0.021
 didn-0.021
Token tub
Token ID12202
Feature activation-3.73e-02

Pos logit contributions
igsaw+0.291
opl+0.236
ummy+0.229
umm+0.227
 types+0.221
Neg logit contributions
 decided-0.270
 tried-0.265
 could-0.260
 had-0.255
 remembered-0.255
Token.
Token ID13
Feature activation+5.88e-03

Pos logit contributions
igsaw+0.087
 difference+0.074
ummy+0.072
opl+0.072
umm+0.071
Neg logit contributions
 remembered-0.056
 tried-0.055
 decided-0.055
 could-0.055
 didn-0.054
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 could+0.010
 tried+0.010
 decided+0.010
 didn+0.010
 accidentally+0.010
Neg logit contributions
igsaw-0.012
 difference-0.010
ummy-0.010
opl-0.010
umm-0.010
Token had
Token ID550
Feature activation+1.11e-01

Pos logit contributions
 remembered+0.137
 managed+0.134
 decided+0.128
 tried+0.128
 accidentally+0.127
Neg logit contributions
igsaw-0.438
 types-0.420
 difference-0.416
 new-0.409
 festivals-0.400
Token so
Token ID523
Feature activation+2.77e-02

Pos logit contributions
 remembered+0.165
 wouldn+0.162
 didn+0.161
 tried+0.161
 decided+0.157
Neg logit contributions
igsaw-0.233
 difference-0.203
 new-0.200
 types-0.199
 biggest-0.198
Token much
Token ID881
Feature activation-2.40e-02

Pos logit contributions
 remembered+0.050
 tried+0.050
 decided+0.049
 could+0.049
 didn+0.048
Neg logit contributions
igsaw-0.057
 difference-0.048
ummy-0.047
umm-0.046
opl-0.046
Token fun
Token ID1257
Feature activation-2.74e-02

Pos logit contributions
igsaw+0.038
 difference+0.029
opl+0.029
umm+0.028
ummy+0.028
Neg logit contributions
 tried-0.056
 remembered-0.055
 decided-0.055
 could-0.054
 didn-0.053
Token!
Token ID0
Feature activation+7.30e-03

Pos logit contributions
igsaw+0.055
ummy+0.047
 difference+0.046
opl+0.046
 types+0.045
Neg logit contributions
 remembered-0.048
 managed-0.046
 decided-0.046
'd-0.046
 tried-0.046
Token hopped
Token ID45230
Feature activation-1.64e-02

Pos logit contributions
 remembered+0.038
 managed+0.037
 tried+0.037
 decided+0.036
 wouldn+0.036
Neg logit contributions
igsaw-0.078
 difference-0.072
ummy-0.069
opl-0.068
 types-0.068
Token around
Token ID1088
Feature activation+4.01e-03

Pos logit contributions
igsaw+0.035
ummy+0.029
 difference+0.029
opl+0.028
ay+0.028
Neg logit contributions
 remembered-0.027
 tried-0.026
 managed-0.026
 didn-0.026
 could-0.025
Token the
Token ID262
Feature activation-1.40e-01

Pos logit contributions
 remembered+0.006
 tried+0.006
 decided+0.006
 could+0.006
 managed+0.006
Neg logit contributions
igsaw-0.009
 difference-0.008
ummy-0.008
opl-0.007
 types-0.007
Token garden
Token ID11376
Feature activation+2.44e-02

Pos logit contributions
igsaw+0.258
opl+0.208
ummy+0.202
umm+0.199
 difference+0.195
Neg logit contributions
 decided-0.281
 remembered-0.280
 tried-0.279
 could-0.273
 didn-0.269
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 remembered+0.035
 tried+0.034
 decided+0.034
 could+0.034
 didn+0.033
Neg logit contributions
igsaw-0.058
 difference-0.050
ummy-0.049
opl-0.049
umm-0.048
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.021
 decided+0.021
 remembered+0.021
 could+0.021
 didn+0.021
Neg logit contributions
igsaw-0.024
 difference-0.021
ummy-0.020
opl-0.020
umm-0.020
Token explored
Token ID18782
Feature activation-1.32e-02

Pos logit contributions
 remembered+0.132
 managed+0.125
 wouldn+0.124
 tried+0.121
'd+0.121
Neg logit contributions
igsaw-0.442
 types-0.413
 new-0.407
 difference-0.403
 bou-0.402
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
igsaw+0.030
umm+0.025
opl+0.025
ummy+0.025
 difference+0.025
Neg logit contributions
 tried-0.021
 decided-0.021
 remembered-0.021
 could-0.021
 didn-0.021
Token flower
Token ID15061
Feature activation-8.86e-03

Pos logit contributions
igsaw+0.231
opl+0.183
ummy+0.175
umm+0.166
ay+0.164
Neg logit contributions
 decided-0.293
 remembered-0.287
 tried-0.286
 could-0.285
'd-0.283
Token beds
Token ID20237
Feature activation-4.41e-02

Pos logit contributions
igsaw+0.022
 difference+0.019
opl+0.019
ummy+0.018
umm+0.018
Neg logit contributions
 could-0.012
 tried-0.012
 remembered-0.012
 decided-0.012
 didn-0.011
Token,
Token ID11
Feature activation+1.79e-02

Pos logit contributions
igsaw+0.099
ummy+0.088
 difference+0.085
opl+0.082
umm+0.082
Neg logit contributions
 remembered-0.066
 tried-0.065
 decided-0.063
 didn-0.063
 managed-0.063
Token off
Token ID572
Feature activation-4.41e-03

Pos logit contributions
igsaw+0.071
 difference+0.063
ummy+0.063
opl+0.061
umm+0.061
Neg logit contributions
 tried-0.034
 remembered-0.032
 could-0.032
 decided-0.032
 managed-0.032
Token down
Token ID866
Feature activation-2.10e-02

Pos logit contributions
igsaw+0.011
 difference+0.010
umm+0.010
ummy+0.010
 types+0.009
Neg logit contributions
 tried-0.006
 decided-0.006
 could-0.006
 remembered-0.006
 managed-0.006
Token the
Token ID262
Feature activation-1.50e-01

Pos logit contributions
igsaw+0.051
umm+0.049
 difference+0.048
ummy+0.048
opl+0.045
Neg logit contributions
 tried-0.029
 decided-0.026
 could-0.026
 managed-0.025
 had-0.025
Token street
Token ID4675
Feature activation-2.12e-02

Pos logit contributions
igsaw+0.273
ucks+0.247
ushes+0.246
ays+0.238
aps+0.234
Neg logit contributions
 had-0.291
 tried-0.283
 decided-0.270
 could-0.269
 never-0.268
Token.
Token ID13
Feature activation+1.98e-03

Pos logit contributions
igsaw+0.056
 difference+0.048
ummy+0.047
umm+0.047
opl+0.047
Neg logit contributions
 tried-0.026
 decided-0.026
 could-0.026
 remembered-0.025
 didn-0.025
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 could+0.003
 tried+0.003
 remembered+0.003
 decided+0.003
 didn+0.003
Neg logit contributions
igsaw-0.005
 difference-0.004
ummy-0.004
umm-0.004
 bou-0.004
Token went
Token ID1816
Feature activation+1.54e-02

Pos logit contributions
 remembered+0.143
 managed+0.133
 decided+0.133
 tried+0.133
'd+0.132
Neg logit contributions
igsaw-0.463
 types-0.424
 difference-0.420
opl-0.413
 new-0.408
Token faster
Token ID5443
Feature activation-6.36e-02

Pos logit contributions
 remembered+0.026
 tried+0.026
 decided+0.026
 could+0.026
 didn+0.025
Neg logit contributions
igsaw-0.033
 difference-0.028
ummy-0.027
opl-0.027
umm-0.027
Token and
Token ID290
Feature activation+2.47e-02

Pos logit contributions
igsaw+0.162
 difference+0.141
ummy+0.140
opl+0.139
 types+0.135
Neg logit contributions
 remembered-0.080
 tried-0.078
 decided-0.077
 could-0.077
 didn-0.076
Token faster
Token ID5443
Feature activation-6.48e-02

Pos logit contributions
 remembered+0.047
 tried+0.046
 decided+0.046
 could+0.045
 didn+0.045
Neg logit contributions
igsaw-0.046
 difference-0.039
ummy-0.037
opl-0.037
umm-0.036
Token until
Token ID1566
Feature activation+6.22e-02

Pos logit contributions
igsaw+0.142
 difference+0.121
opl+0.119
ummy+0.119
 types+0.115
Neg logit contributions
 remembered-0.103
 decided-0.100
 tried-0.100
'd-0.099
 could-0.099
Token excited
Token ID6568
Feature activation-6.76e-02

Pos logit contributions
 remembered+0.118
'd+0.117
 wouldn+0.115
 tried+0.115
 could+0.115
Neg logit contributions
igsaw-0.166
 difference-0.145
ummy-0.142
opl-0.141
 types-0.138
Token to
Token ID284
Feature activation+3.61e-02

Pos logit contributions
igsaw+0.147
ummy+0.135
 difference+0.129
 types+0.124
 new+0.122
Neg logit contributions
 remembered-0.107
 managed-0.105
'd-0.103
 tried-0.101
 could-0.100
Token explore
Token ID7301
Feature activation-1.17e-02

Pos logit contributions
 remembered+0.066
 wouldn+0.066
 unable+0.064
 hadn+0.064
 managed+0.063
Neg logit contributions
igsaw-0.066
 art-0.054
ay-0.053
ummy-0.053
 new-0.053
Token it
Token ID340
Feature activation+6.42e-02

Pos logit contributions
igsaw+0.029
 difference+0.025
ummy+0.024
opl+0.024
umm+0.024
Neg logit contributions
 remembered-0.016
 tried-0.016
 decided-0.016
 could-0.016
 didn-0.015
Token.
Token ID13
Feature activation+3.63e-03

Pos logit contributions
 tried+0.090
 remembered+0.090
 decided+0.089
 could+0.089
 didn+0.088
Neg logit contributions
igsaw-0.155
 difference-0.134
ummy-0.132
opl-0.131
umm-0.130
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.006
 remembered+0.006
 could+0.006
 decided+0.006
 didn+0.006
Neg logit contributions
igsaw-0.008
 difference-0.007
ummy-0.007
opl-0.007
umm-0.007
Token picked
Token ID6497
Feature activation-2.45e-02

Pos logit contributions
 unable+0.159
pped+0.145
pper+0.136
Please+0.135
 wouldn+0.133
Neg logit contributions
 types-0.385
igsaw-0.378
 new-0.370
 seats-0.363
 styles-0.358
Token it
Token ID340
Feature activation+8.11e-02

Pos logit contributions
igsaw+0.051
umm+0.043
 difference+0.043
opl+0.043
ummy+0.042
Neg logit contributions
 tried-0.046
 decided-0.044
 could-0.044
 remembered-0.043
 managed-0.043
Token up
Token ID510
Feature activation-2.33e-02

Pos logit contributions
 tried+0.166
 decided+0.165
 could+0.164
 didn+0.161
 got+0.158
Neg logit contributions
igsaw-0.152
opl-0.128
 difference-0.125
umm-0.124
 former-0.111
Token and
Token ID290
Feature activation+3.42e-02

Pos logit contributions
igsaw+0.053
ummy+0.047
 new+0.045
 difference+0.045
 art+0.044
Neg logit contributions
'd-0.033
 remembered-0.033
 tried-0.032
 could-0.031
 managed-0.031
Token shook
Token ID14682
Feature activation+2.06e-02

Pos logit contributions
'd+0.035
 unable+0.033
 remembered+0.031
 hadn+0.030
ppy+0.029
Neg logit contributions
igsaw-0.080
 types-0.075
 new-0.075
 art-0.074
 difference-0.072
Token said
Token ID531
Feature activation-7.71e-03

Pos logit contributions
 remembered+0.148
 tried+0.139
 decided+0.138
 managed+0.138
 accidentally+0.135
Neg logit contributions
igsaw-0.398
 difference-0.361
 types-0.353
opl-0.353
ummy-0.344
Token,
Token ID11
Feature activation+2.69e-02

Pos logit contributions
igsaw+0.018
 difference+0.015
opl+0.015
ummy+0.015
umm+0.015
Neg logit contributions
 remembered-0.012
 tried-0.012
 could-0.012
 decided-0.012
 didn-0.012
Token "
Token ID366
Feature activation+1.82e-02

Pos logit contributions
 remembered+0.054
 managed+0.052
 tried+0.052
 decided+0.052
'd+0.051
Neg logit contributions
igsaw-0.046
 difference-0.038
ummy-0.038
opl-0.036
 new-0.035
Token2016
Token ID5304
Feature activation-1.63e-02

Pos logit contributions
 remembered+0.028
 tried+0.028
 decided+0.027
 could+0.027
 didn+0.027
Neg logit contributions
igsaw-0.041
 difference-0.035
ummy-0.035
opl-0.035
umm-0.034
Token".
Token ID1911
Feature activation-2.22e-02

Pos logit contributions
igsaw+0.033
 difference+0.028
ummy+0.027
opl+0.027
umm+0.026
Neg logit contributions
 remembered-0.029
 tried-0.029
 decided-0.028
 could-0.028
 didn-0.028
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
igsaw+0.044
 difference+0.037
ummy+0.037
opl+0.036
umm+0.036
Neg logit contributions
 tried-0.041
 could-0.040
 remembered-0.040
 decided-0.040
 didn-0.040
Token asked
Token ID1965
Feature activation-3.27e-03

Pos logit contributions
 managed+0.152
 unable+0.147
 remembered+0.146
'd+0.145
 wouldn+0.145
Neg logit contributions
igsaw-0.423
 types-0.388
 new-0.379
 difference-0.378
ummy-0.378
Token his
Token ID465
Feature activation-2.19e-02

Pos logit contributions
igsaw+0.007
 difference+0.006
ummy+0.006
opl+0.006
umm+0.006
Neg logit contributions
 remembered-0.005
 tried-0.005
 decided-0.005
 could-0.005
 didn-0.005
Token parents
Token ID3397
Feature activation+6.50e-02

Pos logit contributions
igsaw+0.050
opl+0.043
umm+0.041
ummy+0.040
 difference+0.039
Neg logit contributions
 remembered-0.038
 tried-0.037
 decided-0.036
 could-0.036
 didn-0.036
Token,
Token ID11
Feature activation+1.42e-02

Pos logit contributions
 remembered+0.114
 tried+0.113
 decided+0.111
 managed+0.111
 didn+0.110
Neg logit contributions
igsaw-0.131
 difference-0.112
ummy-0.110
opl-0.107
 types-0.104
Token "
Token ID366
Feature activation+1.31e-02

Pos logit contributions
 remembered+0.026
 tried+0.026
 managed+0.026
 decided+0.026
'd+0.025
Neg logit contributions
igsaw-0.026
 difference-0.022
ummy-0.021
 new-0.020
opl-0.020
Token dough
Token ID15756
Feature activation-1.06e-02

Pos logit contributions
igsaw+0.145
 difference+0.121
ummy+0.118
 types+0.114
opl+0.112
Neg logit contributions
 remembered-0.195
 tried-0.192
 managed-0.190
 decided-0.188
 could-0.188
Token in
Token ID287
Feature activation-2.23e-02

Pos logit contributions
igsaw+0.021
ummy+0.019
 art+0.018
 difference+0.018
 types+0.018
Neg logit contributions
 remembered-0.018
 tried-0.017
 managed-0.017
 didn-0.017
'd-0.017
Token the
Token ID262
Feature activation-1.46e-01

Pos logit contributions
igsaw+0.045
 difference+0.038
ummy+0.037
opl+0.036
umm+0.036
Neg logit contributions
 remembered-0.040
 tried-0.040
 decided-0.039
 could-0.039
 didn-0.039
Token field
Token ID2214
Feature activation-1.83e-02

Pos logit contributions
igsaw+0.231
 difference+0.183
ummy+0.179
opl+0.176
umm+0.170
Neg logit contributions
 remembered-0.327
 tried-0.324
 decided-0.321
 could-0.319
 didn-0.318
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
igsaw+0.040
 difference+0.035
ummy+0.035
opl+0.034
 types+0.033
Neg logit contributions
 remembered-0.029
 tried-0.028
 managed-0.027
 didn-0.027
 could-0.027
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 remembered+0.016
 tried+0.016
 decided+0.016
 could+0.016
 didn+0.016
Neg logit contributions
igsaw-0.023
 difference-0.020
ummy-0.020
opl-0.019
umm-0.019
Token wanted
Token ID2227
Feature activation+3.98e-02

Pos logit contributions
 wouldn+0.142
pped+0.133
 unable+0.133
'd+0.132
 managed+0.132
Neg logit contributions
igsaw-0.393
 types-0.392
 new-0.381
 batch-0.370
ummy-0.361
Token to
Token ID284
Feature activation+3.98e-02

Pos logit contributions
 remembered+0.073
 didn+0.070
 decided+0.069
'd+0.069
 tried+0.069
Neg logit contributions
igsaw-0.076
 difference-0.065
ummy-0.064
 new-0.063
 types-0.062
Token choose
Token ID3853
Feature activation+8.46e-03

Pos logit contributions
 remembered+0.069
 wouldn+0.066
'd+0.065
 could+0.065
 managed+0.065
Neg logit contributions
igsaw-0.081
 difference-0.070
ummy-0.068
 art-0.067
 types-0.066
Token the
Token ID262
Feature activation-1.40e-01

Pos logit contributions
 remembered+0.014
 tried+0.014
 decided+0.013
 could+0.013
 didn+0.013
Neg logit contributions
igsaw-0.019
 difference-0.016
ummy-0.016
opl-0.015
umm-0.015
Token dough
Token ID15756
Feature activation-1.21e-02

Pos logit contributions
igsaw+0.217
opl+0.170
ummy+0.163
 difference+0.162
umm+0.156
Neg logit contributions
 tried-0.320
 remembered-0.315
 could-0.314
 decided-0.311
'd-0.310
Token and
Token ID290
Feature activation+3.64e-02

Pos logit contributions
igsaw+0.023
 difference+0.020
ummy+0.020
opl+0.020
 types+0.019
Neg logit contributions
 remembered-0.014
 tried-0.014
 decided-0.014
 managed-0.014
 didn-0.014
Token climbed
Token ID19952
Feature activation-4.08e-02

Pos logit contributions
 remembered+0.046
'd+0.044
 managed+0.044
 tried+0.044
 didn+0.044
Neg logit contributions
igsaw-0.086
 difference-0.077
 types-0.075
opl-0.074
 art-0.073
Token over
Token ID625
Feature activation-2.90e-02

Pos logit contributions
igsaw+0.069
ummy+0.056
 new+0.056
 difference+0.055
opl+0.055
Neg logit contributions
 remembered-0.084
 didn-0.080
 tried-0.080
'd-0.080
 managed-0.079
Token it
Token ID340
Feature activation+6.65e-02

Pos logit contributions
igsaw+0.068
 difference+0.058
ummy+0.057
opl+0.057
umm+0.056
Neg logit contributions
 remembered-0.043
 tried-0.043
 decided-0.042
 could-0.042
 didn-0.042
Token.
Token ID13
Feature activation+8.73e-03

Pos logit contributions
 tried+0.085
 could+0.085
 decided+0.085
 remembered+0.083
 didn+0.082
Neg logit contributions
igsaw-0.170
 difference-0.148
opl-0.146
ummy-0.145
umm-0.144
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 could+0.015
 tried+0.015
 decided+0.014
 remembered+0.014
 didn+0.014
Neg logit contributions
igsaw-0.019
 difference-0.016
opl-0.016
umm-0.016
ummy-0.016
Token tried
Token ID3088
Feature activation+4.97e-02

Pos logit contributions
 remembered+0.134
 managed+0.123
'd+0.123
 decided+0.121
 tried+0.120
Neg logit contributions
igsaw-0.468
 types-0.431
 difference-0.428
 new-0.415
opl-0.415
Token to
Token ID284
Feature activation+5.06e-02

Pos logit contributions
 remembered+0.080
 tried+0.079
 decided+0.078
 could+0.078
 didn+0.077
Neg logit contributions
igsaw-0.109
 difference-0.094
ummy-0.092
opl-0.091
 types-0.088
Token jump
Token ID4391
Feature activation-7.14e-03

Pos logit contributions
 remembered+0.086
 could+0.084
'd+0.082
 decided+0.081
 didn+0.081
Neg logit contributions
igsaw-0.104
 difference-0.091
opl-0.086
 art-0.084
 bou-0.083
Token on
Token ID319
Feature activation-2.24e-02

Pos logit contributions
igsaw+0.015
ummy+0.013
 difference+0.013
opl+0.012
ay+0.012
Neg logit contributions
 remembered-0.012
 tried-0.012
 decided-0.012
 managed-0.011
'd-0.011
Token the
Token ID262
Feature activation-1.45e-01

Pos logit contributions
igsaw+0.047
 difference+0.040
ummy+0.039
opl+0.038
umm+0.037
Neg logit contributions
 remembered-0.039
 tried-0.038
 decided-0.038
 could-0.037
 didn-0.037
Token trees
Token ID7150
Feature activation+1.98e-02

Pos logit contributions
igsaw+0.247
ummy+0.195
 difference+0.186
opl+0.184
umm+0.183
Neg logit contributions
 tried-0.301
 decided-0.300
 remembered-0.299
 could-0.294
 didn-0.291
Token.
Token ID13
Feature activation+3.03e-03

Pos logit contributions
 remembered+0.031
 tried+0.030
 decided+0.029
 managed+0.029
 didn+0.029
Neg logit contributions
igsaw-0.044
 difference-0.039
ummy-0.038
opl-0.038
umm-0.036
Token Rover
Token ID36718
Feature activation+4.76e-02

Pos logit contributions
 could+0.005
 tried+0.005
 remembered+0.005
 decided+0.005
 didn+0.005
Neg logit contributions
igsaw-0.007
 difference-0.006
ummy-0.005
umm-0.005
 art-0.005
Token smiled
Token ID13541
Feature activation-2.09e-02

Pos logit contributions
 remembered+0.044
 managed+0.044
 unable+0.042
 accidentally+0.042
 wouldn+0.042
Neg logit contributions
igsaw-0.113
 types-0.104
 difference-0.101
 new-0.101
 bou-0.101
Token.
Token ID13
Feature activation-1.53e-03

Pos logit contributions
igsaw+0.054
 difference+0.047
ummy+0.047
opl+0.046
umm+0.046
Neg logit contributions
 remembered-0.026
 tried-0.025
 decided-0.025
 could-0.025
 didn-0.024
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
igsaw+0.004
 difference+0.003
ummy+0.003
 art+0.003
umm+0.003
Neg logit contributions
 could-0.002
 remembered-0.002
 tried-0.002
 decided-0.002
 didn-0.002
Token was
Token ID373
Feature activation+8.37e-02

Pos logit contributions
 accidentally+0.134
 managed+0.132
 tried+0.132
 unable+0.129
 wouldn+0.127
Neg logit contributions
igsaw-0.426
 types-0.406
 new-0.406
 difference-0.404
opl-0.395
Token having
Token ID1719
Feature activation+4.03e-02

Pos logit contributions
 remembered+0.126
 tried+0.124
 decided+0.123
 could+0.121
 didn+0.120
Neg logit contributions
igsaw-0.199
 difference-0.169
ummy-0.168
umm-0.164
opl-0.164
Token such
Token ID884
Feature activation-3.49e-02

Pos logit contributions
 tried+0.060
 remembered+0.060
 decided+0.060
 could+0.059
 didn+0.058
Neg logit contributions
igsaw-0.096
 difference-0.082
ummy-0.081
opl-0.080
umm-0.080
Token a
Token ID257
Feature activation-7.55e-02

Pos logit contributions
igsaw+0.068
 difference+0.057
opl+0.054
 types+0.054
ummy+0.054
Neg logit contributions
 remembered-0.063
 tried-0.062
 could-0.062
 wouldn-0.061
 decided-0.061
Token good
Token ID922
Feature activation-6.49e-02

Pos logit contributions
igsaw+0.119
 difference+0.094
ummy+0.093
opl+0.091
umm+0.088
Neg logit contributions
 remembered-0.169
 tried-0.168
 decided-0.166
 could-0.165
 didn-0.165
Token,
Token ID11
Feature activation-1.55e-03

Pos logit contributions
 remembered+0.059
 tried+0.059
 decided+0.058
 could+0.057
 didn+0.057
Neg logit contributions
igsaw-0.060
 difference-0.050
ummy-0.049
opl-0.048
umm-0.047
Token banana
Token ID25996
Feature activation+1.41e-02

Pos logit contributions
igsaw+0.003
 difference+0.003
ummy+0.003
umm+0.003
opl+0.002
Neg logit contributions
 tried-0.003
 could-0.003
 didn-0.003
 remembered-0.003
 decided-0.003
Token,
Token ID11
Feature activation+1.58e-02

Pos logit contributions
 remembered+0.026
 tried+0.026
 decided+0.025
 could+0.025
 didn+0.025
Neg logit contributions
igsaw-0.028
 difference-0.023
ummy-0.023
opl-0.023
umm-0.022
Token banana
Token ID25996
Feature activation+1.36e-02

Pos logit contributions
 remembered+0.025
 tried+0.024
 decided+0.024
 could+0.024
 didn+0.024
Neg logit contributions
igsaw-0.036
 difference-0.030
ummy-0.030
opl-0.030
umm-0.029
Token".
Token ID1911
Feature activation-1.91e-02

Pos logit contributions
 remembered+0.024
 tried+0.024
 could+0.024
 decided+0.024
 didn+0.024
Neg logit contributions
igsaw-0.027
 difference-0.023
ummy-0.022
opl-0.022
umm-0.022
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
igsaw+0.042
 difference+0.035
ummy+0.035
opl+0.034
umm+0.034
Neg logit contributions
 tried-0.033
 could-0.032
 decided-0.032
 didn-0.032
 remembered-0.031
Token said
Token ID531
Feature activation-6.52e-03

Pos logit contributions
 unable+0.154
 remembered+0.151
 managed+0.149
'd+0.142
 wouldn+0.142
Neg logit contributions
igsaw-0.426
 types-0.404
 batch-0.389
 new-0.389
 art-0.387
Token it
Token ID340
Feature activation+9.75e-02

Pos logit contributions
igsaw+0.013
ummy+0.011
 difference+0.010
opl+0.010
 types+0.010
Neg logit contributions
 remembered-0.012
 tried-0.012
 decided-0.011
 managed-0.011
'd-0.011
Token again
Token ID757
Feature activation+1.13e-02

Pos logit contributions
 remembered+0.109
 managed+0.102
 tried+0.100
 decided+0.097
 stumbled+0.097
Neg logit contributions
igsaw-0.236
 types-0.216
 difference-0.212
 new-0.211
ummy-0.211
Token and
Token ID290
Feature activation+2.92e-02

Pos logit contributions
 remembered+0.016
 tried+0.015
 managed+0.015
 wouldn+0.015
'd+0.015
Neg logit contributions
igsaw-0.026
ummy-0.023
 difference-0.022
 art-0.022
 types-0.021
Token again
Token ID757
Feature activation+8.51e-03

Pos logit contributions
 remembered+0.039
 tried+0.038
'd+0.037
 could+0.037
 decided+0.037
Neg logit contributions
igsaw-0.069
 difference-0.061
ummy-0.059
 types-0.058
opl-0.058
Token toys
Token ID14958
Feature activation-1.01e-02

Pos logit contributions
igsaw+0.063
 difference+0.054
ummy+0.053
opl+0.052
umm+0.052
Neg logit contributions
 tried-0.046
 remembered-0.046
 decided-0.045
 could-0.044
 didn-0.044
Token around
Token ID1088
Feature activation-9.77e-03

Pos logit contributions
igsaw+0.023
 difference+0.020
ummy+0.019
opl+0.019
umm+0.018
Neg logit contributions
 remembered-0.015
 tried-0.015
 decided-0.015
 could-0.015
 managed-0.015
Token the
Token ID262
Feature activation-1.41e-01

Pos logit contributions
igsaw+0.022
 difference+0.019
ummy+0.018
opl+0.018
umm+0.018
Neg logit contributions
 remembered-0.015
 tried-0.015
 decided-0.015
 could-0.015
 didn-0.015
Token room
Token ID2119
Feature activation+4.27e-02

Pos logit contributions
igsaw+0.232
umm+0.182
ummy+0.180
opl+0.177
 difference+0.172
Neg logit contributions
 decided-0.304
 remembered-0.300
 tried-0.299
 had-0.299
 didn-0.295
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
 remembered+0.061
 tried+0.061
 decided+0.060
 could+0.059
 didn+0.059
Neg logit contributions
igsaw-0.101
 difference-0.087
ummy-0.086
opl-0.085
umm-0.083
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 could+0.028
 decided+0.028
 tried+0.028
 remembered+0.028
 didn+0.027
Neg logit contributions
igsaw-0.033
 difference-0.029
opl-0.028
ummy-0.028
umm-0.027
Token had
Token ID550
Feature activation+1.11e-01

Pos logit contributions
 remembered+0.154
 managed+0.148
 tried+0.146
 decided+0.144
 could+0.141
Neg logit contributions
igsaw-0.470
 difference-0.424
 types-0.418
opl-0.410
 art-0.408
Token fun
Token ID1257
Feature activation-2.83e-02

Pos logit contributions
 remembered+0.188
 tried+0.184
 didn+0.182
 decided+0.181
 could+0.181
Neg logit contributions
igsaw-0.224
 difference-0.189
 art-0.183
ummy-0.183
 types-0.183
Token watching
Token ID4964
Feature activation-4.66e-02

Pos logit contributions
igsaw+0.062
 difference+0.053
ummy+0.053
opl+0.052
 types+0.050
Neg logit contributions
 remembered-0.046
 tried-0.045
 decided-0.045
 could-0.044
 managed-0.044
Token them
Token ID606
Feature activation+7.33e-02

Pos logit contributions
igsaw+0.093
umm+0.078
 difference+0.078
opl+0.077
ummy+0.076
Neg logit contributions
 didn-0.084
 tried-0.084
 could-0.083
 decided-0.082
 accidentally-0.080
Token zoom
Token ID19792
Feature activation-8.16e-02

Pos logit contributions
 tried+0.103
 decided+0.102
 didn+0.102
 could+0.101
 got+0.096
Neg logit contributions
igsaw-0.180
umm-0.157
ummy-0.153
 difference-0.151
opl-0.149
Token little
Token ID1310
Feature activation-7.27e-02

Pos logit contributions
igsaw+0.201
 difference+0.173
ummy+0.169
opl+0.162
umm+0.161
Neg logit contributions
 remembered-0.194
 tried-0.191
 decided-0.190
 managed-0.189
 could-0.187
Token fish
Token ID5916
Feature activation+2.59e-02

Pos logit contributions
igsaw+0.152
 difference+0.131
ummy+0.129
opl+0.124
umm+0.124
Neg logit contributions
 remembered-0.123
 tried-0.121
 decided-0.120
 didn-0.120
 managed-0.119
Token sh
Token ID427
Feature activation-1.04e-01

Pos logit contributions
pped+0.024
'd+0.024
 unable+0.023
Please+0.022
 hadn+0.022
Neg logit contributions
 types-0.054
 new-0.052
 difference-0.050
 sorts-0.050
 kinds-0.049
Tokenivered
Token ID6396
Feature activation-2.67e-03

Pos logit contributions
opl+0.157
igsaw+0.157
ummy+0.149
ay+0.135
az+0.133
Neg logit contributions
 could-0.219
'd-0.216
 remembered-0.211
 tried-0.211
 wouldn-0.208
Token.
Token ID13
Feature activation+4.81e-03

Pos logit contributions
igsaw+0.005
ummy+0.005
 new+0.005
 difference+0.004
 birthday+0.004
Neg logit contributions
'd-0.004
 managed-0.004
 remembered-0.004
 could-0.004
 tried-0.004
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 tried+0.007
 remembered+0.007
 could+0.007
 decided+0.007
 didn+0.007
Neg logit contributions
igsaw-0.011
 difference-0.010
opl-0.009
ummy-0.009
umm-0.009
Token needed
Token ID2622
Feature activation+2.63e-02

Pos logit contributions
pped+0.203
ppers+0.190
ppings+0.184
 unable+0.184
 managed+0.182
Neg logit contributions
 new-0.338
 types-0.307
 seats-0.298
 old-0.297
 blue-0.275
Token to
Token ID284
Feature activation+3.37e-02

Pos logit contributions
 remembered+0.049
'd+0.048
 didn+0.048
 managed+0.047
 decided+0.047
Neg logit contributions
igsaw-0.045
 new-0.040
 difference-0.040
ummy-0.040
 biggest-0.038
Token find
Token ID1064
Feature activation+1.67e-02

Pos logit contributions
 remembered+0.063
 could+0.061
'd+0.061
 managed+0.060
 wouldn+0.059
Neg logit contributions
igsaw-0.061
 difference-0.057
ummy-0.052
 birthday-0.051
opl-0.051
Token a
Token ID257
Feature activation-8.63e-02

Pos logit contributions
 remembered+0.028
 tried+0.028
 decided+0.028
 could+0.028
 didn+0.027
Neg logit contributions
igsaw-0.036
 difference-0.030
ummy-0.030
opl-0.029
umm-0.029
Token way
Token ID835
Feature activation-7.15e-03

Pos logit contributions
igsaw+0.133
 difference+0.107
ummy+0.103
opl+0.100
umm+0.097
Neg logit contributions
 remembered-0.194
 tried-0.193
 decided-0.192
 could-0.190
 didn-0.190
Token lying
Token ID9105
Feature activation-1.70e-02

Pos logit contributions
 remembered+0.049
 managed+0.048
 didn+0.047
 tried+0.046
 accidentally+0.045
Neg logit contributions
igsaw-0.053
 difference-0.048
ummy-0.048
opl-0.047
 new-0.045
Token in
Token ID287
Feature activation-1.91e-02

Pos logit contributions
igsaw+0.034
ummy+0.030
opl+0.029
ay+0.029
 difference+0.029
Neg logit contributions
 remembered-0.028
 managed-0.027
 didn-0.027
 tried-0.026
 decided-0.026
Token the
Token ID262
Feature activation-1.43e-01

Pos logit contributions
igsaw+0.037
 difference+0.031
ummy+0.030
opl+0.030
umm+0.029
Neg logit contributions
 remembered-0.036
 tried-0.036
 decided-0.035
 could-0.035
 didn-0.035
Token grass
Token ID8701
Feature activation-1.46e-02

Pos logit contributions
igsaw+0.236
 difference+0.188
ummy+0.185
opl+0.180
umm+0.175
Neg logit contributions
 remembered-0.312
 tried-0.310
 decided-0.308
 could-0.305
 didn-0.304
Token.
Token ID13
Feature activation+9.07e-03

Pos logit contributions
igsaw+0.029
ummy+0.026
opl+0.025
 difference+0.025
 types+0.024
Neg logit contributions
 remembered-0.025
 managed-0.024
 tried-0.024
 didn-0.024
 wouldn-0.023
Token He
Token ID679
Feature activation+1.75e-01

Pos logit contributions
 remembered+0.016
 tried+0.016
 managed+0.016
 decided+0.015
'd+0.015
Neg logit contributions
igsaw-0.018
ummy-0.015
 difference-0.015
opl-0.014
 types-0.014
Token decided
Token ID3066
Feature activation+5.69e-02

Pos logit contributions
pped+0.169
 wouldn+0.162
 unable+0.157
ppy+0.154
 managed+0.151
Neg logit contributions
 new-0.365
 types-0.350
igsaw-0.328
 old-0.317
 bou-0.317
Token to
Token ID284
Feature activation+4.32e-02

Pos logit contributions
ppy+0.120
 remembered+0.116
'd+0.115
 managed+0.114
 hadn+0.111
Neg logit contributions
 new-0.076
igsaw-0.065
 tours-0.058
 art-0.058
 festivals-0.058
Token take
Token ID1011
Feature activation-3.00e-02

Pos logit contributions
 remembered+0.075
 could+0.070
 managed+0.070
 decided+0.070
 wouldn+0.070
Neg logit contributions
igsaw-0.088
 difference-0.077
ummy-0.072
opl-0.071
 types-0.071
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
igsaw+0.071
 difference+0.061
ummy+0.060
opl+0.059
ay+0.058
Neg logit contributions
 remembered-0.043
 decided-0.042
'd-0.042
 didn-0.041
 could-0.041
Token spear
Token ID17514
Feature activation+4.34e-02

Pos logit contributions
igsaw+0.274
 difference+0.238
ummy+0.231
opl+0.225
 types+0.223
Neg logit contributions
 remembered-0.232
 decided-0.230
 tried-0.227
 didn-0.226
 managed-0.225
Token to
Token ID284
Feature activation+3.20e-02

Pos logit contributions
 remembered+0.073
 tried+0.071
 managed+0.070
 didn+0.069
 could+0.068
Neg logit contributions
igsaw-0.074
 new-0.062
 difference-0.062
ummy-0.061
 art-0.060
Token take
Token ID1011
Feature activation-4.33e-02

Pos logit contributions
 remembered+0.055
 tried+0.053
 could+0.053
 decided+0.053
 didn+0.052
Neg logit contributions
igsaw-0.067
 difference-0.057
ummy-0.055
opl-0.054
 types-0.053
Token a
Token ID257
Feature activation-9.55e-02

Pos logit contributions
igsaw+0.097
 difference+0.085
ummy+0.083
opl+0.082
umm+0.082
Neg logit contributions
 tried-0.068
 remembered-0.067
 decided-0.066
 could-0.065
 didn-0.064
Token journey
Token ID7002
Feature activation-4.97e-02

Pos logit contributions
igsaw+0.190
opl+0.169
umm+0.167
ummy+0.165
 difference+0.164
Neg logit contributions
 tried-0.168
 remembered-0.163
 could-0.162
 decided-0.161
 didn-0.161
Token to
Token ID284
Feature activation+1.38e-02

Pos logit contributions
igsaw+0.121
 difference+0.105
opl+0.103
umm+0.103
ummy+0.103
Neg logit contributions
 decided-0.068
 tried-0.067
 could-0.067
 remembered-0.066
 didn-0.064
Token outer
Token ID12076
Feature activation-2.91e-01

Pos logit contributions
 tried+0.020
 decided+0.020
 remembered+0.020
 could+0.019
 didn+0.019
Neg logit contributions
igsaw-0.033
umm-0.030
ummy-0.030
opl-0.028
 difference-0.027
Token space
Token ID2272
Feature activation-6.28e-02

Pos logit contributions
igsaw+0.486
umm+0.409
opl+0.398
ummy+0.362
bows+0.351
Neg logit contributions
 tried-0.598
 decided-0.597
 could-0.596
 remembered-0.595
 never-0.581
Token.
Token ID13
Feature activation-1.99e-04

Pos logit contributions
igsaw+0.153
 difference+0.131
ummy+0.130
opl+0.129
umm+0.128
Neg logit contributions
 decided-0.087
 tried-0.087
 remembered-0.086
 could-0.086
 didn-0.084
Token He
Token ID679
Feature activation+1.67e-01

Pos logit contributions
igsaw+0.000
 difference+0.000
ummy+0.000
opl+0.000
umm+0.000
Neg logit contributions
 remembered-0.000
 tried-0.000
 decided-0.000
 could-0.000
 didn-0.000
Token hopped
Token ID45230
Feature activation-1.04e-02

Pos logit contributions
 remembered+0.140
 accidentally+0.136
 tried+0.134
 didn+0.133
 wouldn+0.133
Neg logit contributions
igsaw-0.437
 types-0.403
 difference-0.390
 new-0.386
ummy-0.383
Token into
Token ID656
Feature activation-4.78e-02

Pos logit contributions
igsaw+0.023
 difference+0.019
ummy+0.019
opl+0.019
umm+0.018
Neg logit contributions
 remembered-0.017
 tried-0.017
 decided-0.017
 didn-0.016
 could-0.016
Token what
Token ID644
Feature activation+2.24e-02

Pos logit contributions
igsaw+0.161
ummy+0.133
umm+0.132
 difference+0.126
opl+0.125
Neg logit contributions
 tried-0.135
 remembered-0.134
 decided-0.132
 didn-0.132
 could-0.131
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
 remembered+0.027
 tried+0.026
 decided+0.026
 didn+0.025
 could+0.025
Neg logit contributions
igsaw-0.058
 difference-0.051
ummy-0.050
opl-0.050
 types-0.049
Token rocket
Token ID10701
Feature activation-4.51e-04

Pos logit contributions
igsaw+0.253
ummy+0.206
opl+0.199
 difference+0.195
umm+0.195
Neg logit contributions
 tried-0.274
 remembered-0.273
 decided-0.269
 didn-0.268
 could-0.267
Token discovered
Token ID5071
Feature activation-2.47e-02

Pos logit contributions
igsaw+0.001
 difference+0.001
opl+0.001
 types+0.001
ummy+0.001
Neg logit contributions
 remembered-0.000
 accidentally-0.000
 tried-0.000
 managed-0.000
 stumbled-0.000
Token in
Token ID287
Feature activation-1.96e-02

Pos logit contributions
igsaw+0.059
 difference+0.051
ummy+0.050
opl+0.049
umm+0.049
Neg logit contributions
 remembered-0.035
 tried-0.035
 decided-0.034
 could-0.034
 didn-0.034
Token outer
Token ID12076
Feature activation-2.79e-01

Pos logit contributions
igsaw+0.040
ummy+0.034
umm+0.034
 difference+0.034
opl+0.033
Neg logit contributions
 remembered-0.034
 tried-0.034
 decided-0.034
 could-0.033
 didn-0.033
Token space
Token ID2272
Feature activation-5.95e-02

Pos logit contributions
igsaw+0.418
umm+0.341
ummy+0.327
 batch+0.312
 Of+0.298
Neg logit contributions
 remembered-0.630
 could-0.627
 never-0.620
 tried-0.615
 decided-0.602
Token.
Token ID13
Feature activation-3.55e-03

Pos logit contributions
igsaw+0.137
 difference+0.118
ummy+0.117
opl+0.115
umm+0.113
Neg logit contributions
 remembered-0.089
 tried-0.088
 decided-0.087
 could-0.086
 didn-0.085
Token It
Token ID632
Feature activation+1.48e-01

Pos logit contributions
igsaw+0.007
 difference+0.006
ummy+0.006
opl+0.006
umm+0.006
Neg logit contributions
 could-0.006
 tried-0.006
 decided-0.006
 remembered-0.006
 didn-0.006
Token was
Token ID373
Feature activation+8.06e-02

Pos logit contributions
 remembered+0.130
 tried+0.129
 accidentally+0.127
 decided+0.123
 managed+0.120
Neg logit contributions
igsaw-0.400
 difference-0.370
 types-0.358
opl-0.356
ummy-0.350
Token a
Token ID257
Feature activation-6.93e-02

Pos logit contributions
 remembered+0.132
 tried+0.130
 decided+0.130
 didn+0.128
 could+0.128
Neg logit contributions
igsaw-0.180
 difference-0.151
ummy-0.150
opl-0.147
umm-0.146
Token the
Token ID262
Feature activation-1.46e-01

Pos logit contributions
 tried+0.016
 decided+0.016
 remembered+0.015
 could+0.015
 didn+0.015
Neg logit contributions
igsaw-0.021
umm-0.018
ummy-0.017
 difference-0.017
opl-0.017
Token stars
Token ID5788
Feature activation+8.31e-03

Pos logit contributions
igsaw+0.199
 batch+0.148
opl+0.145
ummy+0.143
 difference+0.139
Neg logit contributions
 could-0.369
 decided-0.364
 didn-0.361
 tried-0.360
'd-0.356
Token and
Token ID290
Feature activation+1.23e-02

Pos logit contributions
 remembered+0.013
 managed+0.013
 tried+0.013
 didn+0.012
 decided+0.012
Neg logit contributions
igsaw-0.017
ummy-0.015
 difference-0.015
opl-0.015
 art-0.015
Token planets
Token ID14705
Feature activation+1.14e-03

Pos logit contributions
 managed+0.019
 remembered+0.019
 wouldn+0.018
'd+0.018
 tried+0.018
Neg logit contributions
igsaw-0.025
 types-0.022
 difference-0.022
 art-0.022
 festivals-0.021
Token in
Token ID287
Feature activation-2.87e-02

Pos logit contributions
 remembered+0.002
 didn+0.002
 managed+0.002
 tried+0.002
 accidentally+0.002
Neg logit contributions
igsaw-0.002
 difference-0.002
ummy-0.002
 art-0.002
 new-0.002
Token outer
Token ID12076
Feature activation-2.77e-01

Pos logit contributions
igsaw+0.066
umm+0.060
ummy+0.058
 difference+0.057
opl+0.057
Neg logit contributions
 tried-0.043
 decided-0.042
 could-0.041
 remembered-0.041
 had-0.040
Token space
Token ID2272
Feature activation-5.55e-02

Pos logit contributions
igsaw+0.416
umm+0.378
ummy+0.336
 batch+0.336
 difference+0.324
Neg logit contributions
 remembered-0.614
 could-0.610
 tried-0.608
 never-0.600
 had-0.582
Token.
Token ID13
Feature activation+3.00e-03

Pos logit contributions
igsaw+0.129
 difference+0.111
ummy+0.110
opl+0.108
umm+0.105
Neg logit contributions
 remembered-0.082
 tried-0.081
 decided-0.080
 could-0.079
 didn-0.079
Token So
Token ID1406
Feature activation+2.63e-02

Pos logit contributions
 could+0.005
 decided+0.004
 tried+0.004
 didn+0.004
 had+0.004
Neg logit contributions
igsaw-0.007
opl-0.006
 difference-0.006
umm-0.006
 coll-0.006
Token one
Token ID530
Feature activation-3.81e-02

Pos logit contributions
 remembered+0.040
 tried+0.040
 decided+0.039
 could+0.039
 didn+0.038
Neg logit contributions
igsaw-0.061
 difference-0.052
ummy-0.051
opl-0.051
umm-0.050
Token day
Token ID1110
Feature activation+1.10e-02

Pos logit contributions
igsaw+0.064
opl+0.053
 difference+0.052
umm+0.050
ummy+0.049
Neg logit contributions
 tried-0.085
 remembered-0.084
 decided-0.084
 didn-0.083
 could-0.083
Token special
Token ID2041
Feature activation-1.18e-01

Pos logit contributions
igsaw+0.117
 difference+0.093
ummy+0.090
opl+0.088
umm+0.085
Neg logit contributions
 remembered-0.166
 tried-0.165
 decided-0.164
 could-0.163
 didn-0.162
Token star
Token ID3491
Feature activation-3.51e-02

Pos logit contributions
igsaw+0.200
opl+0.158
ummy+0.155
 difference+0.154
umm+0.153
Neg logit contributions
 remembered-0.252
 tried-0.252
 decided-0.248
 could-0.247
 didn-0.246
Token that
Token ID326
Feature activation-1.33e-03

Pos logit contributions
igsaw+0.076
 difference+0.065
ummy+0.064
opl+0.063
umm+0.061
Neg logit contributions
 remembered-0.057
 tried-0.057
 decided-0.056
 could-0.055
 didn-0.055
Token comes
Token ID2058
Feature activation+4.42e-02

Pos logit contributions
igsaw+0.003
 difference+0.003
ummy+0.003
opl+0.002
umm+0.002
Neg logit contributions
 remembered-0.002
 tried-0.002
 could-0.002
 decided-0.002
 didn-0.002
Token from
Token ID422
Feature activation-5.32e-02

Pos logit contributions
 decided+0.073
 remembered+0.073
 tried+0.073
 could+0.073
 didn+0.071
Neg logit contributions
igsaw-0.098
 difference-0.081
umm-0.080
ummy-0.080
opl-0.079
Token outer
Token ID12076
Feature activation-2.69e-01

Pos logit contributions
igsaw+0.116
 difference+0.095
ummy+0.095
umm+0.095
opl+0.093
Neg logit contributions
 tried-0.091
 remembered-0.090
 decided-0.089
 didn-0.087
 could-0.087
Token space
Token ID2272
Feature activation-5.60e-02

Pos logit contributions
igsaw+0.416
umm+0.342
ummy+0.317
opl+0.313
 batch+0.301
Neg logit contributions
 tried-0.610
 could-0.603
 remembered-0.600
 decided-0.591
 never-0.590
Token,"
Token ID553
Feature activation-4.79e-02

Pos logit contributions
igsaw+0.124
 difference+0.105
ummy+0.104
opl+0.103
umm+0.102
Neg logit contributions
 remembered-0.089
 tried-0.089
 decided-0.088
 could-0.087
 didn-0.086
Token she
Token ID673
Feature activation+1.38e-01

Pos logit contributions
igsaw+0.084
 difference+0.071
ummy+0.070
opl+0.067
 types+0.067
Neg logit contributions
 remembered-0.094
 managed-0.093
 tried-0.092
 didn-0.091
'd-0.091
Token explained
Token ID4893
Feature activation-3.03e-02

Pos logit contributions
 remembered+0.185
 managed+0.181
'd+0.181
 tried+0.180
 could+0.179
Neg logit contributions
igsaw-0.313
 difference-0.271
ummy-0.267
 types-0.266
opl-0.259
Token.
Token ID13
Feature activation-3.86e-03

Pos logit contributions
igsaw+0.076
opl+0.066
 difference+0.066
umm+0.065
ummy+0.064
Neg logit contributions
 tried-0.040
 decided-0.040
 remembered-0.040
 didn-0.039
 could-0.039
Token Emma
Token ID18966
Feature activation+8.36e-02

Pos logit contributions
igsaw+0.075
opl+0.073
umm+0.072
 difference+0.071
 decisions+0.067
Neg logit contributions
 had-0.042
 tried-0.041
 accidentally-0.041
 could-0.041
 didn-0.041
Token who
Token ID508
Feature activation+8.23e-02

Pos logit contributions
 managed+0.175
 wouldn+0.162
 tried+0.159
 remembered+0.159
'd+0.159
Neg logit contributions
igsaw-0.119
 difference-0.114
ummy-0.114
 birthday-0.111
opl-0.110
Token loved
Token ID6151
Feature activation-3.14e-02

Pos logit contributions
 managed+0.136
 possibly+0.128
 remembered+0.127
'd+0.126
 unable+0.125
Neg logit contributions
igsaw-0.138
opl-0.130
 new-0.130
 sorts-0.129
 art-0.123
Token learning
Token ID4673
Feature activation-2.60e-02

Pos logit contributions
igsaw+0.075
 difference+0.064
ummy+0.063
opl+0.062
umm+0.062
Neg logit contributions
 tried-0.046
 decided-0.046
 remembered-0.045
 could-0.045
 didn-0.045
Token about
Token ID546
Feature activation+2.73e-03

Pos logit contributions
igsaw+0.040
umm+0.039
Whe+0.033
opl+0.032
ummy+0.030
Neg logit contributions
 could-0.058
 didn-0.056
 decided-0.055
 tried-0.054
 came-0.054
Token outer
Token ID12076
Feature activation-2.68e-01

Pos logit contributions
 tried+0.005
 could+0.005
 accidentally+0.005
 decided+0.005
 didn+0.005
Neg logit contributions
igsaw-0.005
umm-0.005
ummy-0.004
ay-0.004
 difference-0.004
Token space
Token ID2272
Feature activation-6.20e-02

Pos logit contributions
igsaw+0.425
umm+0.343
 difference+0.321
Whe+0.312
opl+0.309
Neg logit contributions
 could-0.587
 remembered-0.553
 tried-0.551
 never-0.541
 decided-0.538
Token.
Token ID13
Feature activation-9.42e-05

Pos logit contributions
igsaw+0.149
 difference+0.129
ummy+0.127
opl+0.125
umm+0.123
Neg logit contributions
 remembered-0.087
 tried-0.087
 decided-0.086
 could-0.085
 didn-0.084
Token Everyday
Token ID48154
Feature activation-1.90e-02

Pos logit contributions
igsaw+0.000
opl+0.000
 difference+0.000
umm+0.000
 decisions+0.000
Neg logit contributions
 could-0.000
 tried-0.000
 didn-0.000
 had-0.000
 accidentally-0.000
Token,
Token ID11
Feature activation+2.11e-02

Pos logit contributions
igsaw+0.040
 difference+0.034
ummy+0.034
opl+0.033
umm+0.033
Neg logit contributions
 remembered-0.032
 tried-0.032
 decided-0.031
 could-0.031
 managed-0.031
Token she
Token ID673
Feature activation+1.34e-01

Pos logit contributions
 remembered+0.028
 tried+0.028
 decided+0.027
 could+0.027
 didn+0.027
Neg logit contributions
igsaw-0.053
 difference-0.046
opl-0.046
ummy-0.045
umm-0.044
Tokenizz
Token ID6457
Feature activation-4.88e-02

Pos logit contributions
igsaw+0.210
 difference+0.173
ummy+0.171
opl+0.169
umm+0.162
Neg logit contributions
 remembered-0.209
 tried-0.207
 decided-0.205
 could-0.205
 didn-0.203
Tokened
Token ID276
Feature activation-2.03e-02

Pos logit contributions
igsaw+0.107
 difference+0.091
ummy+0.089
opl+0.087
umm+0.087
Neg logit contributions
 remembered-0.080
 tried-0.080
 decided-0.079
 could-0.078
 didn-0.078
Token off
Token ID572
Feature activation-1.34e-02

Pos logit contributions
igsaw+0.048
opl+0.043
umm+0.043
ummy+0.042
 difference+0.041
Neg logit contributions
 tried-0.030
 decided-0.030
 remembered-0.029
 could-0.028
 didn-0.028
Token back
Token ID736
Feature activation-3.99e-03

Pos logit contributions
igsaw+0.032
 difference+0.028
opl+0.027
umm+0.027
ummy+0.027
Neg logit contributions
 tried-0.020
 decided-0.019
 remembered-0.019
 could-0.019
 didn-0.019
Token into
Token ID656
Feature activation-5.39e-02

Pos logit contributions
igsaw+0.008
umm+0.007
 difference+0.007
ummy+0.007
opl+0.007
Neg logit contributions
 tried-0.007
 remembered-0.006
 decided-0.006
 could-0.006
 had-0.006
Token outer
Token ID12076
Feature activation-2.67e-01

Pos logit contributions
igsaw+0.122
umm+0.114
ummy+0.113
opl+0.113
aj+0.107
Neg logit contributions
 tried-0.083
 remembered-0.079
 decided-0.077
 had-0.076
 managed-0.073
Token space
Token ID2272
Feature activation-5.00e-02

Pos logit contributions
igsaw+0.405
umm+0.349
Whe+0.329
opl+0.324
ummy+0.313
Neg logit contributions
 remembered-0.582
 never-0.571
 tried-0.566
 could-0.566
 decided-0.557
Token.
Token ID13
Feature activation-3.70e-03

Pos logit contributions
igsaw+0.123
 difference+0.105
ummy+0.105
opl+0.104
umm+0.104
Neg logit contributions
 remembered-0.068
 tried-0.067
 decided-0.067
 could-0.066
 didn-0.065
Token Earth
Token ID3668
Feature activation-1.50e-02

Pos logit contributions
igsaw+0.008
 difference+0.007
ummy+0.007
opl+0.007
 art+0.007
Neg logit contributions
 could-0.006
 decided-0.006
 tried-0.006
 didn-0.006
 had-0.006
Token was
Token ID373
Feature activation+7.93e-02

Pos logit contributions
igsaw+0.039
 difference+0.035
ummy+0.034
 types+0.034
opl+0.033
Neg logit contributions
 remembered-0.015
 tried-0.014
 accidentally-0.014
 decided-0.014
 managed-0.014
Token an
Token ID281
Feature activation-1.14e-01

Pos logit contributions
 remembered+0.117
 decided+0.115
 tried+0.114
 could+0.113
 managed+0.112
Neg logit contributions
igsaw-0.194
ummy-0.165
 difference-0.163
umm-0.162
opl-0.160
Token,
Token ID11
Feature activation+1.04e-02

Pos logit contributions
igsaw+0.104
 difference+0.090
ummy+0.089
opl+0.088
umm+0.087
Neg logit contributions
 remembered-0.053
 tried-0.053
 decided-0.052
 could-0.052
 didn-0.051
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
 remembered+0.013
 tried+0.013
 decided+0.013
 could+0.013
 didn+0.013
Neg logit contributions
igsaw-0.026
 difference-0.023
ummy-0.023
opl-0.023
umm-0.022
Token even
Token ID772
Feature activation+4.56e-02

Pos logit contributions
 tried+0.027
 remembered+0.027
 decided+0.027
 could+0.027
 didn+0.027
Neg logit contributions
igsaw-0.042
 difference-0.036
ummy-0.036
opl-0.036
umm-0.035
Token went
Token ID1816
Feature activation+4.60e-03

Pos logit contributions
 tried+0.074
 remembered+0.074
 decided+0.073
 could+0.073
 didn+0.071
Neg logit contributions
igsaw-0.104
ummy-0.089
opl-0.088
 difference-0.088
umm-0.086
Token to
Token ID284
Feature activation+1.61e-02

Pos logit contributions
 remembered+0.007
 tried+0.007
 decided+0.007
 could+0.007
 didn+0.007
Neg logit contributions
igsaw-0.010
 difference-0.009
ummy-0.009
opl-0.009
umm-0.009
Token outer
Token ID12076
Feature activation-2.67e-01

Pos logit contributions
 tried+0.026
 never+0.026
 remembered+0.025
 decided+0.025
 accidentally+0.025
Neg logit contributions
igsaw-0.036
umm-0.035
ummy-0.033
ay-0.031
 types-0.030
Token space
Token ID2272
Feature activation-5.22e-02

Pos logit contributions
igsaw+0.459
umm+0.372
opl+0.363
ummy+0.333
 batch+0.317
Neg logit contributions
 remembered-0.580
 could-0.579
 tried-0.571
 decided-0.562
 never-0.559
Token!
Token ID0
Feature activation-5.36e-03

Pos logit contributions
igsaw+0.125
ummy+0.106
 difference+0.106
umm+0.106
opl+0.105
Neg logit contributions
 could-0.076
 decided-0.075
 remembered-0.075
 tried-0.074
 didn-0.073
Token It
Token ID632
Feature activation+1.46e-01

Pos logit contributions
igsaw+0.012
 types+0.010
 difference+0.010
opl+0.010
umm+0.010
Neg logit contributions
 could-0.009
 didn-0.008
 had-0.008
 decided-0.008
 accidentally-0.008
Token was
Token ID373
Feature activation+7.74e-02

Pos logit contributions
 remembered+0.133
 tried+0.131
 decided+0.125
 managed+0.122
 could+0.121
Neg logit contributions
igsaw-0.402
 difference-0.365
ummy-0.354
 types-0.353
opl-0.351
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
 remembered+0.126
 decided+0.124
 tried+0.123
 didn+0.123
 could+0.121
Neg logit contributions
igsaw-0.175
 difference-0.147
ummy-0.147
umm-0.145
opl-0.144
Token looked
Token ID3114
Feature activation+6.25e-02

Pos logit contributions
 remembered+0.137
 unable+0.133
 managed+0.132
pped+0.132
pper+0.125
Neg logit contributions
 new-0.173
igsaw-0.170
 difference-0.163
 art-0.163
 biggest-0.163
Token like
Token ID588
Feature activation+2.84e-02

Pos logit contributions
 remembered+0.112
 tried+0.110
'd+0.110
 managed+0.109
 decided+0.107
Neg logit contributions
igsaw-0.118
 difference-0.104
 new-0.102
ummy-0.098
 art-0.097
Token it
Token ID340
Feature activation+7.69e-02

Pos logit contributions
 remembered+0.049
 tried+0.047
 decided+0.047
 managed+0.047
 could+0.046
Neg logit contributions
igsaw-0.059
 difference-0.050
ummy-0.049
opl-0.047
 art-0.047
Token came
Token ID1625
Feature activation+4.07e-02

Pos logit contributions
 remembered+0.096
 managed+0.090
 tried+0.086
 decided+0.086
 stumbled+0.085
Neg logit contributions
igsaw-0.179
 new-0.165
 difference-0.162
ummy-0.162
 types-0.160
Token from
Token ID422
Feature activation-4.62e-02

Pos logit contributions
 remembered+0.072
 tried+0.072
 decided+0.071
 could+0.070
 didn+0.070
Neg logit contributions
igsaw-0.084
 difference-0.070
ummy-0.069
opl-0.068
umm-0.067
Token outer
Token ID12076
Feature activation-2.67e-01

Pos logit contributions
igsaw+0.094
 difference+0.080
ummy+0.078
opl+0.077
umm+0.075
Neg logit contributions
 remembered-0.081
 tried-0.080
 decided-0.079
 could-0.079
 didn-0.079
Token space
Token ID2272
Feature activation-5.82e-02

Pos logit contributions
igsaw+0.414
umm+0.325
ummy+0.318
opl+0.309
 difference+0.303
Neg logit contributions
 tried-0.611
 remembered-0.600
 could-0.600
 decided-0.589
 never-0.573
Token.
Token ID13
Feature activation-2.69e-03

Pos logit contributions
igsaw+0.132
 difference+0.114
ummy+0.112
opl+0.110
 types+0.108
Neg logit contributions
 remembered-0.089
 tried-0.087
 decided-0.085
 didn-0.085
 managed-0.085
Token
Token ID198
Feature activation-5.58e-02

Pos logit contributions
igsaw+0.006
 difference+0.005
opl+0.005
umm+0.005
ummy+0.005
Neg logit contributions
 could-0.005
 tried-0.005
 didn-0.004
 decided-0.004
 remembered-0.004
Token
Token ID198
Feature activation-5.37e-02

Pos logit contributions
igsaw+0.131
umm+0.114
 difference+0.110
opl+0.108
ummy+0.105
Neg logit contributions
 could-0.089
 tried-0.087
 didn-0.085
 decided-0.084
 remembered-0.083
TokenL
Token ID43
Feature activation-4.89e-02

Pos logit contributions
igsaw+0.097
ummy+0.082
ay+0.078
 difference+0.077
opl+0.076
Neg logit contributions
 remembered-0.103
 managed-0.101
 wouldn-0.101
'd-0.099
 tried-0.099
Token that
Token ID326
Feature activation+8.99e-03

Pos logit contributions
 remembered+0.027
 tried+0.026
 decided+0.025
 managed+0.025
 didn+0.025
Neg logit contributions
igsaw-0.042
 difference-0.036
ummy-0.035
 types-0.035
 art-0.034
Token they
Token ID484
Feature activation+1.21e-01

Pos logit contributions
 could+0.010
 remembered+0.009
 tried+0.009
 decided+0.009
 didn+0.009
Neg logit contributions
igsaw-0.026
ummy-0.023
opl-0.023
umm-0.022
 difference-0.022
Token were
Token ID547
Feature activation+6.92e-02

Pos logit contributions
 remembered+0.086
 accidentally+0.083
 tried+0.079
Please+0.078
 unable+0.077
Neg logit contributions
igsaw-0.316
 types-0.297
 art-0.290
 difference-0.289
ay-0.287
Token going
Token ID1016
Feature activation-2.24e-02

Pos logit contributions
 remembered+0.111
 tried+0.110
 decided+0.109
 could+0.108
 didn+0.107
Neg logit contributions
igsaw-0.156
 difference-0.132
ummy-0.131
opl-0.128
umm-0.127
Token to
Token ID284
Feature activation+3.13e-02

Pos logit contributions
igsaw+0.044
 difference+0.037
ummy+0.037
opl+0.036
 new+0.036
Neg logit contributions
 remembered-0.040
 tried-0.039
 didn-0.038
 decided-0.038
 managed-0.038
Token outer
Token ID12076
Feature activation-2.64e-01

Pos logit contributions
 tried+0.046
 decided+0.044
 remembered+0.044
 could+0.044
 didn+0.044
Neg logit contributions
igsaw-0.076
ummy-0.069
umm-0.067
opl-0.063
 difference-0.062
Token space
Token ID2272
Feature activation-5.72e-02

Pos logit contributions
igsaw+0.420
umm+0.344
opl+0.328
ummy+0.319
 bou+0.306
Neg logit contributions
 could-0.605
 never-0.591
 remembered-0.584
 tried-0.583
 decided-0.574
Token.
Token ID13
Feature activation+1.45e-04

Pos logit contributions
igsaw+0.133
 difference+0.114
ummy+0.113
opl+0.111
umm+0.109
Neg logit contributions
 remembered-0.085
 tried-0.084
 decided-0.083
 could-0.083
 didn-0.082
Token They
Token ID1119
Feature activation+1.62e-01

Pos logit contributions
 could+0.000
 tried+0.000
 decided+0.000
 didn+0.000
 remembered+0.000
Neg logit contributions
igsaw-0.000
opl-0.000
 difference-0.000
umm-0.000
ummy-0.000
Token watched
Token ID7342
Feature activation-4.49e-02

Pos logit contributions
 remembered+0.138
 accidentally+0.137
 wouldn+0.133
 didn+0.131
 tried+0.129
Neg logit contributions
igsaw-0.405
 types-0.377
 parties-0.360
 art-0.360
 difference-0.359
Token out
Token ID503
Feature activation+4.19e-03

Pos logit contributions
igsaw+0.112
opl+0.097
 difference+0.095
umm+0.094
ummy+0.091
Neg logit contributions
 tried-0.064
 could-0.064
 didn-0.064
 decided-0.061
 remembered-0.061
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
igsaw+0.033
 difference+0.029
umm+0.029
opl+0.029
ummy+0.028
Neg logit contributions
 tried-0.020
 could-0.020
 decided-0.019
 remembered-0.019
 managed-0.019
Token Soon
Token ID15894
Feature activation+2.45e-02

Pos logit contributions
 tried+0.002
 could+0.002
 remembered+0.002
 decided+0.002
 didn+0.002
Neg logit contributions
igsaw-0.003
 difference-0.002
ummy-0.002
opl-0.002
umm-0.002
Token they
Token ID484
Feature activation+1.28e-01

Pos logit contributions
 remembered+0.040
 tried+0.039
 decided+0.039
 could+0.038
'd+0.038
Neg logit contributions
igsaw-0.052
 difference-0.045
ummy-0.044
 types-0.042
umm-0.042
Token were
Token ID547
Feature activation+7.21e-02

Pos logit contributions
 accidentally+0.098
 tried+0.097
 wouldn+0.095
 remembered+0.094
 didn+0.094
Neg logit contributions
igsaw-0.329
ay-0.302
 difference-0.300
 types-0.299
 bou-0.297
Token in
Token ID287
Feature activation-1.09e-02

Pos logit contributions
 remembered+0.116
 tried+0.115
 decided+0.114
 could+0.112
 didn+0.112
Neg logit contributions
igsaw-0.160
 difference-0.136
ummy-0.134
opl-0.132
umm-0.130
Token outer
Token ID12076
Feature activation-2.63e-01

Pos logit contributions
igsaw+0.022
ummy+0.020
umm+0.019
opl+0.019
 difference+0.018
Neg logit contributions
 tried-0.019
 remembered-0.019
 had-0.018
 decided-0.018
 managed-0.018
Token space
Token ID2272
Feature activation-5.32e-02

Pos logit contributions
igsaw+0.406
umm+0.346
ummy+0.322
 batch+0.305
opl+0.297
Neg logit contributions
 remembered-0.592
 tried-0.584
 could-0.580
 never-0.578
 had-0.563
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
igsaw+0.125
 difference+0.107
ummy+0.106
opl+0.105
umm+0.103
Neg logit contributions
 remembered-0.078
 tried-0.077
 decided-0.076
 could-0.076
 didn-0.075
Token having
Token ID1719
Feature activation+2.63e-02

Pos logit contributions
 remembered+0.022
 tried+0.022
 decided+0.022
 could+0.022
 didn+0.022
Neg logit contributions
igsaw-0.037
ummy-0.032
 difference-0.032
opl-0.031
umm-0.031
Token fun
Token ID1257
Feature activation-3.83e-02

Pos logit contributions
 tried+0.054
 remembered+0.054
 decided+0.054
 could+0.053
 didn+0.053
Neg logit contributions
igsaw-0.047
ummy-0.039
 difference-0.038
umm-0.037
opl-0.037
Token.
Token ID13
Feature activation-2.96e-04

Pos logit contributions
igsaw+0.087
 difference+0.074
ummy+0.073
opl+0.072
umm+0.070
Neg logit contributions
 remembered-0.060
 tried-0.059
 decided-0.058
 could-0.057
 didn-0.057
Token mouse
Token ID10211
Feature activation+6.98e-02

Pos logit contributions
igsaw+0.249
 difference+0.210
ummy+0.206
opl+0.203
umm+0.199
Neg logit contributions
 remembered-0.212
 tried-0.210
 decided-0.208
 could-0.206
 didn-0.205
Token never
Token ID1239
Feature activation+8.74e-02

Pos logit contributions
 tried+0.066
 remembered+0.065
 accidentally+0.064
 decided+0.064
 stumbled+0.063
Neg logit contributions
igsaw-0.173
 difference-0.164
opl-0.160
ummy-0.156
 types-0.155
Token suffered
Token ID6989
Feature activation-2.47e-02

Pos logit contributions
'd+0.048
 remembered+0.047
 decided+0.047
 could+0.046
 tried+0.045
Neg logit contributions
igsaw-0.261
 difference-0.243
 types-0.232
ummy-0.229
 new-0.228
Token from
Token ID422
Feature activation-5.83e-02

Pos logit contributions
igsaw+0.053
 difference+0.045
ummy+0.045
opl+0.044
umm+0.043
Neg logit contributions
 remembered-0.040
 tried-0.040
 decided-0.039
 could-0.039
 didn-0.039
Token the
Token ID262
Feature activation-1.51e-01

Pos logit contributions
igsaw+0.115
 difference+0.097
ummy+0.095
opl+0.093
umm+0.090
Neg logit contributions
 remembered-0.107
 tried-0.106
 decided-0.105
 could-0.104
 didn-0.103
Token rud
Token ID28906
Feature activation-2.35e-01

Pos logit contributions
igsaw+0.290
 difference+0.239
ummy+0.236
opl+0.233
umm+0.227
Neg logit contributions
 remembered-0.288
 tried-0.286
 decided-0.282
 could-0.280
 didn-0.279
Tokeneness
Token ID9449
Feature activation+9.82e-03

Pos logit contributions
igsaw+0.413
ummy+0.344
 difference+0.339
opl+0.339
umm+0.329
Neg logit contributions
 decided-0.490
 tried-0.489
 remembered-0.489
 could-0.484
 didn-0.480
Token of
Token ID286
Feature activation-8.08e-02

Pos logit contributions
 remembered+0.017
 tried+0.017
 decided+0.016
 managed+0.016
 didn+0.016
Neg logit contributions
igsaw-0.020
 difference-0.017
ummy-0.017
opl-0.016
umm-0.016
Token other
Token ID584
Feature activation-7.93e-02

Pos logit contributions
igsaw+0.153
 difference+0.128
ummy+0.124
opl+0.123
 types+0.119
Neg logit contributions
 remembered-0.155
 tried-0.153
 decided-0.152
 could-0.151
 didn-0.150
Token animals
Token ID4695
Feature activation+2.63e-02

Pos logit contributions
igsaw+0.146
 difference+0.123
ummy+0.120
opl+0.117
umm+0.115
Neg logit contributions
 remembered-0.155
 tried-0.154
 decided-0.153
 could-0.151
 didn-0.151
Token again
Token ID757
Feature activation-8.86e-03

Pos logit contributions
 remembered+0.044
 tried+0.042
 decided+0.042
 managed+0.041
 didn+0.041
Neg logit contributions
igsaw-0.054
 difference-0.049
ummy-0.046
opl-0.045
 new-0.043
Token getting
Token ID1972
Feature activation-1.03e-02

Pos logit contributions
 remembered+0.097
 tried+0.096
 decided+0.094
 could+0.094
 didn+0.093
Neg logit contributions
igsaw-0.202
 difference-0.178
ummy-0.175
opl-0.174
umm-0.169
Token very
Token ID845
Feature activation+3.96e-02

Pos logit contributions
igsaw+0.021
 difference+0.019
ummy+0.018
 types+0.018
 new+0.018
Neg logit contributions
 remembered-0.016
'd-0.016
 didn-0.016
 tried-0.016
 could-0.016
Token annoyed
Token ID25602
Feature activation-5.02e-02

Pos logit contributions
 remembered+0.062
 tried+0.061
 decided+0.061
 didn+0.059
 could+0.059
Neg logit contributions
igsaw-0.095
 difference-0.080
opl-0.079
ummy-0.079
umm-0.078
Token with
Token ID351
Feature activation-2.94e-02

Pos logit contributions
igsaw+0.113
ummy+0.100
 difference+0.098
opl+0.094
 types+0.093
Neg logit contributions
 remembered-0.079
 managed-0.076
'd-0.076
 tried-0.075
 decided-0.074
Token his
Token ID465
Feature activation-4.16e-02

Pos logit contributions
igsaw+0.064
 difference+0.055
opl+0.053
ummy+0.053
 types+0.052
Neg logit contributions
 remembered-0.049
 decided-0.047
 tried-0.047
'd-0.046
 could-0.046
Token rud
Token ID28906
Feature activation-2.33e-01

Pos logit contributions
igsaw+0.082
umm+0.066
 difference+0.065
ummy+0.064
opl+0.064
Neg logit contributions
 tried-0.080
 remembered-0.080
 didn-0.078
 could-0.077
 dropped-0.075
Tokeneness
Token ID9449
Feature activation+1.71e-02

Pos logit contributions
igsaw+0.427
ummy+0.355
 difference+0.354
opl+0.346
umm+0.338
Neg logit contributions
 tried-0.460
 remembered-0.460
 decided-0.457
 could-0.452
 didn-0.449
Token and
Token ID290
Feature activation+1.18e-02

Pos logit contributions
 remembered+0.028
 tried+0.026
 decided+0.026
 managed+0.026
'd+0.026
Neg logit contributions
igsaw-0.037
 difference-0.032
ummy-0.031
opl-0.031
 types-0.030
Token complained
Token ID13832
Feature activation-3.16e-02

Pos logit contributions
'd+0.013
 remembered+0.012
 managed+0.012
 stumbled+0.012
pped+0.011
Neg logit contributions
igsaw-0.028
 types-0.026
 difference-0.026
 art-0.026
 new-0.026
Token that
Token ID326
Feature activation+3.76e-03

Pos logit contributions
igsaw+0.066
 difference+0.056
ummy+0.056
opl+0.054
 types+0.053
Neg logit contributions
 remembered-0.053
 tried-0.052
 decided-0.052
 managed-0.051
 could-0.051
Token he
Token ID339
Feature activation+1.31e-01

Pos logit contributions
 remembered+0.004
 tried+0.004
 decided+0.004
 could+0.004
 didn+0.004
Neg logit contributions
igsaw-0.010
 difference-0.009
ummy-0.009
opl-0.009
umm-0.008
Token
Token ID198
Feature activation-1.73e-02

Pos logit contributions
 tried+0.011
 remembered+0.010
 decided+0.010
 could+0.010
 didn+0.010
Neg logit contributions
igsaw-0.020
 difference-0.017
ummy-0.017
opl-0.017
umm-0.017
Token
Token ID198
Feature activation-1.99e-02

Pos logit contributions
igsaw+0.037
 difference+0.032
ummy+0.031
umm+0.031
opl+0.031
Neg logit contributions
 remembered-0.029
 tried-0.029
 decided-0.029
 could-0.028
 didn-0.028
TokenShe
Token ID3347
Feature activation+1.60e-01

Pos logit contributions
igsaw+0.045
 difference+0.039
ummy+0.038
opl+0.038
umm+0.037
Neg logit contributions
 remembered-0.031
 tried-0.030
 decided-0.030
 could-0.030
 didn-0.030
Token takes
Token ID2753
Feature activation-3.46e-02

Pos logit contributions
 remembered+0.146
 tried+0.140
 managed+0.140
 decided+0.137
 could+0.137
Neg logit contributions
igsaw-0.425
 difference-0.385
 types-0.374
ummy-0.372
opl-0.370
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
igsaw+0.081
 difference+0.075
opl+0.073
ummy+0.073
umm+0.072
Neg logit contributions
 tried-0.052
 decided-0.048
 managed-0.047
 could-0.047
 remembered-0.047
Token v
Token ID410
Feature activation-2.24e-01

Pos logit contributions
igsaw+0.288
opl+0.245
ummy+0.238
umm+0.234
 difference+0.228
Neg logit contributions
 tried-0.289
 remembered-0.280
 could-0.279
 didn-0.278
 decided-0.277
Tokenase
Token ID589
Feature activation+3.61e-03

Pos logit contributions
igsaw+0.399
 difference+0.360
 festivals+0.326
 types+0.320
 bou+0.308
Neg logit contributions
 could-0.438
 tried-0.424
 got-0.424
 didn-0.423
 decided-0.421
Token from
Token ID422
Feature activation-5.74e-02

Pos logit contributions
 decided+0.005
 tried+0.005
 could+0.005
 remembered+0.005
 didn+0.005
Neg logit contributions
igsaw-0.009
 difference-0.008
opl-0.008
ummy-0.008
umm-0.008
Token them
Token ID606
Feature activation+6.26e-02

Pos logit contributions
igsaw+0.122
 difference+0.105
ummy+0.102
opl+0.101
umm+0.101
Neg logit contributions
 tried-0.097
 remembered-0.096
 decided-0.094
 didn-0.093
 could-0.093
Token and
Token ID290
Feature activation+1.75e-02

Pos logit contributions
 remembered+0.090
 tried+0.089
 decided+0.088
 could+0.087
 didn+0.086
Neg logit contributions
igsaw-0.149
 difference-0.128
ummy-0.127
opl-0.125
umm-0.123
Token puts
Token ID7584
Feature activation-4.51e-02

Pos logit contributions
'd+0.029
 remembered+0.029
 managed+0.028
 didn+0.028
 decided+0.028
Neg logit contributions
igsaw-0.034
 difference-0.030
 types-0.029
 art-0.029
opl-0.028
Token punished
Token ID16851
Feature activation-3.16e-02

Pos logit contributions
'd+0.125
 could+0.122
 tried+0.122
 remembered+0.121
 managed+0.120
Neg logit contributions
igsaw-0.139
 difference-0.118
 new-0.114
ummy-0.114
 art-0.112
Token for
Token ID329
Feature activation-3.82e-02

Pos logit contributions
igsaw+0.064
ummy+0.056
 difference+0.056
opl+0.053
 new+0.052
Neg logit contributions
 remembered-0.054
 tried-0.052
 managed-0.052
'd-0.051
 decided-0.051
Token knocking
Token ID21500
Feature activation-3.19e-02

Pos logit contributions
igsaw+0.071
 art+0.062
 new+0.061
 difference+0.060
ummy+0.059
Neg logit contributions
'd-0.071
 could-0.070
 remembered-0.069
 decided-0.067
 managed-0.067
Token over
Token ID625
Feature activation-3.72e-02

Pos logit contributions
igsaw+0.070
ummy+0.060
 difference+0.060
opl+0.058
 types+0.057
Neg logit contributions
 remembered-0.050
'd-0.049
 tried-0.048
 managed-0.047
 didn-0.047
Token the
Token ID262
Feature activation-1.71e-01

Pos logit contributions
igsaw+0.077
 difference+0.064
ummy+0.064
opl+0.063
umm+0.062
Neg logit contributions
 remembered-0.065
 tried-0.065
 decided-0.064
 could-0.063
 didn-0.063
Token v
Token ID410
Feature activation-2.24e-01

Pos logit contributions
igsaw+0.320
opl+0.255
ummy+0.252
 difference+0.249
 types+0.243
Neg logit contributions
 tried-0.341
 decided-0.339
 remembered-0.338
 could-0.334
 didn-0.329
Tokenase
Token ID589
Feature activation-3.52e-03

Pos logit contributions
igsaw+0.358
 difference+0.302
 types+0.278
ummy+0.271
 bou+0.266
Neg logit contributions
 decided-0.477
 could-0.475
 tried-0.474
 remembered-0.470
 didn-0.467
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
igsaw+0.009
 difference+0.007
ummy+0.007
opl+0.007
umm+0.007
Neg logit contributions
 tried-0.005
 remembered-0.005
 decided-0.005
 could-0.005
 didn-0.005
Token Tim
Token ID5045
Feature activation+5.41e-02

Pos logit contributions
igsaw+0.023
 difference+0.020
umm+0.019
 types+0.019
opl+0.019
Neg logit contributions
 could-0.018
 didn-0.018
 tried-0.018
 decided-0.018
 remembered-0.017
Tokenmy
Token ID1820
Feature activation+1.12e-01

Pos logit contributions
 stumbled+0.107
 managed+0.107
 accidentally+0.105
 could+0.105
'd+0.105
Neg logit contributions
igsaw-0.074
 difference-0.069
ummy-0.066
 types-0.064
opl-0.062
Token felt
Token ID2936
Feature activation+4.69e-02

Pos logit contributions
'd+0.129
Please+0.122
pper+0.121
pped+0.121
 stumbled+0.120
Neg logit contributions
 difference-0.222
 new-0.211
 memories-0.203
 batch-0.201
 art-0.200
Token fold
Token ID5591
Feature activation-6.19e-02

Pos logit contributions
 remembered+0.036
'd+0.035
 managed+0.034
 didn+0.033
 wouldn+0.033
Neg logit contributions
igsaw-0.042
 art-0.036
 difference-0.036
 new-0.036
 birthday-0.034
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
igsaw+0.172
opl+0.154
umm+0.151
 difference+0.151
ummy+0.149
Neg logit contributions
 tried-0.070
 could-0.067
 decided-0.065
 didn-0.065
 remembered-0.064
Token knob
Token ID36842
Feature activation-9.70e-03

Pos logit contributions
igsaw+0.295
 difference+0.248
ummy+0.245
opl+0.238
 types+0.233
Neg logit contributions
 remembered-0.281
 tried-0.276
 decided-0.275
 managed-0.271
 could-0.271
Token into
Token ID656
Feature activation-5.75e-02

Pos logit contributions
igsaw+0.022
 difference+0.019
ummy+0.019
opl+0.019
umm+0.018
Neg logit contributions
 tried-0.015
 remembered-0.015
 decided-0.014
 could-0.014
 didn-0.014
Token an
Token ID281
Feature activation-1.48e-01

Pos logit contributions
igsaw+0.125
opl+0.109
 difference+0.106
ummy+0.105
umm+0.105
Neg logit contributions
 tried-0.098
 remembered-0.096
 decided-0.095
 could-0.094
 didn-0.093
Token orig
Token ID1796
Feature activation-2.23e-01

Pos logit contributions
igsaw+0.270
 difference+0.219
ummy+0.212
opl+0.210
umm+0.208
Neg logit contributions
 decided-0.297
 tried-0.296
 remembered-0.295
 didn-0.287
 could-0.286
Tokenami
Token ID6277
Feature activation-4.53e-02

Pos logit contributions
igsaw+0.360
 difference+0.291
ummy+0.271
opl+0.268
 types+0.263
Neg logit contributions
 remembered-0.484
 tried-0.480
 decided-0.478
 could-0.471
 didn-0.468
Token figure
Token ID3785
Feature activation-7.96e-03

Pos logit contributions
igsaw+0.090
 difference+0.079
ummy+0.075
 art+0.074
opl+0.073
Neg logit contributions
 remembered-0.081
 tried-0.078
 decided-0.077
 didn-0.077
 managed-0.077
Token.
Token ID13
Feature activation-5.49e-03

Pos logit contributions
igsaw+0.018
 difference+0.015
ummy+0.015
opl+0.015
 types+0.015
Neg logit contributions
 remembered-0.012
 tried-0.012
 decided-0.012
 didn-0.012
 managed-0.011
Token But
Token ID887
Feature activation+5.38e-02

Pos logit contributions
igsaw+0.013
 difference+0.012
opl+0.012
ummy+0.011
 coll+0.011
Neg logit contributions
 could-0.008
 decided-0.008
 tried-0.008
 didn-0.008
 remembered-0.007
Token no
Token ID645
Feature activation-1.36e-02

Pos logit contributions
 remembered+0.079
'd+0.078
 tried+0.077
 decided+0.077
 managed+0.076
Neg logit contributions
igsaw-0.121
ummy-0.104
 difference-0.104
 art-0.099
 types-0.099
Token they
Token ID484
Feature activation+1.19e-01

Pos logit contributions
igsaw+0.047
 difference+0.041
ummy+0.041
opl+0.040
umm+0.040
Neg logit contributions
 remembered-0.023
 tried-0.023
 decided-0.022
 could-0.022
 didn-0.022
Token had
Token ID550
Feature activation+8.80e-02

Pos logit contributions
pper+0.128
 accidentally+0.118
ppy+0.115
 swoop+0.110
ged+0.107
Neg logit contributions
 new-0.208
 parties-0.196
 seats-0.195
 tickets-0.192
 birthday-0.191
Token never
Token ID1239
Feature activation+7.58e-02

Pos logit contributions
'd+0.082
 could+0.080
 decided+0.078
 remembered+0.077
 wouldn+0.076
Neg logit contributions
igsaw-0.215
 new-0.204
 art-0.204
 difference-0.204
 birthday-0.202
Token taken
Token ID2077
Feature activation-6.56e-02

Pos logit contributions
 wouldn+0.063
'd+0.063
 remembered+0.062
 could+0.062
 didn+0.061
Neg logit contributions
igsaw-0.209
 difference-0.187
 types-0.184
ummy-0.183
 art-0.182
Token the
Token ID262
Feature activation-1.66e-01

Pos logit contributions
igsaw+0.133
 difference+0.111
opl+0.111
ummy+0.110
umm+0.108
Neg logit contributions
 tried-0.119
 remembered-0.118
 decided-0.116
 could-0.115
 didn-0.114
Token v
Token ID410
Feature activation-2.23e-01

Pos logit contributions
igsaw+0.352
opl+0.289
umm+0.285
ummy+0.277
aps+0.260
Neg logit contributions
 tried-0.318
 had-0.317
 didn-0.304
 got-0.301
 dropped-0.301
Tokenase
Token ID589
Feature activation-6.38e-03

Pos logit contributions
igsaw+0.395
 difference+0.347
 types+0.321
 festivals+0.310
 bou+0.300
Neg logit contributions
 could-0.434
 tried-0.428
 decided-0.426
 didn-0.422
 remembered-0.422
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
igsaw+0.015
 difference+0.013
ummy+0.012
opl+0.012
umm+0.012
Neg logit contributions
 remembered-0.010
 tried-0.010
 decided-0.009
 could-0.009
 didn-0.009
Token They
Token ID1119
Feature activation+1.50e-01

Pos logit contributions
igsaw+0.030
 difference+0.025
ummy+0.025
opl+0.024
umm+0.023
Neg logit contributions
 remembered-0.032
 tried-0.031
 managed-0.031
'd-0.031
 decided-0.031
Token wished
Token ID16555
Feature activation-2.03e-02

Pos logit contributions
pper+0.178
pped+0.168
ppers+0.164
ppy+0.163
ppings+0.155
Neg logit contributions
 seats-0.262
 beds-0.238
 Grand-0.234
 parties-0.232
 art-0.232
Token they
Token ID484
Feature activation+1.16e-01

Pos logit contributions
igsaw+0.053
 difference+0.046
ummy+0.046
opl+0.045
umm+0.044
Neg logit contributions
 remembered-0.025
 tried-0.025
 decided-0.024
 could-0.024
 didn-0.024
Token sailed
Token ID33835
Feature activation-4.95e-02

Pos logit contributions
 remembered+0.079
 tried+0.078
 decided+0.076
 could+0.076
 managed+0.075
Neg logit contributions
igsaw-0.187
 difference-0.167
ummy-0.165
opl-0.164
 types-0.163
Token for
Token ID329
Feature activation-2.49e-02

Pos logit contributions
igsaw+0.114
 difference+0.096
opl+0.096
ummy+0.095
umm+0.094
Neg logit contributions
 remembered-0.079
 tried-0.079
 decided-0.078
 could-0.077
 managed-0.076
Token days
Token ID1528
Feature activation-3.16e-02

Pos logit contributions
igsaw+0.054
 difference+0.046
ummy+0.046
umm+0.044
opl+0.044
Neg logit contributions
 tried-0.042
 remembered-0.042
 decided-0.041
 could-0.041
 didn-0.040
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
igsaw+0.074
 difference+0.064
ummy+0.063
opl+0.062
umm+0.061
Neg logit contributions
 remembered-0.046
 tried-0.046
 decided-0.045
 could-0.045
 didn-0.044
Token exploring
Token ID13504
Feature activation+3.05e-03

Pos logit contributions
 remembered+0.022
 could+0.021
 tried+0.021
 decided+0.021
'd+0.021
Neg logit contributions
igsaw-0.030
 difference-0.026
opl-0.026
 types-0.026
 new-0.025
Token unch
Token ID16258
Feature activation-2.23e-01

Pos logit contributions
 tried+0.005
 decided+0.005
 remembered+0.005
 could+0.005
 didn+0.004
Neg logit contributions
igsaw-0.007
ummy-0.006
umm-0.006
opl-0.006
 difference-0.006
Tokenarted
Token ID19112
Feature activation-1.21e-02

Pos logit contributions
igsaw+0.429
 difference+0.390
 bou+0.352
 birthday+0.343
 biggest+0.343
Neg logit contributions
 decided-0.393
 tried-0.378
 accidentally-0.369
 remembered-0.364
 didn-0.363
Token waters
Token ID10150
Feature activation-5.23e-03

Pos logit contributions
igsaw+0.025
 difference+0.021
ummy+0.021
opl+0.021
umm+0.020
Neg logit contributions
 remembered-0.021
 tried-0.021
 decided-0.021
 could-0.020
 didn-0.020
Token until
Token ID1566
Feature activation+5.34e-02

Pos logit contributions
igsaw+0.013
 difference+0.011
ummy+0.011
umm+0.011
opl+0.011
Neg logit contributions
 tried-0.007
 remembered-0.007
 decided-0.007
 could-0.007
 didn-0.007
Token he
Token ID339
Feature activation+1.28e-01

Pos logit contributions
 remembered+0.084
 tried+0.082
 decided+0.081
 could+0.080
'd+0.080
Neg logit contributions
igsaw-0.119
 difference-0.101
ummy-0.100
opl-0.099
umm-0.096
Token came
Token ID1625
Feature activation+3.42e-02

Pos logit contributions
 remembered+0.102
 tried+0.099
 decided+0.092
 wouldn+0.091
'd+0.090
Neg logit contributions
igsaw-0.344
 difference-0.319
opl-0.308
ummy-0.305
 types-0.305
Token he
Token ID339
Feature activation+1.29e-01

Pos logit contributions
 remembered+0.171
'd+0.167
 managed+0.164
 tried+0.163
 stumbled+0.161
Neg logit contributions
igsaw-0.165
 new-0.146
 difference-0.141
ummy-0.140
 art-0.133
Token couldn
Token ID3521
Feature activation+3.76e-02

Pos logit contributions
pper+0.162
Please+0.159
ppy+0.154
 unable+0.153
pped+0.152
Neg logit contributions
 new-0.237
 types-0.217
 difference-0.205
ummy-0.203
 biggest-0.202
Token't
Token ID470
Feature activation+7.25e-02

Pos logit contributions
 remembered+0.047
 decided+0.044
 tried+0.044
 managed+0.043
 could+0.043
Neg logit contributions
igsaw-0.098
ummy-0.086
opl-0.085
 difference-0.085
umm-0.082
Token touch
Token ID3638
Feature activation-7.78e-02

Pos logit contributions
 remembered+0.141
 hadn+0.136
 unable+0.135
 wouldn+0.134
 managed+0.134
Neg logit contributions
igsaw-0.119
 difference-0.107
ay-0.100
 new-0.097
opl-0.097
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
igsaw+0.172
 difference+0.148
ummy+0.146
opl+0.145
 types+0.140
Neg logit contributions
 remembered-0.124
 tried-0.121
 decided-0.120
 managed-0.119
 didn-0.118
Token v
Token ID410
Feature activation-2.22e-01

Pos logit contributions
igsaw+0.284
ummy+0.218
umm+0.206
opl+0.205
 difference+0.199
Neg logit contributions
 tried-0.376
 didn-0.366
 could-0.366
 had-0.362
 decided-0.362
Tokenase
Token ID589
Feature activation-6.21e-03

Pos logit contributions
igsaw+0.365
 difference+0.316
 types+0.286
ummy+0.280
 bou+0.280
Neg logit contributions
 could-0.458
 tried-0.457
 decided-0.454
 didn-0.449
 remembered-0.448
Token.
Token ID13
Feature activation-7.11e-03

Pos logit contributions
igsaw+0.014
 difference+0.012
ummy+0.012
opl+0.012
umm+0.011
Neg logit contributions
 remembered-0.010
 tried-0.010
 decided-0.009
 could-0.009
 didn-0.009
Token Tim
Token ID5045
Feature activation+4.56e-02

Pos logit contributions
igsaw+0.014
 difference+0.012
ummy+0.012
opl+0.012
umm+0.011
Neg logit contributions
 remembered-0.013
 tried-0.013
 decided-0.013
 could-0.012
 didn-0.012
Tokenmy
Token ID1820
Feature activation+1.09e-01

Pos logit contributions
 managed+0.096
 stumbled+0.095
 could+0.094
 possibly+0.093
 unable+0.093
Neg logit contributions
igsaw-0.061
ummy-0.056
 types-0.054
 difference-0.052
ay-0.051
Token felt
Token ID2936
Feature activation+4.75e-02

Pos logit contributions
'd+0.148
 possibly+0.130
 wouldn+0.129
ppy+0.125
 unable+0.125
Neg logit contributions
 new-0.189
 batch-0.183
igsaw-0.174
 types-0.173
 chip-0.171
Token brother
Token ID3956
Feature activation+6.17e-02

Pos logit contributions
igsaw+0.046
opl+0.041
 difference+0.037
umm+0.036
 types+0.036
Neg logit contributions
 didn-0.029
 tried-0.029
 could-0.029
 remembered-0.028
 got-0.028
Token accidentally
Token ID14716
Feature activation+6.72e-02

Pos logit contributions
'd+0.098
 unable+0.097
 managed+0.093
 hadn+0.091
 possibly+0.090
Neg logit contributions
igsaw-0.111
ummy-0.103
 new-0.098
 difference-0.098
 birthday-0.097
Token knocked
Token ID13642
Feature activation-7.08e-02

Pos logit contributions
'd+0.068
 could+0.060
 remembered+0.059
 decided+0.058
 hadn+0.058
Neg logit contributions
igsaw-0.157
 art-0.146
 new-0.141
 math-0.141
 birthday-0.137
Token over
Token ID625
Feature activation-3.64e-02

Pos logit contributions
igsaw+0.146
ummy+0.122
 difference+0.120
opl+0.119
 types+0.115
Neg logit contributions
 remembered-0.125
 tried-0.121
 decided-0.120
'd-0.120
 could-0.120
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
igsaw+0.079
 difference+0.068
ummy+0.066
opl+0.066
umm+0.065
Neg logit contributions
 tried-0.060
 remembered-0.060
 decided-0.059
 could-0.058
 didn-0.058
Token v
Token ID410
Feature activation-2.22e-01

Pos logit contributions
igsaw+0.316
opl+0.259
 types+0.246
 difference+0.236
umm+0.232
Neg logit contributions
 tried-0.331
 had-0.330
 could-0.324
 dropped-0.322
 accidentally-0.318
Tokenase
Token ID589
Feature activation+7.79e-05

Pos logit contributions
igsaw+0.384
 difference+0.340
 types+0.311
 bou+0.303
ummy+0.296
Neg logit contributions
 could-0.443
 tried-0.433
 decided-0.430
 didn-0.429
 got-0.425
Token and
Token ID290
Feature activation+2.05e-02

Pos logit contributions
 remembered+0.000
 tried+0.000
 decided+0.000
 managed+0.000
'd+0.000
Neg logit contributions
igsaw-0.000
ummy-0.000
 difference-0.000
opl-0.000
umm-0.000
Token it
Token ID340
Feature activation+7.05e-02

Pos logit contributions
'd+0.033
 managed+0.027
pper+0.027
 remembered+0.027
 stumbled+0.026
Neg logit contributions
 new-0.037
igsaw-0.037
 art-0.036
 batch-0.035
 types-0.035
Token broke
Token ID6265
Feature activation-5.93e-02

Pos logit contributions
'd+0.074
 remembered+0.072
 managed+0.068
 decided+0.067
 unable+0.063
Neg logit contributions
igsaw-0.162
 new-0.154
 biggest-0.150
ummy-0.150
ay-0.150
Token into
Token ID656
Feature activation-6.15e-02

Pos logit contributions
igsaw+0.123
ummy+0.106
 difference+0.102
umm+0.099
 types+0.099
Neg logit contributions
 remembered-0.102
'd-0.100
 tried-0.099
 didn-0.098
 decided-0.097
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
igsaw+0.078
 difference+0.067
ummy+0.066
opl+0.065
umm+0.064
Neg logit contributions
 remembered-0.047
 tried-0.046
 decided-0.045
 could-0.045
 didn-0.045
Token help
Token ID1037
Feature activation-3.23e-02

Pos logit contributions
 remembered+0.017
 tried+0.017
 decided+0.017
 could+0.017
 didn+0.016
Neg logit contributions
igsaw-0.025
 difference-0.021
ummy-0.021
opl-0.021
umm-0.020
Token me
Token ID502
Feature activation+3.26e-02

Pos logit contributions
igsaw+0.076
opl+0.063
umm+0.063
 difference+0.062
ummy+0.061
Neg logit contributions
 tried-0.051
 could-0.050
 remembered-0.050
 decided-0.050
 didn-0.049
Token fix
Token ID4259
Feature activation-6.78e-02

Pos logit contributions
 remembered+0.054
 tried+0.054
 decided+0.053
 could+0.053
 didn+0.053
Neg logit contributions
igsaw-0.070
 difference-0.059
ummy-0.059
opl-0.058
umm-0.057
Token the
Token ID262
Feature activation-1.76e-01

Pos logit contributions
igsaw+0.144
umm+0.123
 difference+0.118
ummy+0.118
opl+0.115
Neg logit contributions
 tried-0.116
 decided-0.113
 didn-0.113
 could-0.113
 remembered-0.112
Token v
Token ID410
Feature activation-2.22e-01

Pos logit contributions
igsaw+0.316
ummy+0.261
opl+0.258
umm+0.252
ay+0.240
Neg logit contributions
 tried-0.351
 had-0.345
 didn-0.343
 decided-0.340
 could-0.338
Tokenase
Token ID589
Feature activation-2.78e-03

Pos logit contributions
igsaw+0.405
 difference+0.362
 bou+0.331
 pleasure+0.329
 types+0.322
Neg logit contributions
 could-0.416
 decided-0.407
 didn-0.405
 got-0.402
 tried-0.401
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
igsaw+0.007
umm+0.006
 difference+0.006
ummy+0.006
opl+0.006
Neg logit contributions
 decided-0.004
 tried-0.004
 could-0.004
 didn-0.004
 remembered-0.003
Token And
Token ID843
Feature activation+2.58e-02

Pos logit contributions
igsaw+0.029
 difference+0.024
 bou+0.023
 types+0.023
ummy+0.022
Neg logit contributions
 could-0.027
 tried-0.027
 remembered-0.027
 decided-0.027
 didn-0.026
Token you
Token ID345
Feature activation+9.86e-02

Pos logit contributions
 remembered+0.040
 tried+0.040
 decided+0.039
 could+0.039
 didn+0.039
Neg logit contributions
igsaw-0.060
 difference-0.051
opl-0.050
ummy-0.050
umm-0.049
Token have
Token ID423
Feature activation+6.42e-02

Pos logit contributions
 remembered+0.146
 decided+0.142
 stumbled+0.140
 accidentally+0.138
 unable+0.137
Neg logit contributions
igsaw-0.205
 difference-0.193
ay-0.177
 art-0.172
 types-0.170
Token from
Token ID422
Feature activation-5.94e-02

Pos logit contributions
igsaw+0.039
 difference+0.034
ummy+0.033
opl+0.032
umm+0.032
Neg logit contributions
 remembered-0.027
 tried-0.026
 decided-0.025
 didn-0.025
 managed-0.025
Token the
Token ID262
Feature activation-1.50e-01

Pos logit contributions
igsaw+0.106
 difference+0.092
ummy+0.090
opl+0.089
 new+0.087
Neg logit contributions
 remembered-0.114
 managed-0.113
'd-0.112
 tried-0.110
 could-0.110
Token pond
Token ID16723
Feature activation+1.39e-02

Pos logit contributions
igsaw+0.260
 difference+0.203
opl+0.202
ummy+0.201
umm+0.192
Neg logit contributions
 remembered-0.319
 tried-0.318
 decided-0.315
 could-0.313
 didn-0.310
Token.
Token ID13
Feature activation-3.41e-03

Pos logit contributions
 remembered+0.021
 tried+0.020
 decided+0.020
 could+0.020
 didn+0.020
Neg logit contributions
igsaw-0.032
 difference-0.028
ummy-0.028
opl-0.027
umm-0.026
Token Then
Token ID3244
Feature activation+6.05e-02

Pos logit contributions
igsaw+0.008
 difference+0.007
opl+0.007
ummy+0.006
umm+0.006
Neg logit contributions
 tried-0.005
 could-0.005
 decided-0.005
 remembered-0.005
 didn-0.005
Token he
Token ID339
Feature activation+1.38e-01

Pos logit contributions
'd+0.128
 hadn+0.120
 managed+0.119
 unable+0.118
 wouldn+0.118
Neg logit contributions
igsaw-0.078
 new-0.077
 difference-0.074
ummy-0.073
 memories-0.072
Token hopped
Token ID45230
Feature activation-9.96e-03

Pos logit contributions
 unable+0.121
 wouldn+0.112
 hadn+0.104
 didn+0.103
 managed+0.101
Neg logit contributions
igsaw-0.309
 new-0.308
 bou-0.305
 types-0.300
 difference-0.297
Token away
Token ID1497
Feature activation+7.68e-04

Pos logit contributions
igsaw+0.019
ummy+0.016
 difference+0.016
opl+0.015
ay+0.015
Neg logit contributions
 remembered-0.018
 didn-0.018
 tried-0.018
'd-0.017
 accidentally-0.017
Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 remembered+0.001
 tried+0.001
'd+0.001
 could+0.001
 decided+0.001
Neg logit contributions
igsaw-0.002
 difference-0.001
ummy-0.001
opl-0.001
 new-0.001
Token leaving
Token ID4305
Feature activation-3.10e-02

Pos logit contributions
 remembered+0.023
'd+0.022
 decided+0.022
 managed+0.022
 could+0.022
Neg logit contributions
igsaw-0.028
 difference-0.026
 new-0.025
 bou-0.024
ummy-0.024
Token the
Token ID262
Feature activation-1.47e-01

Pos logit contributions
igsaw+0.054
 difference+0.047
ummy+0.045
opl+0.044
 new+0.043
Neg logit contributions
 remembered-0.061
 tried-0.060
'd-0.060
 decided-0.060
 didn-0.059
Token couldn
Token ID3521
Feature activation+4.26e-02

Pos logit contributions
 unable+0.155
pped+0.141
ged+0.127
 managed+0.126
ppings+0.124
Neg logit contributions
 new-0.231
 moms-0.223
 types-0.221
 seats-0.218
 favorite-0.216
Token't
Token ID470
Feature activation+8.36e-02

Pos logit contributions
 remembered+0.046
 tried+0.045
 decided+0.044
 could+0.043
 didn+0.043
Neg logit contributions
igsaw-0.117
 difference-0.103
ummy-0.102
opl-0.101
umm-0.099
Token.
Token ID13
Feature activation-6.52e-03

Pos logit contributions
 managed+0.171
 remembered+0.170
 hadn+0.166
 wouldn+0.165
'd+0.164
Neg logit contributions
igsaw-0.131
 difference-0.121
 birthday-0.116
 parties-0.109
 art-0.106
Token Her
Token ID2332
Feature activation-3.94e-03

Pos logit contributions
igsaw+0.015
 difference+0.013
 types+0.013
opl+0.013
umm+0.013
Neg logit contributions
 could-0.010
 tried-0.010
 remembered-0.009
 decided-0.009
 didn-0.009
Token dream
Token ID4320
Feature activation-5.98e-02

Pos logit contributions
igsaw+0.009
opl+0.007
umm+0.007
ummy+0.007
 difference+0.007
Neg logit contributions
 remembered-0.007
 didn-0.007
 tried-0.007
 could-0.007
 had-0.007
Token was
Token ID373
Feature activation+7.29e-02

Pos logit contributions
igsaw+0.162
 difference+0.144
ummy+0.144
opl+0.142
ay+0.142
Neg logit contributions
 remembered-0.056
 decided-0.053
 tried-0.052
 managed-0.050
 accidentally-0.049
Token gone
Token ID3750
Feature activation+1.20e-03

Pos logit contributions
 remembered+0.109
 tried+0.108
 decided+0.106
 managed+0.104
 could+0.104
Neg logit contributions
igsaw-0.164
 difference-0.143
ummy-0.138
opl-0.137
 types-0.134
Token.
Token ID13
Feature activation-7.97e-03

Pos logit contributions
 remembered+0.002
 tried+0.002
 managed+0.002
 decided+0.002
 didn+0.002
Neg logit contributions
igsaw-0.002
 difference-0.002
ummy-0.002
 new-0.002
opl-0.002
Token And
Token ID843
Feature activation+2.69e-02

Pos logit contributions
igsaw+0.017
 difference+0.015
 types+0.015
opl+0.015
umm+0.015
Neg logit contributions
 could-0.013
 tried-0.013
 remembered-0.013
 decided-0.012
 didn-0.012
Token so
Token ID523
Feature activation+2.02e-02

Pos logit contributions
 remembered+0.035
 tried+0.035
 decided+0.034
 could+0.034
 didn+0.034
Neg logit contributions
igsaw-0.067
 difference-0.058
ummy-0.058
opl-0.057
umm-0.056
Token was
Token ID373
Feature activation+7.37e-02

Pos logit contributions
 remembered+0.030
 could+0.030
 didn+0.029
 tried+0.029
 decided+0.029
Neg logit contributions
igsaw-0.052
 difference-0.044
umm-0.044
opl-0.043
ummy-0.042
Token but
Token ID475
Feature activation+1.00e-01

Pos logit contributions
 remembered+0.020
 tried+0.019
 managed+0.019
 stumbled+0.018
'd+0.018
Neg logit contributions
igsaw-0.028
 difference-0.026
 new-0.026
 art-0.026
ummy-0.026
Token a
Token ID257
Feature activation-7.89e-02

Pos logit contributions
 remembered+0.124
'd+0.123
 tried+0.121
 managed+0.121
 stumbled+0.118
Neg logit contributions
igsaw-0.240
 difference-0.212
ummy-0.211
 new-0.209
 types-0.208
Token baby
Token ID5156
Feature activation-6.19e-03

Pos logit contributions
igsaw+0.115
 difference+0.092
ummy+0.088
opl+0.084
umm+0.082
Neg logit contributions
 remembered-0.183
 tried-0.182
 decided-0.182
 could-0.180
 managed-0.179
Token bird
Token ID6512
Feature activation+2.43e-02

Pos logit contributions
igsaw+0.011
 difference+0.011
ummy+0.011
 bou+0.010
 new+0.010
Neg logit contributions
 managed-0.010
 remembered-0.009
 unable-0.009
 didn-0.009
'd-0.009
Token.
Token ID13
Feature activation-7.69e-03

Pos logit contributions
 remembered+0.032
 managed+0.031
 tried+0.030
 stumbled+0.029
 decided+0.029
Neg logit contributions
igsaw-0.052
opl-0.049
 difference-0.048
 bou-0.047
 new-0.046
Token She
Token ID1375
Feature activation+1.57e-01

Pos logit contributions
igsaw+0.019
 difference+0.016
opl+0.016
 types+0.016
umm+0.016
Neg logit contributions
 could-0.011
 decided-0.011
 tried-0.010
 remembered-0.010
 didn-0.010
Token gently
Token ID15165
Feature activation+3.23e-02

Pos logit contributions
 unable+0.161
 wouldn+0.151
 managed+0.145
'd+0.144
pped+0.141
Neg logit contributions
 bou-0.318
 types-0.313
 new-0.310
 orig-0.305
igsaw-0.304
Token lifted
Token ID13663
Feature activation-5.05e-02

Pos logit contributions
'd+0.033
 could+0.030
 remembered+0.030
 hadn+0.029
 wouldn+0.029
Neg logit contributions
igsaw-0.079
 bou-0.074
 batch-0.071
 new-0.071
 art-0.071
Token the
Token ID262
Feature activation-1.49e-01

Pos logit contributions
igsaw+0.132
 difference+0.115
opl+0.115
umm+0.114
ummy+0.113
Neg logit contributions
 tried-0.063
 decided-0.061
 remembered-0.061
 could-0.061
 didn-0.060
Token egg
Token ID5935
Feature activation-8.77e-03

Pos logit contributions
igsaw+0.329
opl+0.272
 difference+0.267
ummy+0.265
umm+0.259
Neg logit contributions
 remembered-0.250
 tried-0.249
 decided-0.246
 could-0.241
 didn-0.238
Token and
Token ID290
Feature activation+2.35e-02

Pos logit contributions
igsaw+0.020
 difference+0.017
ummy+0.017
opl+0.017
umm+0.016
Neg logit contributions
 remembered-0.013
 tried-0.013
 decided-0.013
 could-0.013
 didn-0.013
Token picture
Token ID4286
Feature activation-4.85e-02

Pos logit contributions
igsaw+0.295
umm+0.272
Whe+0.262
opl+0.259
aj+0.255
Neg logit contributions
 tried-0.281
 had-0.274
 didn-0.267
 accidentally-0.262
 heard-0.261
Token and
Token ID290
Feature activation+1.65e-02

Pos logit contributions
igsaw+0.114
 difference+0.098
ummy+0.098
opl+0.096
umm+0.094
Neg logit contributions
 remembered-0.070
 tried-0.068
 decided-0.067
 didn-0.066
 managed-0.066
Token they
Token ID484
Feature activation+1.19e-01

Pos logit contributions
 remembered+0.019
 tried+0.018
'd+0.018
 could+0.018
 decided+0.018
Neg logit contributions
igsaw-0.043
 difference-0.038
ummy-0.037
 types-0.037
opl-0.036
Token could
Token ID714
Feature activation+1.00e-01

Pos logit contributions
 remembered+0.152
'd+0.148
 tried+0.148
 stumbled+0.146
 managed+0.145
Neg logit contributions
igsaw-0.265
 difference-0.236
 bou-0.232
 art-0.232
 types-0.230
Token see
Token ID766
Feature activation+8.28e-03

Pos logit contributions
 remembered+0.173
 tried+0.168
 decided+0.166
 managed+0.165
 could+0.165
Neg logit contributions
igsaw-0.206
 difference-0.177
ummy-0.173
opl-0.169
 art-0.167
Token it
Token ID340
Feature activation+6.61e-02

Pos logit contributions
 could+0.013
 decided+0.013
 tried+0.012
 remembered+0.012
 didn+0.012
Neg logit contributions
igsaw-0.020
ummy-0.017
umm-0.017
opl-0.016
 difference-0.016
Token much
Token ID881
Feature activation-4.52e-02

Pos logit contributions
 tried+0.109
 could+0.108
 decided+0.108
 remembered+0.107
 didn+0.106
Neg logit contributions
igsaw-0.151
ummy-0.125
umm-0.124
opl-0.123
 difference-0.123
Token better
Token ID1365
Feature activation-4.69e-02

Pos logit contributions
igsaw+0.092
opl+0.077
ummy+0.076
 difference+0.075
umm+0.073
Neg logit contributions
 tried-0.082
 remembered-0.081
 decided-0.081
 could-0.080
 didn-0.078
Token.
Token ID13
Feature activation-6.05e-03

Pos logit contributions
igsaw+0.101
 difference+0.087
ummy+0.086
 art+0.082
umm+0.082
Neg logit contributions
 remembered-0.076
 tried-0.076
 didn-0.074
 decided-0.074
'd-0.074
Token Tim
Token ID5045
Feature activation+4.83e-02

Pos logit contributions
igsaw+0.014
 difference+0.012
umm+0.012
 types+0.011
 bou+0.011
Neg logit contributions
 could-0.010
 didn-0.009
 tried-0.009
 decided-0.009
 accidentally-0.009
Tokenmy
Token ID1820
Feature activation+1.07e-01

Pos logit contributions
 tried+0.084
 remembered+0.083
 managed+0.083
 could+0.083
 decided+0.083
Neg logit contributions
igsaw-0.094
 difference-0.082
ummy-0.081
opl-0.079
 types-0.078
Token fragile
Token ID21049
Feature activation-1.28e-01

Pos logit contributions
igsaw+0.254
opl+0.190
ummy+0.190
 difference+0.184
umm+0.182
Neg logit contributions
 remembered-0.302
 tried-0.301
 decided-0.297
 could-0.296
 didn-0.293
Token house
Token ID2156
Feature activation+3.03e-02

Pos logit contributions
igsaw+0.203
 difference+0.170
ummy+0.164
opl+0.158
 types+0.156
Neg logit contributions
 remembered-0.279
 tried-0.275
 decided-0.274
 managed-0.273
 could-0.272
Token.
Token ID13
Feature activation+9.63e-04

Pos logit contributions
 remembered+0.050
 tried+0.048
 managed+0.046
 decided+0.046
 accidentally+0.046
Neg logit contributions
igsaw-0.065
 difference-0.056
ummy-0.055
 art-0.053
 types-0.053
Token The
Token ID383
Feature activation-1.05e-01

Pos logit contributions
 could+0.002
 tried+0.002
 decided+0.002
 didn+0.002
 remembered+0.002
Neg logit contributions
igsaw-0.002
 difference-0.002
opl-0.002
umm-0.002
ummy-0.002
Token bird
Token ID6512
Feature activation+5.19e-02

Pos logit contributions
igsaw+0.219
ummy+0.176
opl+0.175
 difference+0.175
umm+0.171
Neg logit contributions
 remembered-0.191
 tried-0.190
 decided-0.187
 could-0.185
 didn-0.185
Token looked
Token ID3114
Feature activation+6.90e-02

Pos logit contributions
'd+0.041
 unable+0.041
 wouldn+0.040
 hadn+0.040
 stumbled+0.040
Neg logit contributions
igsaw-0.124
 types-0.123
 difference-0.119
 new-0.118
 bou-0.116
Token at
Token ID379
Feature activation-5.46e-02

Pos logit contributions
 remembered+0.128
 tried+0.126
 decided+0.125
 could+0.124
 didn+0.123
Neg logit contributions
igsaw-0.136
 difference-0.114
ummy-0.112
opl-0.110
umm-0.107
Token the
Token ID262
Feature activation-1.34e-01

Pos logit contributions
igsaw+0.108
 difference+0.089
ummy+0.088
opl+0.086
umm+0.085
Neg logit contributions
 remembered-0.102
 tried-0.102
 decided-0.101
 could-0.100
 didn-0.099
Token house
Token ID2156
Feature activation+3.14e-02

Pos logit contributions
igsaw+0.248
opl+0.185
ummy+0.183
umm+0.178
 difference+0.176
Neg logit contributions
 remembered-0.286
 tried-0.285
 decided-0.281
 could-0.280
 didn-0.276
Token and
Token ID290
Feature activation+3.14e-02

Pos logit contributions
 remembered+0.044
 tried+0.042
 decided+0.041
 managed+0.041
'd+0.041
Neg logit contributions
igsaw-0.074
ummy-0.065
 difference-0.065
 types-0.062
opl-0.062
Token saw
Token ID2497
Feature activation-1.67e-02

Pos logit contributions
'd+0.030
 managed+0.028
 wouldn+0.027
 remembered+0.027
 hadn+0.026
Neg logit contributions
igsaw-0.076
 types-0.073
 difference-0.072
 new-0.071
 art-0.071
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
igsaw+0.028
 difference+0.025
ummy+0.024
opl+0.024
umm+0.024
Neg logit contributions
 could-0.014
 tried-0.013
 decided-0.013
 remembered-0.013
 didn-0.012
Token swings
Token ID26728
Feature activation-4.46e-02

Pos logit contributions
igsaw+0.220
ummy+0.174
opl+0.173
 difference+0.170
umm+0.170
Neg logit contributions
 remembered-0.259
 tried-0.259
 decided-0.257
 didn-0.255
 could-0.253
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
igsaw+0.110
 difference+0.095
ummy+0.093
opl+0.092
umm+0.091
Neg logit contributions
 remembered-0.061
 tried-0.060
 decided-0.060
 could-0.059
 didn-0.058
Token the
Token ID262
Feature activation-1.15e-01

Pos logit contributions
 remembered+0.024
 tried+0.023
 managed+0.023
 decided+0.023
 could+0.023
Neg logit contributions
igsaw-0.035
 difference-0.030
ummy-0.030
opl-0.029
 types-0.029
Token slide
Token ID10649
Feature activation-5.13e-03

Pos logit contributions
igsaw+0.200
ummy+0.160
 difference+0.159
opl+0.156
umm+0.153
Neg logit contributions
 remembered-0.240
 tried-0.239
 decided-0.237
 could-0.235
 didn-0.234
Token and
Token ID290
Feature activation+3.30e-02

Pos logit contributions
igsaw+0.012
 difference+0.010
ummy+0.010
opl+0.010
 types+0.010
Neg logit contributions
 remembered-0.007
 tried-0.007
 decided-0.007
 managed-0.007
 didn-0.007
Token the
Token ID262
Feature activation-1.28e-01

Pos logit contributions
 remembered+0.056
 tried+0.056
 decided+0.056
 could+0.055
 didn+0.055
Neg logit contributions
igsaw-0.070
 difference-0.059
ummy-0.059
opl-0.058
umm-0.057
Token sees
Token ID7224
Feature activation-2.87e-02

Pos logit contributions
igsaw+0.216
ummy+0.171
 difference+0.169
opl+0.166
umm+0.163
Neg logit contributions
 remembered-0.276
 tried-0.276
 decided-0.273
 could-0.270
 didn-0.269
Tokenaw
Token ID707
Feature activation-2.96e-02

Pos logit contributions
igsaw+0.064
 difference+0.053
ummy+0.053
umm+0.052
opl+0.052
Neg logit contributions
 decided-0.048
 tried-0.048
 remembered-0.047
 could-0.047
 didn-0.046
Token.
Token ID13
Feature activation+6.02e-04

Pos logit contributions
igsaw+0.065
ummy+0.057
 difference+0.056
opl+0.056
umm+0.053
Neg logit contributions
 remembered-0.047
 tried-0.046
 decided-0.044
 managed-0.044
 didn-0.044
Token They
Token ID1119
Feature activation+1.60e-01

Pos logit contributions
 could+0.001
 tried+0.001
 decided+0.001
 accidentally+0.001
 stumbled+0.001
Neg logit contributions
igsaw-0.001
 difference-0.001
 types-0.001
umm-0.001
opl-0.001
Token realized
Token ID6939
Feature activation+3.60e-03

Pos logit contributions
 unable+0.113
 accidentally+0.108
 tried+0.107
Please+0.106
 dropped+0.103
Neg logit contributions
igsaw-0.333
 difference-0.328
 types-0.327
 parties-0.318
opl-0.318
Token how
Token ID703
Feature activation-6.42e-03

Pos logit contributions
 remembered+0.006
 tried+0.006
 decided+0.006
 managed+0.006
 could+0.006
Neg logit contributions
igsaw-0.008
 difference-0.006
ummy-0.006
opl-0.006
 types-0.006
Token important
Token ID1593
Feature activation-5.64e-02

Pos logit contributions
igsaw+0.014
 difference+0.012
ummy+0.012
 new+0.011
 types+0.011
Neg logit contributions
 remembered-0.010
 tried-0.010
 decided-0.010
 managed-0.010
'd-0.010
Token it
Token ID340
Feature activation+8.73e-02

Pos logit contributions
igsaw+0.139
 difference+0.123
 types+0.121
ummy+0.119
 new+0.118
Neg logit contributions
 remembered-0.072
'd-0.067
 accidentally-0.066
 managed-0.066
 tried-0.066
Token was
Token ID373
Feature activation+7.31e-02

Pos logit contributions
 remembered+0.098
 accidentally+0.094
 unable+0.092
 decided+0.091
 stumbled+0.091
Neg logit contributions
igsaw-0.198
 types-0.187
 difference-0.186
 new-0.180
 art-0.172
Token to
Token ID284
Feature activation+3.86e-02

Pos logit contributions
 remembered+0.118
 tried+0.117
 decided+0.114
 managed+0.114
 could+0.113
Neg logit contributions
igsaw-0.147
 difference-0.132
ummy-0.127
 types-0.124
 new-0.124
Token make
Token ID787
Feature activation-2.85e-02

Pos logit contributions
 remembered+0.060
 could+0.059
 tried+0.058
 managed+0.058
 decided+0.058
Neg logit contributions
igsaw-0.084
 difference-0.075
opl-0.072
ummy-0.071
 art-0.068
Token sure
Token ID1654
Feature activation+8.41e-03

Pos logit contributions
igsaw+0.069
opl+0.057
umm+0.057
ummy+0.056
ay+0.054
Neg logit contributions
 tried-0.041
 decided-0.041
 didn-0.041
 remembered-0.040
 could-0.040
Token his
Token ID465
Feature activation-3.93e-02

Pos logit contributions
 remembered+0.012
 tried+0.011
 decided+0.011
 could+0.011
 didn+0.011
Neg logit contributions
igsaw-0.020
 difference-0.018
ummy-0.017
opl-0.017
umm-0.017
Token hiding
Token ID11816
Feature activation-4.73e-02

Pos logit contributions
igsaw+0.070
opl+0.055
umm+0.054
ummy+0.053
 difference+0.052
Neg logit contributions
 didn-0.083
 remembered-0.083
 tried-0.083
 could-0.081
 decided-0.081
Token place
Token ID1295
Feature activation-2.28e-02

Pos logit contributions
igsaw+0.079
 difference+0.065
ummy+0.061
 types+0.059
opl+0.059
Neg logit contributions
 remembered-0.100
 tried-0.100
 decided-0.099
 could-0.098
 didn-0.097
Token see
Token ID766
Feature activation+1.95e-02

Pos logit contributions
 remembered+0.064
 tried+0.062
 decided+0.061
 could+0.061
 managed+0.061
Neg logit contributions
igsaw-0.085
 difference-0.074
opl-0.071
ummy-0.071
 art-0.068
Token what
Token ID644
Feature activation+3.29e-02

Pos logit contributions
 could+0.026
 tried+0.026
 decided+0.025
 remembered+0.025
 didn+0.024
Neg logit contributions
igsaw-0.052
umm-0.044
ummy-0.043
opl-0.043
 difference-0.042
Token was
Token ID373
Feature activation+7.02e-02

Pos logit contributions
 remembered+0.032
 tried+0.030
 decided+0.030
 managed+0.029
 didn+0.029
Neg logit contributions
igsaw-0.091
 difference-0.082
ummy-0.081
opl-0.080
 types-0.079
Token up
Token ID510
Feature activation-1.91e-02

Pos logit contributions
 remembered+0.118
 tried+0.114
 decided+0.114
 could+0.113
 didn+0.113
Neg logit contributions
igsaw-0.146
 difference-0.125
opl-0.124
ummy-0.122
 types-0.118
Token there
Token ID612
Feature activation+3.20e-02

Pos logit contributions
igsaw+0.041
 difference+0.035
ummy+0.035
 new+0.034
 types+0.033
Neg logit contributions
 remembered-0.029
 wouldn-0.028
 didn-0.028
 tried-0.028
 could-0.028
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
 remembered+0.049
 tried+0.049
 decided+0.048
 managed+0.047
 didn+0.047
Neg logit contributions
igsaw-0.071
 difference-0.062
ummy-0.062
opl-0.060
 types-0.059
Token so
Token ID523
Feature activation+2.94e-02

Pos logit contributions
 remembered+0.027
 tried+0.025
 managed+0.025
'd+0.025
 decided+0.025
Neg logit contributions
igsaw-0.041
 difference-0.037
ummy-0.036
opl-0.035
 new-0.035
Token he
Token ID339
Feature activation+1.42e-01

Pos logit contributions
 remembered+0.028
 tried+0.027
 decided+0.027
 could+0.026
 didn+0.026
Neg logit contributions
igsaw-0.084
 difference-0.074
ummy-0.074
opl-0.073
umm-0.072
Token crawled
Token ID45668
Feature activation-1.65e-02

Pos logit contributions
 unable+0.137
pped+0.122
 hadn+0.116
 remembered+0.113
'd+0.110
Neg logit contributions
 types-0.284
 new-0.284
 bou-0.284
 orig-0.273
 seats-0.267
Token onto
Token ID4291
Feature activation-5.25e-02

Pos logit contributions
igsaw+0.028
 new+0.024
ummy+0.024
 bou+0.023
 difference+0.022
Neg logit contributions
 remembered-0.033
 didn-0.031
 accidentally-0.031
'd-0.030
 tried-0.030
Token a
Token ID257
Feature activation-9.50e-02

Pos logit contributions
igsaw+0.113
 difference+0.095
ummy+0.094
opl+0.093
umm+0.091
Neg logit contributions
 tried-0.088
 remembered-0.088
 decided-0.087
 could-0.086
 didn-0.085
Token
Token ID198
Feature activation-6.03e-02

Pos logit contributions
igsaw+0.138
umm+0.117
 difference+0.115
ummy+0.113
opl+0.112
Neg logit contributions
 tried-0.106
 could-0.106
 remembered-0.105
 decided-0.104
 didn-0.103
TokenA
Token ID32
Feature activation-6.13e-02

Pos logit contributions
igsaw+0.118
 difference+0.099
ummy+0.096
 new+0.095
 art+0.094
Neg logit contributions
 wouldn-0.110
'd-0.107
 remembered-0.103
 managed-0.103
 didn-0.102
Token man
Token ID582
Feature activation+6.14e-02

Pos logit contributions
igsaw+0.107
 difference+0.088
ummy+0.085
opl+0.081
 types+0.081
Neg logit contributions
 remembered-0.124
 decided-0.123
 wouldn-0.123
 tried-0.123
 didn-0.122
Token heard
Token ID2982
Feature activation+8.23e-03

Pos logit contributions
 wouldn+0.064
 accidentally+0.058
 managed+0.056
 didn+0.055
'd+0.054
Neg logit contributions
 auditor-0.124
 seats-0.124
 difference-0.123
 new-0.121
 theater-0.120
Token them
Token ID606
Feature activation+8.08e-02

Pos logit contributions
 tried+0.011
 decided+0.011
 remembered+0.010
 didn+0.010
 could+0.010
Neg logit contributions
igsaw-0.021
umm-0.018
opl-0.018
 difference-0.018
ummy-0.018
Token and
Token ID290
Feature activation+2.09e-02

Pos logit contributions
 remembered+0.138
 managed+0.135
'd+0.133
 accidentally+0.130
 wouldn+0.127
Neg logit contributions
igsaw-0.157
 difference-0.148
opl-0.145
ummy-0.143
 new-0.136
Token came
Token ID1625
Feature activation+4.16e-02

Pos logit contributions
'd+0.023
 possibly+0.021
 wouldn+0.021
 unable+0.020
 shouldn+0.020
Neg logit contributions
 new-0.043
 seats-0.039
 old-0.037
 Grand-0.037
 cheers-0.037
Token to
Token ID284
Feature activation+2.33e-02

Pos logit contributions
 remembered+0.077
 didn+0.076
 wouldn+0.076
'd+0.075
 managed+0.075
Neg logit contributions
igsaw-0.073
 new-0.064
 difference-0.064
ummy-0.064
 types-0.062
Token see
Token ID766
Feature activation+1.50e-02

Pos logit contributions
 remembered+0.040
 wouldn+0.040
 could+0.039
'd+0.039
 didn+0.039
Neg logit contributions
igsaw-0.047
 difference-0.041
ummy-0.038
 bou-0.038
 new-0.038
Token what
Token ID644
Feature activation+2.60e-02

Pos logit contributions
 remembered+0.025
'd+0.024
 managed+0.024
 tried+0.024
 decided+0.024
Neg logit contributions
igsaw-0.032
 difference-0.028
ummy-0.027
opl-0.026
 types-0.026
Token was
Token ID373
Feature activation+6.54e-02

Pos logit contributions
 remembered+0.023
 decided+0.022
 tried+0.022
 managed+0.022
 wouldn+0.022
Neg logit contributions
igsaw-0.073
 difference-0.065
ummy-0.064
 types-0.063
 new-0.063
Token.
Token ID13
Feature activation-7.96e-03

Pos logit contributions
 remembered+0.063
 tried+0.062
 could+0.061
 didn+0.061
 decided+0.061
Neg logit contributions
igsaw-0.091
 difference-0.077
opl-0.076
ummy-0.075
umm-0.074
Token I
Token ID314
Feature activation+1.32e-01

Pos logit contributions
igsaw+0.018
 difference+0.016
opl+0.016
 types+0.015
ummy+0.015
Neg logit contributions
 could-0.013
 tried-0.012
 remembered-0.012
 didn-0.012
 decided-0.012
Token didn
Token ID1422
Feature activation+6.03e-02

Pos logit contributions
 unable+0.198
 decided+0.194
 tried+0.184
 wouldn+0.184
Please+0.183
Neg logit contributions
igsaw-0.250
 difference-0.240
 fault-0.211
 pleasure-0.211
ay-0.210
Token't
Token ID470
Feature activation+9.27e-02

Pos logit contributions
 remembered+0.063
 tried+0.062
 decided+0.061
 could+0.060
 didn+0.060
Neg logit contributions
igsaw-0.167
 difference-0.147
ummy-0.145
opl-0.144
umm-0.142
Token know
Token ID760
Feature activation+1.98e-02

Pos logit contributions
'd+0.175
 wouldn+0.174
 hadn+0.172
 could+0.170
 managed+0.168
Neg logit contributions
igsaw-0.163
 difference-0.143
 art-0.133
 parties-0.131
 new-0.130
Token that
Token ID326
Feature activation+1.21e-02

Pos logit contributions
 could+0.026
 tried+0.026
 decided+0.025
 remembered+0.025
 didn+0.025
Neg logit contributions
igsaw-0.052
 difference-0.044
umm-0.044
opl-0.043
ummy-0.042
Token.
Token ID13
Feature activation-9.51e-03

Pos logit contributions
 tried+0.016
 decided+0.016
 remembered+0.016
 could+0.016
 didn+0.016
Neg logit contributions
igsaw-0.031
opl-0.026
ummy-0.026
umm-0.026
 difference-0.026
Token Thank
Token ID6952
Feature activation-5.72e-02

Pos logit contributions
igsaw+0.022
 difference+0.019
opl+0.019
ummy+0.018
 types+0.018
Neg logit contributions
 could-0.015
 remembered-0.014
 decided-0.014
 tried-0.014
 didn-0.014
Token you
Token ID345
Feature activation+9.26e-02

Pos logit contributions
igsaw+0.112
ummy+0.094
 difference+0.093
opl+0.090
umm+0.089
Neg logit contributions
 remembered-0.106
 tried-0.104
 decided-0.103
 managed-0.103
'd-0.102
Token,
Token ID11
Feature activation+7.80e-03

Pos logit contributions
 could+0.133
 decided+0.130
 didn+0.128
 remembered+0.127
 tried+0.125
Neg logit contributions
igsaw-0.232
opl-0.200
umm-0.194
 difference-0.189
 types-0.188
Token Lily
Token ID20037
Feature activation+6.20e-02

Pos logit contributions
 tried+0.015
 remembered+0.014
 didn+0.014
 decided+0.014
 could+0.014
Neg logit contributions
igsaw-0.016
 difference-0.013
ummy-0.013
opl-0.013
umm-0.013
Token asked
Token ID1965
Feature activation-5.51e-03

Pos logit contributions
pped+0.119
 unable+0.116
 managed+0.108
'd+0.100
 remembered+0.099
Neg logit contributions
igsaw-0.389
 types-0.381
 new-0.377
 difference-0.372
 goals-0.364
Token Jill
Token ID22772
Feature activation+7.48e-02

Pos logit contributions
igsaw+0.011
opl+0.010
 difference+0.010
ummy+0.009
umm+0.009
Neg logit contributions
 remembered-0.010
 tried-0.010
 decided-0.010
 could-0.010
 didn-0.010
Token why
Token ID1521
Feature activation+3.37e-02

Pos logit contributions
 remembered+0.114
 could+0.113
 tried+0.113
 decided+0.112
 didn+0.111
Neg logit contributions
igsaw-0.173
umm-0.149
 difference-0.147
opl-0.147
ummy-0.146
Token she
Token ID673
Feature activation+1.28e-01

Pos logit contributions
 remembered+0.044
 tried+0.043
 managed+0.042
 decided+0.041
'd+0.041
Neg logit contributions
igsaw-0.081
ummy-0.070
 difference-0.070
 art-0.069
 new-0.068
Token thought
Token ID1807
Feature activation+2.60e-02

Pos logit contributions
 managed+0.119
 unable+0.117
 remembered+0.110
 tried+0.105
 decided+0.105
Neg logit contributions
igsaw-0.288
 difference-0.283
opl-0.279
 types-0.276
 new-0.268
Token golf
Token ID13126
Feature activation-9.56e-02

Pos logit contributions
 remembered+0.035
 tried+0.034
 decided+0.033
 managed+0.032
 didn+0.032
Neg logit contributions
igsaw-0.064
 difference-0.056
ummy-0.055
opl-0.054
 types-0.053
Token was
Token ID373
Feature activation+7.55e-02

Pos logit contributions
igsaw+0.246
 difference+0.216
ummy+0.212
opl+0.211
 types+0.207
Neg logit contributions
 remembered-0.116
 tried-0.112
 decided-0.111
 could-0.108
 managed-0.108
Token not
Token ID407
Feature activation+9.09e-02

Pos logit contributions
 remembered+0.121
 tried+0.120
 decided+0.119
 could+0.117
 didn+0.117
Neg logit contributions
igsaw-0.167
 difference-0.142
ummy-0.140
opl-0.139
umm-0.136
Token normal
Token ID3487
Feature activation-7.08e-02

Pos logit contributions
 remembered+0.167
 tried+0.163
 could+0.162
 decided+0.161
 managed+0.160
Neg logit contributions
igsaw-0.176
 difference-0.148
ummy-0.142
 types-0.140
opl-0.140
Token.
Token ID13
Feature activation-7.99e-03

Pos logit contributions
igsaw+0.170
 difference+0.147
opl+0.145
ummy+0.144
umm+0.142
Neg logit contributions
 tried-0.100
 remembered-0.099
 decided-0.099
 could-0.097
 didn-0.097
Token
Token ID220
Feature activation-1.28e-02

Pos logit contributions
igsaw+0.016
 difference+0.014
opl+0.014
ummy+0.013
umm+0.013
Neg logit contributions
 decided-0.015
 could-0.015
 tried-0.015
 didn-0.014
 remembered-0.014
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
igsaw+0.045
umm+0.038
opl+0.038
ummy+0.037
 difference+0.037
Neg logit contributions
 decided-0.034
 tried-0.034
 could-0.033
 remembered-0.033
 didn-0.032
Token water
Token ID1660
Feature activation-1.91e-02

Pos logit contributions
igsaw+0.271
opl+0.215
ummy+0.214
umm+0.209
 difference+0.206
Neg logit contributions
 decided-0.284
 tried-0.279
 remembered-0.279
 could-0.277
 had-0.273
Token.
Token ID13
Feature activation+4.54e-04

Pos logit contributions
igsaw+0.049
 difference+0.041
opl+0.041
ummy+0.041
umm+0.040
Neg logit contributions
 decided-0.025
 tried-0.024
 could-0.024
 remembered-0.024
 didn-0.023
Token
Token ID198
Feature activation-4.93e-02

Pos logit contributions
 could+0.001
 tried+0.001
 decided+0.001
 accidentally+0.001
 didn+0.001
Neg logit contributions
igsaw-0.001
 difference-0.001
 types-0.001
opl-0.001
ummy-0.001
Token
Token ID198
Feature activation-4.84e-02

Pos logit contributions
igsaw+0.113
umm+0.100
 difference+0.095
ummy+0.093
opl+0.091
Neg logit contributions
 could-0.080
 tried-0.079
 decided-0.078
 remembered-0.077
 didn-0.077
TokenOne
Token ID3198
Feature activation-6.92e-02

Pos logit contributions
igsaw+0.091
 difference+0.075
ummy+0.074
opl+0.074
 types+0.070
Neg logit contributions
 remembered-0.092
 tried-0.091
 could-0.089
 wouldn-0.089
 decided-0.089
Token day
Token ID1110
Feature activation+1.65e-02

Pos logit contributions
igsaw+0.121
opl+0.095
 difference+0.095
umm+0.094
 bou+0.089
Neg logit contributions
 decided-0.153
 could-0.152
 tried-0.151
 remembered-0.150
 didn-0.149
Token,
Token ID11
Feature activation+2.13e-02

Pos logit contributions
 remembered+0.029
 tried+0.028
 decided+0.028
'd+0.028
 managed+0.028
Neg logit contributions
igsaw-0.034
ummy-0.028
 difference-0.028
opl-0.027
umm-0.026
Token Tim
Token ID5045
Feature activation+6.33e-02

Pos logit contributions
'd+0.037
 remembered+0.037
 managed+0.036
 tried+0.035
 wouldn+0.035
Neg logit contributions
igsaw-0.042
ummy-0.036
 new-0.035
 festivals-0.034
 birthday-0.034
Token saw
Token ID2497
Feature activation-1.72e-02

Pos logit contributions
 unable+0.069
 wouldn+0.069
 managed+0.066
'd+0.065
 hadn+0.063
Neg logit contributions
igsaw-0.142
 types-0.134
ummy-0.134
 difference-0.133
opl-0.130
Token a
Token ID257
Feature activation-9.07e-02

Pos logit contributions
igsaw+0.041
 difference+0.036
umm+0.035
opl+0.035
ummy+0.035
Neg logit contributions
 tried-0.026
 remembered-0.025
 decided-0.025
 could-0.025
 didn-0.025
Token.
Token ID13
Feature activation-8.82e-03

Pos logit contributions
 remembered+0.003
 tried+0.003
 decided+0.003
 didn+0.003
 could+0.003
Neg logit contributions
igsaw-0.003
 difference-0.003
ummy-0.003
opl-0.003
umm-0.003
Token They
Token ID1119
Feature activation+1.58e-01

Pos logit contributions
igsaw+0.021
opl+0.019
 difference+0.018
umm+0.017
 bou+0.017
Neg logit contributions
 could-0.013
 tried-0.012
 didn-0.012
 had-0.012
 decided-0.012
Token thanked
Token ID26280
Feature activation-3.30e-02

Pos logit contributions
Please+0.158
 tried+0.151
 accidentally+0.151
 remembered+0.150
'd+0.148
Neg logit contributions
igsaw-0.344
 parties-0.333
 difference-0.331
 bou-0.330
 new-0.318
Token Lucy
Token ID22162
Feature activation+7.54e-02

Pos logit contributions
igsaw+0.066
 difference+0.057
ummy+0.055
 art+0.052
 types+0.052
Neg logit contributions
 remembered-0.060
 managed-0.058
 tried-0.058
 decided-0.058
 could-0.058
Token for
Token ID329
Feature activation-3.00e-02

Pos logit contributions
 remembered+0.137
 didn+0.136
 decided+0.136
 could+0.136
 tried+0.135
Neg logit contributions
igsaw-0.153
opl-0.130
umm-0.127
 difference-0.121
 types-0.120
Token inviting
Token ID23310
Feature activation-8.35e-02

Pos logit contributions
igsaw+0.056
 difference+0.047
opl+0.045
 art+0.045
ummy+0.045
Neg logit contributions
 remembered-0.055
 could-0.055
'd-0.055
 tried-0.054
 decided-0.054
Token them
Token ID606
Feature activation+7.30e-02

Pos logit contributions
igsaw+0.184
 difference+0.158
ummy+0.158
opl+0.149
umm+0.148
Neg logit contributions
 remembered-0.136
 tried-0.134
 managed-0.131
 could-0.131
'd-0.131
Token and
Token ID290
Feature activation+5.00e-03

Pos logit contributions
 decided+0.105
 tried+0.103
 remembered+0.103
 could+0.103
 didn+0.101
Neg logit contributions
igsaw-0.176
umm-0.157
opl-0.155
ummy-0.151
 difference-0.150
Token promised
Token ID8072
Feature activation+9.88e-03

Pos logit contributions
 remembered+0.006
'd+0.006
 tried+0.006
 managed+0.006
 could+0.006
Neg logit contributions
igsaw-0.012
 difference-0.011
ummy-0.011
 types-0.010
 art-0.010
Token to
Token ID284
Feature activation+2.71e-02

Pos logit contributions
 remembered+0.017
 tried+0.017
 managed+0.017
 could+0.017
'd+0.017
Neg logit contributions
igsaw-0.020
 difference-0.017
ummy-0.017
 new-0.016
 birthday-0.016
Token play
Token ID711
Feature activation-5.02e-02

Pos logit contributions
 remembered+0.040
 could+0.039
 tried+0.039
 wouldn+0.039
 accidentally+0.038
Neg logit contributions
igsaw-0.058
 difference-0.054
 birthday-0.051
 parties-0.051
ummy-0.051
Token,
Token ID11
Feature activation+2.41e-02

Pos logit contributions
igsaw+0.037
 difference+0.032
opl+0.032
umm+0.031
ummy+0.031
Neg logit contributions
 remembered-0.023
 could-0.023
 tried-0.023
 didn-0.023
 decided-0.023
Token "
Token ID366
Feature activation+1.37e-02

Pos logit contributions
 remembered+0.049
 managed+0.047
 decided+0.047
 tried+0.047
'd+0.045
Neg logit contributions
igsaw-0.041
ummy-0.035
 difference-0.033
 new-0.032
 birthday-0.032
TokenV
Token ID53
Feature activation-1.61e-01

Pos logit contributions
 remembered+0.024
 tried+0.023
 wouldn+0.023
 didn+0.023
 managed+0.023
Neg logit contributions
igsaw-0.026
opl-0.023
 difference-0.022
umm-0.022
ay-0.022
Tokenanish
Token ID7115
Feature activation-7.65e-03

Pos logit contributions
igsaw+0.194
 difference+0.146
opl+0.136
ummy+0.134
 types+0.128
Neg logit contributions
 remembered-0.417
 tried-0.416
 didn-0.411
 decided-0.411
 could-0.409
Token!"
Token ID2474
Feature activation-3.70e-02

Pos logit contributions
igsaw+0.017
 difference+0.014
opl+0.014
ummy+0.014
umm+0.014
Neg logit contributions
 remembered-0.013
 tried-0.013
 didn-0.013
 could-0.013
 decided-0.012
Token The
Token ID383
Feature activation-1.13e-01

Pos logit contributions
igsaw+0.073
 difference+0.060
ummy+0.059
opl+0.058
umm+0.057
Neg logit contributions
 remembered-0.069
 tried-0.069
 decided-0.068
 could-0.067
 didn-0.067
Token toy
Token ID13373
Feature activation-2.66e-02

Pos logit contributions
igsaw+0.208
opl+0.167
ummy+0.166
umm+0.163
 difference+0.163
Neg logit contributions
 tried-0.238
 remembered-0.235
 could-0.231
 decided-0.230
 didn-0.230
Token car
Token ID1097
Feature activation+1.96e-02

Pos logit contributions
igsaw+0.049
 difference+0.042
ummy+0.040
 art+0.040
opl+0.040
Neg logit contributions
 remembered-0.047
 managed-0.047
 decided-0.046
'd-0.046
 tried-0.046
Token was
Token ID373
Feature activation+7.93e-02

Pos logit contributions
 remembered+0.018
 managed+0.017
'd+0.016
 decided+0.016
 accidentally+0.016
Neg logit contributions
igsaw-0.050
 difference-0.046
 types-0.046
 art-0.045
 new-0.045
Token gone
Token ID3750
Feature activation+1.92e-03

Pos logit contributions
 remembered+0.112
 tried+0.111
 decided+0.108
 could+0.108
 didn+0.107
Neg logit contributions
igsaw-0.197
 difference-0.169
ummy-0.168
umm-0.166
opl-0.165
Token.
Token ID13
Feature activation+1.71e-03

Pos logit contributions
 remembered+0.003
 tried+0.003
 managed+0.003
 decided+0.003
'd+0.003
Neg logit contributions
igsaw-0.004
ummy-0.003
 difference-0.003
 new-0.003
opl-0.003
Token and
Token ID290
Feature activation+1.66e-02

Pos logit contributions
 could+0.002
 decided+0.002
 tried+0.002
 had+0.002
 didn+0.002
Neg logit contributions
igsaw-0.005
 difference-0.004
umm-0.004
ummy-0.004
opl-0.004
Token run
Token ID1057
Feature activation-7.95e-03

Pos logit contributions
 remembered+0.032
 tried+0.032
 decided+0.032
 could+0.031
 didn+0.031
Neg logit contributions
igsaw-0.031
 difference-0.025
ummy-0.025
opl-0.024
 types-0.024
Token.
Token ID13
Feature activation+3.40e-03

Pos logit contributions
igsaw+0.019
 difference+0.016
ummy+0.016
opl+0.016
umm+0.015
Neg logit contributions
 remembered-0.012
 tried-0.012
 decided-0.011
 could-0.011
 didn-0.011
Token They
Token ID1119
Feature activation+1.60e-01

Pos logit contributions
 could+0.005
 decided+0.005
 tried+0.005
 didn+0.005
 remembered+0.005
Neg logit contributions
igsaw-0.008
 difference-0.007
umm-0.006
opl-0.006
 types-0.006
Token saw
Token ID2497
Feature activation-1.27e-02

Pos logit contributions
 remembered+0.128
 managed+0.122
 unable+0.120
 accidentally+0.119
 wouldn+0.119
Neg logit contributions
igsaw-0.405
 types-0.382
 parties-0.380
 difference-0.377
opl-0.376
Token a
Token ID257
Feature activation-8.48e-02

Pos logit contributions
igsaw+0.034
umm+0.029
ummy+0.028
opl+0.028
 difference+0.028
Neg logit contributions
 decided-0.019
 tried-0.018
 could-0.018
 remembered-0.017
 didn-0.017
Token big
Token ID1263
Feature activation-1.49e-01

Pos logit contributions
igsaw+0.131
 difference+0.101
ummy+0.101
opl+0.100
umm+0.097
Neg logit contributions
 remembered-0.193
 tried-0.191
 decided-0.190
 could-0.189
 didn-0.188
Token tent
Token ID11105
Feature activation-7.61e-03

Pos logit contributions
igsaw+0.264
ummy+0.209
 difference+0.208
opl+0.207
umm+0.205
Neg logit contributions
 tried-0.305
 remembered-0.303
 decided-0.295
 didn-0.294
 could-0.291
Token with
Token ID351
Feature activation-4.19e-02

Pos logit contributions
igsaw+0.017
 difference+0.014
ummy+0.014
opl+0.014
umm+0.014
Neg logit contributions
 remembered-0.012
 tried-0.012
 decided-0.012
 could-0.012
 didn-0.012
Token many
Token ID867
Feature activation-1.11e-01

Pos logit contributions
igsaw+0.082
 difference+0.067
opl+0.064
ummy+0.064
umm+0.063
Neg logit contributions
 decided-0.085
 tried-0.083
 could-0.083
 remembered-0.082
 had-0.080
Token colors
Token ID7577
Feature activation-3.57e-02

Pos logit contributions
igsaw+0.186
 difference+0.140
umm+0.139
ummy+0.138
opl+0.136
Neg logit contributions
 tried-0.251
 remembered-0.250
 decided-0.249
 could-0.249
 didn-0.242
Token it
Token ID340
Feature activation+8.38e-02

Pos logit contributions
 remembered+0.027
 tried+0.026
 decided+0.026
 managed+0.026
'd+0.025
Neg logit contributions
igsaw-0.046
 difference-0.041
ummy-0.040
opl-0.039
 types-0.038
Token could
Token ID714
Feature activation+1.07e-01

Pos logit contributions
 remembered+0.073
 decided+0.072
'd+0.067
 managed+0.067
 tried+0.063
Neg logit contributions
igsaw-0.228
ummy-0.195
 difference-0.193
 new-0.192
opl-0.192
Token fit
Token ID4197
Feature activation-7.56e-02

Pos logit contributions
 remembered+0.160
 tried+0.157
 decided+0.155
 could+0.153
 managed+0.152
Neg logit contributions
igsaw-0.245
 difference-0.212
ummy-0.207
opl-0.206
umm-0.199
Token in
Token ID287
Feature activation-2.41e-02

Pos logit contributions
igsaw+0.160
 difference+0.136
ummy+0.134
opl+0.132
umm+0.127
Neg logit contributions
 remembered-0.128
 tried-0.126
 decided-0.125
 could-0.123
 didn-0.122
Token its
Token ID663
Feature activation-7.58e-02

Pos logit contributions
igsaw+0.050
 difference+0.042
ummy+0.042
umm+0.041
opl+0.041
Neg logit contributions
 tried-0.042
 remembered-0.042
 decided-0.042
 didn-0.041
 could-0.041
Token j
Token ID474
Feature activation-1.77e-01

Pos logit contributions
igsaw+0.139
ummy+0.110
umm+0.108
opl+0.108
 types+0.106
Neg logit contributions
 remembered-0.158
 tried-0.154
 didn-0.150
 decided-0.150
 could-0.148
Tokenogging
Token ID30853
Feature activation-1.00e-01

Pos logit contributions
igsaw+0.186
 difference+0.176
 bou+0.167
 types+0.156
 art+0.152
Neg logit contributions
 tried-0.426
 could-0.421
 didn-0.419
 got-0.418
 remembered-0.412
Token pants
Token ID12581
Feature activation+4.62e-04

Pos logit contributions
igsaw+0.222
 difference+0.187
ummy+0.186
opl+0.183
umm+0.181
Neg logit contributions
 tried-0.162
 could-0.161
 remembered-0.161
 decided-0.159
 didn-0.159
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
 remembered+0.001
 tried+0.001
 decided+0.001
 managed+0.001
 didn+0.001
Neg logit contributions
igsaw-0.001
ummy-0.001
 difference-0.001
opl-0.001
 types-0.001
Token it
Token ID340
Feature activation+8.87e-02

Pos logit contributions
 remembered+0.023
 tried+0.022
 decided+0.022
'd+0.022
 managed+0.022
Neg logit contributions
igsaw-0.038
 difference-0.034
ummy-0.033
 art-0.032
 types-0.032
Token was
Token ID373
Feature activation+7.92e-02

Pos logit contributions
 remembered+0.057
Please+0.053
 tried+0.052
'd+0.052
 decided+0.051
Neg logit contributions
igsaw-0.224
 difference-0.210
 new-0.207
 types-0.205
 biggest-0.203
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
 remembered+0.034
 tried+0.034
 decided+0.033
 could+0.033
 didn+0.033
Neg logit contributions
igsaw-0.039
 difference-0.032
ummy-0.032
opl-0.031
umm-0.031
Token little
Token ID1310
Feature activation-7.62e-02

Pos logit contributions
igsaw+0.244
opl+0.208
 difference+0.199
ummy+0.196
umm+0.191
Neg logit contributions
 tried-0.218
 remembered-0.216
 decided-0.216
 didn-0.213
 could-0.212
Token boy
Token ID2933
Feature activation+9.48e-02

Pos logit contributions
igsaw+0.154
opl+0.128
 difference+0.126
ummy+0.125
umm+0.120
Neg logit contributions
 tried-0.141
 remembered-0.141
 decided-0.139
 could-0.137
 didn-0.136
Token had
Token ID550
Feature activation+9.64e-02

Pos logit contributions
 remembered+0.093
'd+0.091
 managed+0.090
 tried+0.090
 wouldn+0.088
Neg logit contributions
igsaw-0.227
ummy-0.210
 difference-0.209
 types-0.206
 batch-0.201
Token the
Token ID262
Feature activation-1.28e-01

Pos logit contributions
 remembered+0.158
 tried+0.156
 decided+0.155
 could+0.153
 didn+0.152
Neg logit contributions
igsaw-0.213
 difference-0.181
ummy-0.179
opl-0.177
umm-0.173
Token y
Token ID331
Feature activation-1.75e-01

Pos logit contributions
igsaw+0.271
opl+0.223
umm+0.200
ummy+0.199
 languages+0.196
Neg logit contributions
 tried-0.244
 decided-0.237
 didn-0.232
 could-0.227
 got-0.227
Tokenumm
Token ID13929
Feature activation-1.26e-01

Pos logit contributions
igsaw+0.088
 difference+0.031
opl+0.027
 types+0.010
 bou+0.010
Neg logit contributions
 tried-0.569
 remembered-0.566
 decided-0.564
 didn-0.558
 could-0.556
Tokeniest
Token ID6386
Feature activation-8.65e-02

Pos logit contributions
igsaw+0.150
ummy+0.125
umm+0.116
 difference+0.108
 types+0.106
Neg logit contributions
 remembered-0.326
 accidentally-0.322
 didn-0.320
 tried-0.319
 decided-0.318
Token treat
Token ID2190
Feature activation-1.51e-02

Pos logit contributions
igsaw+0.142
 difference+0.114
ummy+0.111
opl+0.110
umm+0.106
Neg logit contributions
 remembered-0.188
 tried-0.187
 decided-0.185
 could-0.184
 didn-0.183
Token.
Token ID13
Feature activation-1.77e-03

Pos logit contributions
igsaw+0.035
ummy+0.031
 difference+0.030
 types+0.029
 birthday+0.029
Neg logit contributions
 remembered-0.022
 tried-0.021
 didn-0.020
 accidentally-0.020
 managed-0.020
Token He
Token ID679
Feature activation+1.68e-01

Pos logit contributions
igsaw+0.005
 difference+0.004
 bou+0.004
opl+0.004
ummy+0.004
Neg logit contributions
 could-0.002
 tried-0.002
 decided-0.002
 didn-0.002
 remembered-0.002
Token to
Token ID284
Feature activation+3.93e-02

Pos logit contributions
'd+0.103
 remembered+0.100
 managed+0.098
 tried+0.096
 could+0.096
Neg logit contributions
igsaw-0.077
 new-0.070
 difference-0.066
 art-0.065
ummy-0.062
Token go
Token ID467
Feature activation-6.43e-03

Pos logit contributions
 remembered+0.072
 managed+0.067
 could+0.067
'd+0.066
 decided+0.066
Neg logit contributions
igsaw-0.073
 difference-0.069
ummy-0.063
 birthday-0.060
 art-0.060
Token back
Token ID736
Feature activation+2.89e-03

Pos logit contributions
igsaw+0.013
 difference+0.011
ummy+0.010
opl+0.010
 new+0.010
Neg logit contributions
 remembered-0.012
 didn-0.011
 tried-0.011
 could-0.011
 decided-0.011
Token to
Token ID284
Feature activation+2.12e-02

Pos logit contributions
 remembered+0.005
 tried+0.005
 decided+0.005
 could+0.005
 didn+0.004
Neg logit contributions
igsaw-0.006
 difference-0.006
ummy-0.005
opl-0.005
umm-0.005
Token his
Token ID465
Feature activation-4.76e-02

Pos logit contributions
 tried+0.035
 remembered+0.035
 decided+0.034
 didn+0.034
 could+0.034
Neg logit contributions
igsaw-0.047
ummy-0.040
 difference-0.040
umm-0.039
opl-0.039
Token ig
Token ID45329
Feature activation-1.77e-01

Pos logit contributions
igsaw+0.089
umm+0.065
opl+0.063
ay+0.063
ummy+0.061
Neg logit contributions
 remembered-0.109
 tried-0.105
 decided-0.104
 didn-0.103
 had-0.102
Tokenloo
Token ID29680
Feature activation-1.02e-02

Pos logit contributions
igsaw+0.244
 difference+0.180
 types+0.163
ummy+0.163
 bou+0.161
Neg logit contributions
 remembered-0.427
 could-0.426
 tried-0.426
 decided-0.423
 didn-0.423
Token and
Token ID290
Feature activation+2.00e-02

Pos logit contributions
igsaw+0.026
 difference+0.023
ummy+0.022
opl+0.022
umm+0.022
Neg logit contributions
 remembered-0.013
 tried-0.013
 decided-0.013
 could-0.013
 didn-0.012
Token ask
Token ID1265
Feature activation+2.76e-03

Pos logit contributions
 remembered+0.033
'd+0.032
pped+0.032
 managed+0.032
 stumbled+0.031
Neg logit contributions
igsaw-0.035
 difference-0.033
 new-0.031
 birthday-0.031
 art-0.031
Token his
Token ID465
Feature activation-3.18e-02

Pos logit contributions
 remembered+0.005
 tried+0.005
 decided+0.005
 could+0.005
 didn+0.005
Neg logit contributions
igsaw-0.006
opl-0.005
 difference-0.005
umm-0.005
ummy-0.005
Token family
Token ID1641
Feature activation+3.30e-02

Pos logit contributions
igsaw+0.072
opl+0.056
umm+0.056
ummy+0.054
ay+0.053
Neg logit contributions
 remembered-0.060
 decided-0.057
 tried-0.056
 could-0.056
 didn-0.055
Token "
Token ID366
Feature activation+1.41e-02

Pos logit contributions
 remembered+0.049
 managed+0.048
'd+0.047
 decided+0.047
 stumbled+0.046
Neg logit contributions
igsaw-0.025
ummy-0.023
 new-0.022
 Other-0.020
 Of-0.020
TokenOh
Token ID5812
Feature activation+3.87e-02

Pos logit contributions
'd+0.021
 remembered+0.021
 could+0.020
 managed+0.020
 wouldn+0.019
Neg logit contributions
igsaw-0.027
 difference-0.025
umm-0.025
Whe-0.025
ummy-0.024
Token no
Token ID645
Feature activation-1.86e-03

Pos logit contributions
 decided+0.052
 tried+0.051
 remembered+0.051
'd+0.050
 managed+0.050
Neg logit contributions
igsaw-0.093
ummy-0.083
 difference-0.081
opl-0.078
ay-0.078
Token,
Token ID11
Feature activation+1.82e-02

Pos logit contributions
igsaw+0.004
 difference+0.004
opl+0.004
umm+0.004
 types+0.004
Neg logit contributions
 remembered-0.003
 tried-0.003
 didn-0.003
 accidentally-0.003
 decided-0.003
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
'd+0.043
 remembered+0.042
 decided+0.042
 stumbled+0.042
 managed+0.042
Neg logit contributions
igsaw-0.025
 difference-0.020
ummy-0.020
 new-0.019
 art-0.018
Token v
Token ID410
Feature activation-1.77e-01

Pos logit contributions
igsaw+0.226
ummy+0.178
opl+0.177
 difference+0.168
 types+0.168
Neg logit contributions
 tried-0.252
 remembered-0.245
 decided-0.243
 didn-0.241
 accidentally-0.241
Tokenase
Token ID589
Feature activation+3.39e-02

Pos logit contributions
igsaw+0.260
 difference+0.205
ummy+0.195
opl+0.190
 types+0.185
Neg logit contributions
 remembered-0.415
 tried-0.412
 decided-0.409
 could-0.409
 didn-0.407
Token is
Token ID318
Feature activation+7.11e-02

Pos logit contributions
 remembered+0.073
 decided+0.073
 stumbled+0.072
 managed+0.070
'd+0.070
Neg logit contributions
igsaw-0.046
 difference-0.041
ummy-0.039
opl-0.037
ay-0.037
Token broken
Token ID5445
Feature activation-7.33e-02

Pos logit contributions
'd+0.105
 decided+0.104
 remembered+0.103
 managed+0.099
 could+0.099
Neg logit contributions
igsaw-0.157
ummy-0.134
 difference-0.134
 art-0.134
 new-0.133
Token!
Token ID0
Feature activation-7.15e-04

Pos logit contributions
igsaw+0.159
ummy+0.137
 difference+0.136
opl+0.132
umm+0.129
Neg logit contributions
 remembered-0.119
 tried-0.117
 decided-0.115
 managed-0.114
 didn-0.113
Token Can
Token ID1680
Feature activation+4.34e-02

Pos logit contributions
igsaw+0.001
 difference+0.001
ummy+0.001
opl+0.001
umm+0.001
Neg logit contributions
 remembered-0.001
 tried-0.001
 decided-0.001
 could-0.001
 didn-0.001
Token could
Token ID714
Feature activation+1.13e-01

Pos logit contributions
 remembered+0.075
'd+0.070
 accidentally+0.068
 unable+0.067
 stumbled+0.067
Neg logit contributions
igsaw-0.220
 types-0.205
 difference-0.204
 new-0.202
 art-0.195
Token help
Token ID1037
Feature activation-2.62e-02

Pos logit contributions
 remembered+0.167
 decided+0.160
 tried+0.159
 managed+0.154
 didn+0.154
Neg logit contributions
igsaw-0.256
 difference-0.225
ummy-0.217
opl-0.215
 art-0.209
Token get
Token ID651
Feature activation-1.52e-02

Pos logit contributions
igsaw+0.053
 difference+0.048
ummy+0.046
opl+0.044
 new+0.044
Neg logit contributions
 remembered-0.045
 tried-0.043
 decided-0.043
 managed-0.043
 could-0.042
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
igsaw+0.029
 difference+0.025
ummy+0.024
 new+0.024
 types+0.024
Neg logit contributions
 remembered-0.027
'd-0.027
 tried-0.026
 didn-0.026
 decided-0.026
Token car
Token ID1097
Feature activation-5.01e-03

Pos logit contributions
igsaw+0.254
 difference+0.201
ummy+0.200
opl+0.197
umm+0.190
Neg logit contributions
 remembered-0.316
 tried-0.315
 decided-0.311
 could-0.309
 didn-0.308
Token unst
Token ID15014
Feature activation-1.74e-01

Pos logit contributions
igsaw+0.011
 difference+0.010
ummy+0.009
opl+0.009
umm+0.009
Neg logit contributions
 remembered-0.008
 tried-0.008
 decided-0.008
 could-0.008
 didn-0.008
Tokenuck
Token ID1347
Feature activation-3.37e-03

Pos logit contributions
igsaw+0.254
ummy+0.211
umm+0.201
 difference+0.201
opl+0.195
Neg logit contributions
 remembered-0.412
'd-0.403
 could-0.401
 tried-0.400
 didn-0.397
Token.
Token ID13
Feature activation-9.83e-04

Pos logit contributions
igsaw+0.008
 difference+0.006
ummy+0.006
opl+0.006
 types+0.006
Neg logit contributions
 remembered-0.005
 tried-0.005
 decided-0.005
 didn-0.005
 could-0.005
Token
Token ID198
Feature activation-5.50e-02

Pos logit contributions
igsaw+0.002
 difference+0.002
opl+0.002
 types+0.002
ummy+0.002
Neg logit contributions
 tried-0.002
 could-0.002
 decided-0.002
 remembered-0.002
 didn-0.002
Token
Token ID198
Feature activation-5.31e-02

Pos logit contributions
igsaw+0.111
 difference+0.093
ummy+0.091
umm+0.090
opl+0.090
Neg logit contributions
 tried-0.099
 remembered-0.099
 decided-0.098
 could-0.098
 didn-0.096
TokenThe
Token ID464
Feature activation-1.10e-01

Pos logit contributions
igsaw+0.107
 difference+0.090
ummy+0.088
opl+0.087
ay+0.084
Neg logit contributions
 remembered-0.094
 tried-0.092
 could-0.091
 wouldn-0.090
 didn-0.090