TOP ACTIVATIONS
MAX = 4.959

sorry , Tim . I should have played with you ."
too , Sue . I should not have tried to take
we buy this sauce , please ?" His mom said ,
sorry , Tim . I should have told you to be
eat the sandwich now , please ?" His mom said ,
I 'm sorry . I should have told you the pot
I 'm sorry , Lily . I 'm sorry , Ben
I have a lemon , please ?" His mom
, " L ily , will you help me make lemon
I 'm sorry , Ben . I was mean . I
I have the apple , please ?" His mom said ,
the l oll ipop , please ?" His mom said ,
'm sorry , Mom my . I won 't touch insects
'm sorry , Mom my . I was naughty . I
I was naughty . I should have stopped ."
L ily didn 't want to leave her friend alone ,
I 'm sorry , mom . I wanted to taste the
Lily . You did it . You ate all your soup
sorry , Sam . I should not have taken your truck
sorry , Anna . I should have listened to you ,"

MIN ACTIVATIONS
MIN = -4.221

spent their time together . One day , the
that frogs talk about . The two were very happy
would last forever . At the end of each
have a friend ." The rain stopped , and
of being outdoors . At the end of the
the clothes were dry ! The h anger was now
lots of fun . The little butter is no
that would last forever . At the end of
admire its beauty . One day , a little
continued on his way . The little boy was very
to his mom . Max i loved having dinner
happily ever after . The moral of the story
wait to explore more . Later that day ,
park and play together . One day , they
'm still hungry ". The cone was angry and
having the best time . The raven was so
and always played together . One day , Lucy
nice to one another . The End . <|endoftext|> Once
really enjoyed themselves . Finally the frog and the
that everyone was happy . One day , he

INTERVAL 2.664 - 4.959
CONTAINS 2.067%

and asked the first , “ what
. Tim was so happy to be on the boat .
her head . She did not want the pill . She
not happy . They wanted to stay home and play with
. " What are you doing here , children

INTERVAL 0.369 - 2.664
CONTAINS 33.433%

. She told Lily that they needed to use a needle
not be afraid of it . Lily listened to her mom
Can we have the rabbit , Mom ?" they asked .
made the rope go over and under and around , and
L ily decided to show her coin to her

INTERVAL -1.926 - 0.369
CONTAINS 60.369%

it under a bush and try again later .
and she wanted her mom my to bring her a night
a bath first , mom ?" Their mom laughs and says
it was loaded with lots of snacks and drinks for
't need you !" Her mom sighed and left

INTERVAL -4.221 - -1.926
CONTAINS 4.131%

. The block flew out of their hands and hit their
still said no . John started to cry
my found it fascinating . Tim my wanted to ride the
man 's work . The man was very pleased
, they spread the blanket on the grass and began to
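
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX, each (4.959 - (-4.221)) / 4 = 2.295 wide. The sketch below reproduces the bin edges under that assumption and checks that the CONTAINS percentages sum to 100. The function name `interval_edges` and the three-decimal rounding are illustrative choices, not taken from the tool that produced this output.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX activations as reported above.
edges = interval_edges(-4.221, 4.959)

# CONTAINS percentages as reported above, from the highest interval to the lowest.
shares = [2.067, 33.433, 60.369, 4.131]
total_share = sum(shares)

print(edges)
print(round(total_share, 3))
```

If the equal-width assumption holds, the printed edges match the INTERVAL boundaries listed above (-4.221, -1.926, 0.369, 2.664, 4.959).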
Token sorry
Token ID7926
Feature activation+3.74e+00

Pos logit contributions
 Sorry+7.415
 I+6.899
 give+6.784
 leave+6.483
 thank+6.377
Neg logit contributions
 sudden-6.939
 nowhere-6.897
side-6.812
amed-6.571
 nearby-6.477
Token,
Token ID11
Feature activation+1.96e+00

Pos logit contributions
 give+6.576
 Sorry+6.301
 leave+5.968
 share+5.887
 ask+5.809
Neg logit contributions
 sudden-7.745
side-7.351
 nearby-7.104
Meanwhile-6.965
J-6.818
Token Tim
Token ID5045
Feature activation+1.73e+00

Pos logit contributions
 give+2.715
 Sorry+2.595
 I+2.590
 leave+2.455
 thank+2.385
Neg logit contributions
side-4.563
 sudden-4.519
amed-4.203
Meanwhile-4.200
 distance-4.093
Token.
Token ID13
Feature activation+4.07e+00

Pos logit contributions
 I+3.271
 give+3.271
 Sorry+3.240
 leave+3.067
 thank+3.058
Neg logit contributions
side-3.284
 sudden-3.117
amed-2.986
Meanwhile-2.848
 distance-2.794
Token I
Token ID314
Feature activation+1.88e+00

Pos logit contributions
 give+5.512
 leave+5.090
 belong+4.834
 thank+4.755
 share+4.653
Neg logit contributions
side-10.525
 sudden-10.129
Meanwhile-9.717
amed-9.507
The-9.287
Token should
Token ID815
Feature activation+4.96e+00

Pos logit contributions
 I+3.332
 give+3.151
 Sorry+3.104
 leave+2.984
 thank+2.981
Neg logit contributions
side-3.916
 sudden-3.796
amed-3.625
Meanwhile-3.442
J-3.380
Token have
Token ID423
Feature activation+1.05e+00

Pos logit contributions
 Sorry+7.072
 I+6.372
 thank+6.039
 give+5.919
 leave+5.859
Neg logit contributions
side-11.252
 sudden-11.114
 nowhere-10.756
 nearby-10.720
J-10.613
Token played
Token ID2826
Feature activation+5.49e-01

Pos logit contributions
 I+1.889
 give+1.853
 Sorry+1.769
 thank+1.721
 leave+1.720
Neg logit contributions
side-2.187
 sudden-2.095
amed-2.004
Meanwhile-1.917
 distance-1.842
Token with
Token ID351
Feature activation+4.09e-01

Pos logit contributions
 I+1.058
 give+1.042
 Sorry+0.988
 thank+0.968
 leave+0.964
Neg logit contributions
side-1.078
 sudden-1.025
amed-0.967
Meanwhile-0.935
 distance-0.901
Token you
Token ID345
Feature activation+5.52e-01

Pos logit contributions
 I+0.432
 give+0.414
 Sorry+0.377
 thank+0.360
 leave+0.357
Neg logit contributions
side-1.171
 sudden-1.126
amed-1.090
Meanwhile-1.064
 distance-1.038
Token."
Token ID526
Feature activation+1.15e+00

Pos logit contributions
 I+0.759
 give+0.743
 Sorry+0.688
 leave+0.667
 thank+0.667
Neg logit contributions
side-1.387
 sudden-1.335
amed-1.280
Meanwhile-1.245
 distance-1.210
Token too
Token ID1165
Feature activation+3.00e+00

Pos logit contributions
 give+6.990
 Sorry+6.856
 leave+6.388
 belong+6.320
 thank+6.211
Neg logit contributions
 sudden-7.735
side-7.298
 nearby-7.136
Meanwhile-7.005
J-6.929
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 give+5.155
 I+5.042
 Sorry+4.886
 leave+4.775
 share+4.656
Neg logit contributions
side-6.481
 sudden-6.376
Meanwhile-5.724
amed-5.620
 distance-5.568
Token Sue
Token ID26089
Feature activation+2.09e+00

Pos logit contributions
 give+1.890
 I+1.864
 Sorry+1.791
 leave+1.717
 thank+1.690
Neg logit contributions
side-3.078
 sudden-3.005
amed-2.832
Meanwhile-2.792
 distance-2.710
Token.
Token ID13
Feature activation+3.79e+00

Pos logit contributions
 give+4.090
 Sorry+4.087
 I+4.031
 leave+3.871
 thank+3.817
Neg logit contributions
side-3.758
 sudden-3.544
amed-3.397
Meanwhile-3.232
 nearby-3.217
Token I
Token ID314
Feature activation+1.76e+00

Pos logit contributions
 give+4.721
 leave+4.406
 belong+4.088
 thank+3.890
 lose+3.838
Neg logit contributions
side-10.020
 sudden-9.547
Meanwhile-9.385
amed-9.231
 corner-9.024
Token should
Token ID815
Feature activation+4.82e+00

Pos logit contributions
 I+3.295
 give+3.136
 Sorry+3.074
 leave+2.980
 thank+2.950
Neg logit contributions
side-3.500
 sudden-3.365
amed-3.204
Meanwhile-3.054
J-2.977
Token not
Token ID407
Feature activation+3.20e+00

Pos logit contributions
 Sorry+6.699
 I+6.229
 give+5.948
 leave+5.887
 thank+5.857
Neg logit contributions
side-10.806
 sudden-10.600
J-10.213
 nearby-10.175
The-10.010
Token have
Token ID423
Feature activation+7.42e-01

Pos logit contributions
 I+4.152
 Sorry+4.145
 thank+3.800
 give+3.723
 sorry+3.642
Neg logit contributions
side-7.639
 sudden-7.412
 nearby-7.234
J-7.100
 distance-7.060
Token tried
Token ID3088
Feature activation+1.24e+00

Pos logit contributions
 I+1.394
 give+1.376
 Sorry+1.309
 thank+1.276
 leave+1.275
Neg logit contributions
side-1.484
 sudden-1.409
amed-1.338
Meanwhile-1.288
 distance-1.242
Token to
Token ID284
Feature activation+1.85e+00

Pos logit contributions
 give+2.106
 I+2.103
 Sorry+2.031
 thank+1.957
 leave+1.948
Neg logit contributions
side-2.631
 sudden-2.513
amed-2.371
Meanwhile-2.307
 distance-2.239
Token take
Token ID1011
Feature activation+1.02e+00

Pos logit contributions
 I+2.419
 Sorry+2.281
 give+2.223
 thank+2.078
 leave+2.019
Neg logit contributions
side-4.614
 sudden-4.504
amed-4.287
Meanwhile-4.162
 distance-4.132
Token we
Token ID356
Feature activation+3.62e+00

Pos logit contributions
 give+2.919
 leave+2.760
 ask+2.503
 thank+2.413
 Sorry+2.349
Neg logit contributions
 sudden-13.114
side-12.485
The-12.349
 nowhere-12.185
Meanwhile-11.809
Token buy
Token ID2822
Feature activation+1.11e+00

Pos logit contributions
 Sorry+5.108
 I+4.948
 sorry+4.503
 thank+4.435
 give+4.251
Neg logit contributions
side-8.780
 sudden-8.488
 nearby-8.384
 distance-7.996
At-7.791
Token this
Token ID428
Feature activation+1.52e+00

Pos logit contributions
 I+2.133
 give+2.115
 Sorry+2.044
 leave+1.980
 thank+1.974
Neg logit contributions
side-2.139
 sudden-2.032
amed-1.921
Meanwhile-1.839
 distance-1.768
Token sauce
Token ID10746
Feature activation+1.28e+00

Pos logit contributions
 I+2.822
 give+2.797
 Sorry+2.744
 leave+2.671
 thank+2.632
Neg logit contributions
side-2.984
 sudden-2.818
amed-2.652
Meanwhile-2.564
 distance-2.498
Token,
Token ID11
Feature activation+1.80e+00

Pos logit contributions
 I+1.234
 give+1.231
 Sorry+1.139
 leave+1.072
 thank+1.069
Neg logit contributions
side-3.687
 sudden-3.475
amed-3.415
Meanwhile-3.312
 distance-3.236
Token please
Token ID3387
Feature activation+4.82e+00

Pos logit contributions
 give+1.926
 I+1.881
 Sorry+1.839
 leave+1.687
 thank+1.659
Neg logit contributions
side-4.802
 sudden-4.733
amed-4.451
Meanwhile-4.429
 nearby-4.305
Token?"
Token ID1701
Feature activation+1.23e+00

Pos logit contributions
 Sorry+6.583
 seem+6.066
 belong+5.909
 sorry+5.806
 thank+5.774
Neg logit contributions
 nearby-11.871
side-11.791
 corner-11.402
 sudden-11.310
ep-11.270
Token His
Token ID2399
Feature activation+7.64e-01

Pos logit contributions
 I+2.183
 give+2.157
 Sorry+2.029
 leave+2.002
 thank+1.990
Neg logit contributions
side-2.586
 sudden-2.484
amed-2.324
Meanwhile-2.268
 distance-2.191
Token mom
Token ID1995
Feature activation+5.34e-01

Pos logit contributions
 I+1.160
 give+1.140
 Sorry+1.071
 thank+1.036
 leave+1.035
Neg logit contributions
side-1.814
 sudden-1.739
amed-1.664
Meanwhile-1.612
 distance-1.565
Token said
Token ID531
Feature activation+2.11e+00

Pos logit contributions
 I+1.016
 give+1.006
 Sorry+0.949
 thank+0.933
 leave+0.928
Neg logit contributions
side-1.064
 sudden-1.005
amed-0.951
Meanwhile-0.918
 distance-0.890
Token,
Token ID11
Feature activation+8.44e-01

Pos logit contributions
 give+2.782
 I+2.760
 Sorry+2.746
 leave+2.583
 ask+2.494
Neg logit contributions
side-5.326
 sudden-5.083
amed-4.850
Meanwhile-4.666
J-4.626
Token sorry
Token ID7926
Feature activation+3.47e+00

Pos logit contributions
 Sorry+5.532
 I+5.336
 give+5.210
 thank+4.956
 leave+4.873
Neg logit contributions
 sudden-5.509
side-5.213
amed-4.989
 nearby-4.978
 nowhere-4.958
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
 give+6.517
 Sorry+6.432
 leave+6.048
 ask+5.926
 share+5.901
Neg logit contributions
 sudden-6.658
side-6.212
 nearby-5.977
J-5.914
Meanwhile-5.868
Token Tim
Token ID5045
Feature activation+1.83e+00

Pos logit contributions
 give+2.242
 I+2.179
 Sorry+2.156
 leave+2.039
 ask+1.983
Neg logit contributions
side-4.045
 sudden-4.018
amed-3.732
Meanwhile-3.718
 distance-3.639
Token.
Token ID13
Feature activation+3.76e+00

Pos logit contributions
 I+3.524
 give+3.523
 Sorry+3.478
 leave+3.304
 thank+3.295
Neg logit contributions
side-3.367
 sudden-3.220
amed-3.065
Meanwhile-2.916
 distance-2.882
Token I
Token ID314
Feature activation+1.64e+00

Pos logit contributions
 give+5.532
 leave+5.122
 belong+4.770
 share+4.692
 ask+4.648
Neg logit contributions
side-9.086
 sudden-9.020
Meanwhile-8.468
amed-8.219
The-8.146
Token should
Token ID815
Feature activation+4.79e+00

Pos logit contributions
 I+2.918
 give+2.784
 Sorry+2.717
 leave+2.611
 thank+2.605
Neg logit contributions
side-3.415
 sudden-3.317
amed-3.126
Meanwhile-3.007
 distance-2.926
Token have
Token ID423
Feature activation+7.79e-01

Pos logit contributions
 Sorry+7.752
 I+7.050
 give+6.700
 thank+6.660
 leave+6.217
Neg logit contributions
 sudden-10.613
 nearby-9.962
side-9.839
The-9.446
Meanwhile-9.226
Token told
Token ID1297
Feature activation+1.35e+00

Pos logit contributions
 I+1.374
 give+1.354
 Sorry+1.285
 thank+1.251
 leave+1.248
Neg logit contributions
side-1.647
 sudden-1.570
amed-1.496
Meanwhile-1.442
 distance-1.394
Token you
Token ID345
Feature activation+1.81e+00

Pos logit contributions
 give+1.494
 I+1.480
 Sorry+1.401
 thank+1.296
 leave+1.294
Neg logit contributions
side-3.650
 sudden-3.536
amed-3.381
Meanwhile-3.311
 distance-3.230
Token to
Token ID284
Feature activation+2.41e+00

Pos logit contributions
 give+2.940
 Sorry+2.916
 I+2.904
 leave+2.675
 thank+2.669
Neg logit contributions
side-4.020
 sudden-3.745
amed-3.551
Meanwhile-3.486
 distance-3.401
Token be
Token ID307
Feature activation+1.11e+00

Pos logit contributions
 I+2.997
 Sorry+2.890
 give+2.692
 thank+2.474
 leave+2.337
Neg logit contributions
side-6.190
 sudden-5.983
 distance-5.701
amed-5.614
 nearby-5.578
Token eat
Token ID4483
Feature activation+1.00e+00

Pos logit contributions
 Sorry+4.073
 I+3.616
 sorry+3.354
 thank+3.285
 give+3.271
Neg logit contributions
side-8.886
 sudden-8.619
 nearby-8.436
 corner-8.389
 distance-8.286
Token the
Token ID262
Feature activation-9.48e-01

Pos logit contributions
 I+1.802
 give+1.785
 Sorry+1.712
 leave+1.658
 thank+1.653
Neg logit contributions
side-2.052
 sudden-1.950
amed-1.846
Meanwhile-1.781
 distance-1.719
Token sandwich
Token ID20433
Feature activation+1.16e+00

Pos logit contributions
side+2.165
 sudden+2.030
amed+2.003
Meanwhile+1.939
eled+1.817
Neg logit contributions
 I-1.666
 give-1.610
 thank-1.494
 Sorry-1.455
 leave-1.424
Token now
Token ID783
Feature activation+3.01e+00

Pos logit contributions
 I+1.483
 give+1.479
 Sorry+1.385
 leave+1.325
 thank+1.325
Neg logit contributions
side-2.986
 sudden-2.817
amed-2.750
Meanwhile-2.656
 distance-2.588
Token,
Token ID11
Feature activation+2.59e+00

Pos logit contributions
 give+3.170
 Sorry+3.148
 I+3.016
 leave+2.832
 thank+2.741
Neg logit contributions
side-8.341
 sudden-7.704
amed-7.638
 nearby-7.503
Meanwhile-7.321
Token please
Token ID3387
Feature activation+4.76e+00

Pos logit contributions
 give+3.076
 Sorry+3.035
 I+2.840
 leave+2.743
 thank+2.639
Neg logit contributions
 sudden-6.309
side-6.290
 nearby-5.915
Meanwhile-5.914
amed-5.836
Token?"
Token ID1701
Feature activation+8.68e-01

Pos logit contributions
 Sorry+6.056
 belong+5.357
 seem+5.331
 sorry+5.172
 leave+5.171
Neg logit contributions
side-11.992
 nearby-11.931
 corner-11.414
 sudden-11.063
M-11.036
Token His
Token ID2399
Feature activation+5.32e-01

Pos logit contributions
 I+1.854
 give+1.832
 Sorry+1.751
 leave+1.717
 thank+1.715
Neg logit contributions
side-1.515
 sudden-1.428
amed-1.339
Meanwhile-1.286
 distance-1.232
Token mom
Token ID1995
Feature activation+5.14e-01

Pos logit contributions
 I+0.784
 give+0.773
 Sorry+0.722
 thank+0.703
 leave+0.700
Neg logit contributions
side-1.295
 sudden-1.231
amed-1.185
Meanwhile-1.150
 distance-1.115
Token said
Token ID531
Feature activation+1.93e+00

Pos logit contributions
 I+0.975
 give+0.966
 Sorry+0.910
 thank+0.897
 leave+0.891
Neg logit contributions
side-1.027
 sudden-0.971
amed-0.920
Meanwhile-0.887
 distance-0.860
Token,
Token ID11
Feature activation+5.83e-01

Pos logit contributions
 give+2.496
 I+2.480
 Sorry+2.467
 leave+2.319
 ask+2.222
Neg logit contributions
side-4.908
 sudden-4.669
amed-4.472
Meanwhile-4.307
J-4.262
Token I
Token ID314
Feature activation+1.42e+00

Pos logit contributions
 give+5.289
 leave+4.714
 share+4.468
 thank+4.310
 ask+4.193
Neg logit contributions
side-10.588
 sudden-10.252
 distance-9.820
 corner-9.741
Meanwhile-9.685
Token'm
Token ID1101
Feature activation+1.99e+00

Pos logit contributions
 I+2.985
 give+2.890
 Sorry+2.813
 leave+2.740
 thank+2.739
Neg logit contributions
side-2.508
 sudden-2.430
amed-2.265
Meanwhile-2.166
J-2.082
Token sorry
Token ID7926
Feature activation+3.49e+00

Pos logit contributions
 give+2.222
 Sorry+2.156
 I+2.114
 leave+1.950
 thank+1.918
Neg logit contributions
 sudden-5.345
side-5.331
amed-5.164
Meanwhile-5.028
 nowhere-4.937
Token.
Token ID13
Feature activation+3.97e+00

Pos logit contributions
 Sorry+7.478
 give+7.402
 leave+7.013
 share+6.816
 belong+6.815
Neg logit contributions
 sudden-5.557
side-5.293
 nearby-5.100
Meanwhile-5.007
 corner-4.921
Token I
Token ID314
Feature activation+1.35e+00

Pos logit contributions
 give+4.988
 leave+4.549
 share+4.110
 thank+3.988
 belong+3.903
Neg logit contributions
side-10.384
 sudden-10.258
 corner-9.771
Meanwhile-9.737
amed-9.679
Token should
Token ID815
Feature activation+4.74e+00

Pos logit contributions
 I+2.832
 give+2.747
 Sorry+2.673
 thank+2.599
 leave+2.597
Neg logit contributions
side-2.375
 sudden-2.295
amed-2.140
Meanwhile-2.052
 distance-1.964
Token have
Token ID423
Feature activation+5.82e-01

Pos logit contributions
 Sorry+6.947
 give+6.265
 I+6.184
 thank+6.075
 leave+5.929
Neg logit contributions
 sudden-10.777
side-10.158
 nearby-9.971
The-9.952
Meanwhile-9.770
Token told
Token ID1297
Feature activation+9.84e-01

Pos logit contributions
 I+1.007
 give+0.995
 Sorry+0.940
 thank+0.914
 leave+0.913
Neg logit contributions
side-1.249
 sudden-1.186
amed-1.128
Meanwhile-1.092
 distance-1.058
Token you
Token ID345
Feature activation+1.65e+00

Pos logit contributions
 I+0.776
 give+0.767
 Sorry+0.686
 thank+0.630
 leave+0.629
Neg logit contributions
side-3.001
 sudden-2.903
amed-2.802
Meanwhile-2.744
 distance-2.684
Token the
Token ID262
Feature activation-2.04e-01

Pos logit contributions
 give+2.823
 I+2.812
 Sorry+2.788
 leave+2.588
 thank+2.582
Neg logit contributions
side-3.498
 sudden-3.288
amed-3.086
Meanwhile-3.025
 distance-2.947
Token pot
Token ID1787
Feature activation+4.00e-01

Pos logit contributions
side+0.431
 sudden+0.404
amed+0.397
Meanwhile+0.377
aming+0.354
Neg logit contributions
 I-0.379
 give-0.374
 thank-0.349
 Sorry-0.344
 leave-0.337
TokenI
Token ID40
Feature activation+1.13e+00

Pos logit contributions
side+1.696
 sudden+1.621
amed+1.509
Meanwhile+1.465
 distance+1.411
Neg logit contributions
 I-2.109
 give-1.999
 Sorry-1.988
 thank-1.884
 share-1.872
Token'm
Token ID1101
Feature activation+2.47e+00

Pos logit contributions
 I+1.704
 give+1.639
 Sorry+1.575
 leave+1.504
 thank+1.486
Neg logit contributions
side-2.695
 sudden-2.572
amed-2.469
Meanwhile-2.389
 distance-2.334
Token sorry
Token ID7926
Feature activation+3.87e+00

Pos logit contributions
 give+4.840
 I+4.794
 Sorry+4.727
 leave+4.500
 ask+4.384
Neg logit contributions
side-4.652
amed-4.529
 sudden-4.415
Meanwhile-4.138
aming-4.115
Token,
Token ID11
Feature activation+1.81e+00

Pos logit contributions
 Sorry+6.894
 give+6.789
 belong+6.605
 leave+6.223
 share+6.100
Neg logit contributions
 sudden-7.830
side-7.400
Meanwhile-6.942
J-6.828
 nearby-6.796
Token Lily
Token ID20037
Feature activation+1.54e+00

Pos logit contributions
 give+2.887
 I+2.872
 Sorry+2.801
 leave+2.640
 ask+2.552
Neg logit contributions
side-3.892
 sudden-3.765
amed-3.550
Meanwhile-3.518
 distance-3.423
Token.
Token ID13
Feature activation+4.71e+00

Pos logit contributions
 give+3.083
 I+3.083
 Sorry+3.045
 leave+2.891
 thank+2.866
Neg logit contributions
side-2.785
 sudden-2.589
amed-2.494
Meanwhile-2.374
 distance-2.298
Token I
Token ID314
Feature activation+9.52e-01

Pos logit contributions
 give+7.677
 leave+7.302
 ask+6.505
 share+6.472
 lose+6.417
Neg logit contributions
side-12.065
 sudden-10.659
The-10.448
amed-10.375
Meanwhile-10.215
Token'm
Token ID1101
Feature activation+2.27e+00

Pos logit contributions
 I+1.775
 give+1.735
 Sorry+1.662
 leave+1.616
 thank+1.615
Neg logit contributions
side-1.922
 sudden-1.829
amed-1.737
Meanwhile-1.671
 distance-1.612
Token sorry
Token ID7926
Feature activation+3.69e+00

Pos logit contributions
 give+3.769
 Sorry+3.706
 I+3.685
 leave+3.472
 thank+3.365
Neg logit contributions
side-4.901
 sudden-4.812
amed-4.778
Meanwhile-4.503
aming-4.405
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 Sorry+6.988
 give+6.896
 belong+6.520
 leave+6.407
 share+6.142
Neg logit contributions
 sudden-6.972
side-6.632
Meanwhile-6.324
 nearby-6.190
The-6.075
Token Ben
Token ID3932
Feature activation+1.93e+00

Pos logit contributions
 give+2.242
 I+2.222
 Sorry+2.160
 leave+2.034
 thank+1.995
Neg logit contributions
side-3.429
 sudden-3.305
amed-3.129
Meanwhile-3.085
 distance-3.031
Token I
Token ID314
Feature activation+3.35e+00

Pos logit contributions
 give+2.271
 Sorry+2.002
 thank+1.932
 leave+1.925
 ask+1.749
Neg logit contributions
side-9.256
 sudden-9.080
 nowhere-8.729
Meanwhile-8.671
The-8.544
Token have
Token ID423
Feature activation+2.16e+00

Pos logit contributions
 Sorry+4.379
 I+4.121
 give+3.707
 thank+3.698
 sorry+3.674
Neg logit contributions
 nearby-7.961
side-7.936
 sudden-7.712
 distance-7.709
 corner-7.647
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
 I+4.483
 Sorry+4.424
 give+4.357
 leave+4.175
 thank+4.129
Neg logit contributions
side-3.728
 sudden-3.677
amed-3.275
Meanwhile-3.168
 nowhere-3.092
Token lemon
Token ID18873
Feature activation+1.41e+00

Pos logit contributions
 I+2.455
 give+2.393
 Sorry+2.324
 leave+2.259
 thank+2.237
Neg logit contributions
side-2.139
 sudden-2.060
amed-1.891
Meanwhile-1.825
 distance-1.781
Token,
Token ID11
Feature activation+1.92e+00

Pos logit contributions
 I+2.780
 give+2.755
 Sorry+2.683
 leave+2.595
 thank+2.581
Neg logit contributions
side-2.639
 sudden-2.413
amed-2.339
Meanwhile-2.232
 distance-2.164
Token please
Token ID3387
Feature activation+4.69e+00

Pos logit contributions
 give+0.993
 Sorry+0.965
 I+0.951
 leave+0.752
 thank+0.688
Neg logit contributions
side-6.085
 sudden-5.992
Meanwhile-5.729
amed-5.702
 nearby-5.613
Token?"
Token ID1701
Feature activation+8.39e-01

Pos logit contributions
 Sorry+5.956
 seem+5.292
 belong+5.217
 leave+5.124
 aren+5.006
Neg logit contributions
 nearby-11.775
side-11.298
 corner-11.165
ep-11.101
J-10.918
Token\n
Token ID198
Feature activation-9.75e-01

Pos logit contributions
 I+1.570
 give+1.550
 Sorry+1.472
 leave+1.438
 thank+1.437
Neg logit contributions
side-1.691
 sudden-1.606
amed-1.518
Meanwhile-1.468
 distance-1.415
Token\n
Token ID198
Feature activation-2.88e-01

Pos logit contributions
side+1.861
 sudden+1.740
amed+1.708
Meanwhile+1.575
 distance+1.547
Neg logit contributions
 I-2.014
 give-1.928
 Sorry-1.826
 thank-1.822
 ask-1.796
TokenHis
Token ID6653
Feature activation+1.02e-01

Pos logit contributions
side+0.336
 sudden+0.304
amed+0.282
Meanwhile+0.254
 distance+0.249
Neg logit contributions
 I-0.788
 give-0.774
 Sorry-0.752
 thank-0.734
 share-0.731
Token mom
Token ID1995
Feature activation-4.14e-01

Pos logit contributions
 I+0.152
 give+0.151
 Sorry+0.140
 thank+0.139
 leave+0.138
Neg logit contributions
side-0.249
 sudden-0.233
amed-0.226
Meanwhile-0.219
 distance-0.212
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
 give+4.473
 Sorry+4.349
 leave+4.314
 I+4.278
 ask+4.166
Neg logit contributions
side-6.936
 sudden-6.702
Meanwhile-5.960
J-5.879
amed-5.867
Token "
Token ID366
Feature activation-2.58e+00

Pos logit contributions
 give+2.416
 I+2.382
 Sorry+2.319
 leave+2.279
 thank+2.202
Neg logit contributions
side-3.048
 sudden-2.924
amed-2.728
Meanwhile-2.688
J-2.556
TokenL
Token ID43
Feature activation-4.84e-02

Pos logit contributions
side+3.925
 sudden+3.633
amed+3.628
eled+3.131
 depths+3.128
Neg logit contributions
 I-6.583
 give-6.028
 Sorry-5.843
 ask-5.776
 share-5.692
Tokenily
Token ID813
Feature activation+3.06e+00

Pos logit contributions
side+0.090
 sudden+0.087
amed+0.080
Meanwhile+0.077
 distance+0.077
Neg logit contributions
 I-0.099
 give-0.097
 Sorry-0.090
 leave-0.089
 thank-0.089
Token,
Token ID11
Feature activation+3.12e+00

Pos logit contributions
 Sorry+4.546
 give+4.460
 I+4.419
 leave+4.216
 thank+4.092
Neg logit contributions
side-7.213
 sudden-6.841
amed-6.644
Meanwhile-6.253
 nearby-6.157
Token will
Token ID481
Feature activation+4.68e+00

Pos logit contributions
 Sorry+4.944
 give+4.629
 leave+4.467
 I+4.460
 thank+4.351
Neg logit contributions
side-6.722
 sudden-6.704
 nearby-6.046
Meanwhile-6.034
 nowhere-5.886
Token you
Token ID345
Feature activation+2.45e+00

Pos logit contributions
 I+5.890
 Sorry+5.837
 leave+5.708
 give+5.679
 thank+5.622
Neg logit contributions
 sudden-11.839
side-11.726
The-10.963
J-10.456
At-10.211
Token help
Token ID1037
Feature activation+1.92e+00

Pos logit contributions
 Sorry+2.746
 I+2.657
 give+2.333
 leave+2.196
 thank+2.173
Neg logit contributions
side-6.779
 sudden-6.463
amed-6.148
 nearby-6.082
Meanwhile-6.031
Token me
Token ID502
Feature activation+2.50e+00

Pos logit contributions
 I+3.916
 Sorry+3.876
 give+3.802
 thank+3.726
 leave+3.601
Neg logit contributions
side-3.429
 sudden-3.383
amed-3.056
Meanwhile-2.992
J-2.864
Token make
Token ID787
Feature activation+1.23e+00

Pos logit contributions
 Sorry+3.934
 I+3.851
 thank+3.498
 give+3.448
 leave+3.240
Neg logit contributions
side-5.783
 sudden-5.616
amed-5.350
 nearby-5.226
Meanwhile-5.144
Token lemon
Token ID18873
Feature activation+4.60e-01

Pos logit contributions
 I+2.481
 give+2.429
 Sorry+2.369
 leave+2.288
 thank+2.288
Neg logit contributions
side-2.257
 sudden-2.171
amed-2.035
Meanwhile-1.947
J-1.846
TokenI
Token ID40
Feature activation+1.62e+00

Pos logit contributions
side+0.554
 sudden+0.513
amed+0.501
Meanwhile+0.458
 distance+0.446
Neg logit contributions
 I-0.794
 give-0.762
 Sorry-0.747
 thank-0.716
 share-0.704
Token'm
Token ID1101
Feature activation+2.98e+00

Pos logit contributions
 I+2.301
 give+2.155
 Sorry+2.101
 leave+2.003
 thank+1.971
Neg logit contributions
side-3.960
 sudden-3.831
amed-3.621
Meanwhile-3.551
J-3.504
Token sorry
Token ID7926
Feature activation+3.86e+00

Pos logit contributions
 give+2.234
 Sorry+2.110
 I+1.952
 leave+1.855
 ask+1.644
Neg logit contributions
 sudden-9.020
amed-8.947
side-8.946
Meanwhile-8.669
 nowhere-8.605
Token,
Token ID11
Feature activation+2.36e+00

Pos logit contributions
 Sorry+7.269
 give+7.022
 belong+6.779
 leave+6.643
 share+6.438
Neg logit contributions
 sudden-7.238
side-6.797
J-6.723
Meanwhile-6.433
 nearby-6.421
Token Ben
Token ID3932
Feature activation+2.16e+00

Pos logit contributions
 Sorry+3.335
 give+3.314
 I+3.160
 leave+3.022
 share+2.960
Neg logit contributions
side-5.301
 sudden-5.276
Meanwhile-4.964
 nowhere-4.903
J-4.861
Token.
Token ID13
Feature activation+4.68e+00

Pos logit contributions
 I+4.375
 Sorry+4.364
 give+4.359
 leave+4.135
 thank+4.064
Neg logit contributions
side-3.750
 sudden-3.512
amed-3.325
 nearby-3.173
 distance-3.155
Token I
Token ID314
Feature activation+1.11e+00

Pos logit contributions
 give+5.393
 leave+5.184
 ask+4.359
 share+4.315
 listen+4.271
Neg logit contributions
side-13.256
 sudden-12.382
The-12.044
Meanwhile-11.959
 corner-11.924
Token was
Token ID373
Feature activation+1.69e+00

Pos logit contributions
 I+2.245
 give+2.187
 Sorry+2.109
 leave+2.056
 thank+2.054
Neg logit contributions
side-2.062
 sudden-1.964
amed-1.849
Meanwhile-1.775
 distance-1.701
Token mean
Token ID1612
Feature activation+1.17e+00

Pos logit contributions
 give+2.953
 I+2.929
 Sorry+2.902
 leave+2.773
 thank+2.746
Neg logit contributions
side-3.425
 sudden-3.383
amed-3.255
Meanwhile-3.072
 nowhere-3.015
Token.
Token ID13
Feature activation+4.04e+00

Pos logit contributions
 give+2.205
 I+2.203
 Sorry+2.135
 thank+2.052
 leave+2.044
Neg logit contributions
side-2.292
 sudden-2.173
amed-2.063
Meanwhile-1.991
 distance-1.909
Token I
Token ID314
Feature activation+1.26e+00

Pos logit contributions
 give+4.693
 leave+4.513
 belong+3.904
 share+3.791
 ask+3.628
Neg logit contributions
side-10.998
 sudden-10.511
Meanwhile-10.237
J-10.168
The-10.047
Token I
Token ID314
Feature activation+3.56e+00

Pos logit contributions
 give+1.479
 Sorry+1.155
 leave+1.148
 thank+1.097
 ask+0.941
Neg logit contributions
side-9.558
 sudden-9.319
 nowhere-8.974
Meanwhile-8.946
 nearby-8.780
Token have
Token ID423
Feature activation+1.99e+00

Pos logit contributions
 Sorry+5.560
 I+4.952
 sorry+4.895
 give+4.695
 thank+4.647
Neg logit contributions
side-8.012
 nearby-7.953
 sudden-7.800
 corner-7.633
 distance-7.538
Token the
Token ID262
Feature activation+6.99e-01

Pos logit contributions
 I+3.981
 Sorry+3.972
 give+3.960
 leave+3.767
 thank+3.676
Neg logit contributions
side-3.611
 sudden-3.521
amed-3.123
Meanwhile-3.077
 nowhere-3.026
Token apple
Token ID17180
Feature activation+1.62e+00

Pos logit contributions
 I+1.367
 give+1.350
 Sorry+1.286
 thank+1.256
 leave+1.254
Neg logit contributions
side-1.345
 sudden-1.273
amed-1.207
Meanwhile-1.160
 distance-1.117
Token,
Token ID11
Feature activation+1.86e+00

Pos logit contributions
 give+2.139
 I+2.102
 Sorry+2.039
 thank+1.952
 leave+1.914
Neg logit contributions
side-4.042
 sudden-3.782
amed-3.709
Meanwhile-3.577
 distance-3.548
Token please
Token ID3387
Feature activation+4.66e+00

Pos logit contributions
 give+1.897
 I+1.833
 Sorry+1.828
 leave+1.656
 thank+1.589
Neg logit contributions
side-5.001
 sudden-4.884
amed-4.654
Meanwhile-4.627
 distance-4.519
Token?"
Token ID1701
Feature activation+5.64e-01

Pos logit contributions
 Sorry+5.901
 seem+5.150
 belong+5.147
 leave+4.998
 sorry+4.971
Neg logit contributions
 nearby-11.898
side-11.434
 corner-11.363
ep-10.976
 nowhere-10.874
Token His
Token ID2399
Feature activation+6.38e-01

Pos logit contributions
 I+0.900
 give+0.886
 Sorry+0.835
 thank+0.810
 leave+0.806
Neg logit contributions
side-1.294
 sudden-1.236
amed-1.185
Meanwhile-1.144
 distance-1.109
Token mom
Token ID1995
Feature activation+5.76e-01

Pos logit contributions
 I+0.957
 give+0.942
 Sorry+0.883
 thank+0.856
 leave+0.854
Neg logit contributions
side-1.533
 sudden-1.463
amed-1.404
Meanwhile-1.362
 distance-1.322
Token said
Token ID531
Feature activation+1.63e+00

Pos logit contributions
 I+1.089
 give+1.077
 Sorry+1.019
 thank+0.999
 leave+0.995
Neg logit contributions
side-1.153
 sudden-1.091
amed-1.034
Meanwhile-0.997
 distance-0.965
Token,
Token ID11
Feature activation+4.36e-01

Pos logit contributions
 I+2.026
 give+2.019
 Sorry+1.970
 leave+1.843
 ask+1.762
Neg logit contributions
side-4.248
 sudden-4.061
amed-3.911
Meanwhile-3.769
J-3.688
Token the
Token ID262
Feature activation+4.86e-01

Pos logit contributions
 I+3.158
 give+3.129
 Sorry+3.121
 leave+2.970
 thank+2.906
Neg logit contributions
side-3.370
 sudden-3.258
amed-2.958
Meanwhile-2.905
 distance-2.791
Token l
Token ID300
Feature activation+1.12e+00

Pos logit contributions
 I+0.990
 give+0.978
 Sorry+0.925
 thank+0.912
 leave+0.905
Neg logit contributions
side-0.903
 sudden-0.849
amed-0.809
Meanwhile-0.773
 distance-0.739
Tokenoll
Token ID692
Feature activation+2.47e+00

Pos logit contributions
 I+1.991
 give+1.980
 Sorry+1.910
 leave+1.846
 thank+1.828
Neg logit contributions
side-2.305
 sudden-2.174
amed-2.099
Meanwhile-2.018
 distance-1.929
Tokenipop
Token ID42800
Feature activation+2.26e+00

Pos logit contributions
 give+4.643
 I+4.637
 Sorry+4.585
 leave+4.460
 ask+4.278
Neg logit contributions
side-4.814
 sudden-4.412
amed-4.215
Meanwhile-3.984
J-3.975
Token,
Token ID11
Feature activation+2.21e+00

Pos logit contributions
 give+3.037
 I+2.938
 Sorry+2.937
 thank+2.752
 leave+2.748
Neg logit contributions
side-5.610
amed-5.103
 sudden-5.101
 nearby-4.878
Meanwhile-4.869
Token please
Token ID3387
Feature activation+4.65e+00

Pos logit contributions
 give+1.590
 Sorry+1.526
 I+1.477
 leave+1.328
 thank+1.220
Neg logit contributions
side-6.502
 sudden-6.439
amed-6.128
Meanwhile-6.114
 nearby-6.055
Token?"
Token ID1701
Feature activation+5.93e-01

Pos logit contributions
 Sorry+5.664
 seem+5.350
 belong+5.186
 sorry+5.084
 leave+5.027
Neg logit contributions
 nearby-11.625
side-11.611
 corner-11.200
ep-11.128
J-10.885
Token His
Token ID2399
Feature activation+7.22e-01

Pos logit contributions
 I+1.275
 give+1.260
 Sorry+1.207
 thank+1.180
 leave+1.177
Neg logit contributions
side-1.032
 sudden-0.968
amed-0.915
Meanwhile-0.874
 distance-0.837
Token mom
Token ID1995
Feature activation+5.96e-01

Pos logit contributions
 I+1.092
 give+1.074
 Sorry+1.008
 thank+0.976
 leave+0.975
Neg logit contributions
side-1.715
 sudden-1.641
amed-1.572
Meanwhile-1.524
 distance-1.479
Token said
Token ID531
Feature activation+1.74e+00

Pos logit contributions
 I+1.132
 give+1.119
 Sorry+1.059
 thank+1.039
 leave+1.034
Neg logit contributions
side-1.186
 sudden-1.122
amed-1.064
Meanwhile-1.025
 distance-0.991
Token,
Token ID11
Feature activation+6.76e-01

Pos logit contributions
 give+2.196
 I+2.189
 Sorry+2.147
 leave+2.017
 ask+1.940
Neg logit contributions
side-4.458
 sudden-4.262
amed-4.090
Meanwhile-3.938
J-3.881
Token'm
Token ID1101
Feature activation+3.08e+00

Pos logit contributions
 I+2.730
 give+2.546
 Sorry+2.521
 leave+2.369
 thank+2.314
Neg logit contributions
side-4.301
 sudden-4.134
amed-3.909
Meanwhile-3.840
J-3.790
Token sorry
Token ID7926
Feature activation+4.12e+00

Pos logit contributions
 give+3.989
 Sorry+3.938
 leave+3.600
 I+3.590
 share+3.310
Neg logit contributions
side-7.745
 sudden-7.702
amed-7.700
Meanwhile-7.487
aming-7.295
Token,
Token ID11
Feature activation+2.07e+00

Pos logit contributions
 Sorry+7.831
 give+7.581
 belong+7.329
 leave+7.039
 share+6.964
Neg logit contributions
 sudden-7.979
side-7.352
 corner-7.135
 nearby-7.092
Meanwhile-7.079
Token Mom
Token ID11254
Feature activation+1.75e+00

Pos logit contributions
 give+3.161
 Sorry+3.084
 I+3.034
 leave+2.887
 share+2.803
Neg logit contributions
side-4.504
 sudden-4.454
amed-4.151
Meanwhile-4.150
 distance-4.036
Tokenmy
Token ID1820
Feature activation+2.56e+00

Pos logit contributions
 give+4.414
 I+4.410
 Sorry+4.387
 thank+4.215
 leave+4.174
Neg logit contributions
side-2.269
 sudden-2.034
amed-1.919
Meanwhile-1.792
 distance-1.715
Token.
Token ID13
Feature activation+4.63e+00

Pos logit contributions
 Sorry+5.557
 give+5.425
 I+5.373
 leave+5.115
 thank+5.062
Neg logit contributions
side-4.259
 sudden-3.918
amed-3.613
 nearby-3.603
Meanwhile-3.464
Token I
Token ID314
Feature activation+1.34e+00

Pos logit contributions
 give+5.511
 leave+5.120
 share+4.448
 lose+4.241
 ask+4.234
Neg logit contributions
side-13.242
 sudden-12.653
The-12.259
 corner-12.212
 distance-12.001
Token won
Token ID1839
Feature activation+2.02e+00

Pos logit contributions
 I+2.361
 give+2.271
 Sorry+2.198
 leave+2.122
 thank+2.115
Neg logit contributions
side-2.829
 sudden-2.719
amed-2.572
Meanwhile-2.488
 distance-2.406
Token't
Token ID470
Feature activation+2.99e+00

Pos logit contributions
 give+2.638
 I+2.628
 Sorry+2.572
 thank+2.490
 leave+2.466
Neg logit contributions
side-5.063
 sudden-4.947
amed-4.665
Meanwhile-4.608
 nowhere-4.426
Token touch
Token ID3638
Feature activation+4.74e-01

Pos logit contributions
 I+4.786
 Sorry+4.759
 thank+4.245
 give+4.243
 sorry+4.056
Neg logit contributions
 sudden-6.711
side-6.533
 distance-6.153
 nearby-6.109
Meanwhile-6.001
Token insects
Token ID19435
Feature activation+9.02e-01

Pos logit contributions
 I+0.750
 give+0.733
 Sorry+0.686
 leave+0.669
 thank+0.666
Neg logit contributions
side-1.097
 sudden-1.045
amed-1.008
Meanwhile-0.975
 distance-0.940
Token'm
Token ID1101
Feature activation+2.84e+00

Pos logit contributions
 I+2.457
 give+2.320
 Sorry+2.268
 leave+2.161
 thank+2.108
Neg logit contributions
side-3.928
 sudden-3.802
amed-3.580
Meanwhile-3.516
J-3.445
Token sorry
Token ID7926
Feature activation+3.99e+00

Pos logit contributions
 give+2.656
 Sorry+2.528
 I+2.382
 leave+2.257
 share+2.122
Neg logit contributions
side-8.217
 sudden-8.208
amed-8.061
Meanwhile-7.855
 nowhere-7.727
Token,
Token ID11
Feature activation+2.20e+00

Pos logit contributions
 Sorry+7.306
 give+7.061
 belong+6.910
 leave+6.556
 share+6.515
Neg logit contributions
 sudden-8.041
side-7.475
Meanwhile-7.163
 nearby-7.049
 corner-6.999
Token Mom
Token ID11254
Feature activation+1.55e+00

Pos logit contributions
 give+3.591
 Sorry+3.490
 I+3.425
 share+3.282
 leave+3.274
Neg logit contributions
side-4.597
 sudden-4.544
Meanwhile-4.261
amed-4.117
 distance-4.098
Tokenmy
Token ID1820
Feature activation+2.61e+00

Pos logit contributions
 I+3.869
 give+3.861
 Sorry+3.806
 thank+3.682
 leave+3.648
Neg logit contributions
side-2.056
 sudden-1.866
amed-1.758
Meanwhile-1.652
 distance-1.560
Token.
Token ID13
Feature activation+4.63e+00

Pos logit contributions
 Sorry+5.642
 give+5.512
 I+5.471
 leave+5.189
 thank+5.139
Neg logit contributions
side-4.332
 sudden-4.024
amed-3.728
 nearby-3.671
Meanwhile-3.576
Token I
Token ID314
Feature activation+1.30e+00

Pos logit contributions
 give+5.740
 leave+5.277
 share+4.851
 ask+4.578
 belong+4.497
Neg logit contributions
side-13.335
 sudden-12.556
 corner-12.135
The-11.892
Meanwhile-11.858
Token was
Token ID373
Feature activation+1.82e+00

Pos logit contributions
 I+2.437
 give+2.357
 Sorry+2.282
 leave+2.212
 thank+2.203
Neg logit contributions
side-2.599
 sudden-2.493
amed-2.345
Meanwhile-2.265
 distance-2.189
Token naughty
Token ID45128
Feature activation+1.33e+00

Pos logit contributions
 give+3.256
 I+3.201
 Sorry+3.201
 leave+3.045
 thank+3.029
Neg logit contributions
side-3.660
 sudden-3.620
amed-3.448
Meanwhile-3.270
 nowhere-3.219
Token.
Token ID13
Feature activation+4.30e+00

Pos logit contributions
 give+2.643
 I+2.640
 Sorry+2.573
 thank+2.464
 leave+2.461
Neg logit contributions
side-2.460
 sudden-2.323
amed-2.182
Meanwhile-2.105
 distance-2.021
Token I
Token ID314
Feature activation+1.27e+00

Pos logit contributions
 give+5.473
 leave+5.182
 share+4.611
 belong+4.548
 lose+4.343
Neg logit contributions
side-11.676
 sudden-11.196
 corner-10.784
Meanwhile-10.778
The-10.635
Token I
Token ID314
Feature activation+1.46e+00

Pos logit contributions
 give+2.182
 I+2.145
 Sorry+2.124
 leave+1.994
 ask+1.935
Neg logit contributions
side-3.866
 sudden-3.759
amed-3.527
Meanwhile-3.514
 distance-3.446
Token was
Token ID373
Feature activation+1.56e+00

Pos logit contributions
 I+2.822
 give+2.736
 Sorry+2.644
 thank+2.586
 leave+2.579
Neg logit contributions
side-2.790
 sudden-2.736
amed-2.532
Meanwhile-2.441
J-2.364
Token naughty
Token ID45128
Feature activation+9.61e-01

Pos logit contributions
 give+2.765
 I+2.747
 Sorry+2.712
 leave+2.586
 thank+2.580
Neg logit contributions
side-3.105
 sudden-3.091
amed-2.902
Meanwhile-2.772
 nowhere-2.719
Token.
Token ID13
Feature activation+3.85e+00

Pos logit contributions
 I+1.750
 give+1.739
 Sorry+1.667
 leave+1.611
 thank+1.608
Neg logit contributions
side-1.952
 sudden-1.853
amed-1.759
Meanwhile-1.700
 distance-1.644
Token I
Token ID314
Feature activation+1.37e+00

Pos logit contributions
 give+4.765
 leave+4.476
 share+4.058
 belong+3.973
 ask+3.940
Neg logit contributions
side-9.786
 sudden-9.642
Meanwhile-9.251
 corner-9.100
Ah-9.093
Token should
Token ID815
Feature activation+4.62e+00

Pos logit contributions
 I+2.527
 give+2.445
 Sorry+2.369
 thank+2.300
 leave+2.293
Neg logit contributions
side-2.743
 sudden-2.665
amed-2.500
Meanwhile-2.406
 distance-2.324
Token have
Token ID423
Feature activation+8.92e-01

Pos logit contributions
 Sorry+5.674
 I+5.066
 give+4.953
 thank+4.879
 leave+4.778
Neg logit contributions
 sudden-11.245
side-10.582
 nowhere-10.540
 nearby-10.536
The-10.304
Token stopped
Token ID5025
Feature activation+1.33e+00

Pos logit contributions
 I+1.659
 give+1.632
 Sorry+1.556
 thank+1.517
 leave+1.513
Neg logit contributions
side-1.798
 sudden-1.717
amed-1.634
Meanwhile-1.569
 distance-1.509
Token."
Token ID526
Feature activation+4.07e-03

Pos logit contributions
 I+1.661
 give+1.658
 Sorry+1.589
 thank+1.515
 leave+1.480
Neg logit contributions
side-3.384
 sudden-3.317
amed-3.131
Meanwhile-3.071
 distance-2.971
Token\n
Token ID198
Feature activation-1.54e+00

Pos logit contributions
 I+0.009
 give+0.008
 Sorry+0.008
 thank+0.008
 leave+0.008
Neg logit contributions
side-0.007
 sudden-0.007
amed-0.007
Meanwhile-0.006
aming-0.006
Token\n
Token ID198
Feature activation-1.83e+00

Pos logit contributions
side+3.212
amed+2.914
 sudden+2.885
 distance+2.667
aming+2.654
Neg logit contributions
 I-3.089
 give-2.936
 thank-2.812
 Sorry-2.733
 ask-2.731
TokenL
Token ID43
Feature activation-7.47e-02

Pos logit contributions
side+2.549
 distance+2.199
 sudden+2.194
amed+2.168
eled+2.156
Neg logit contributions
 I-5.335
 give-5.027
 Sorry-4.934
 share-4.895
 thank-4.728
Tokenily
Token ID813
Feature activation-9.29e-01

Pos logit contributions
side+0.143
 sudden+0.137
amed+0.122
 distance+0.120
Meanwhile+0.119
Neg logit contributions
 I-0.153
 give-0.149
 Sorry-0.139
 thank-0.138
 leave-0.138
Token didn
Token ID1422
Feature activation+1.49e+00

Pos logit contributions
side+1.977
 sudden+1.773
amed+1.664
 distance+1.653
Meanwhile+1.624
Neg logit contributions
 give-1.750
 I-1.702
 leave-1.595
 thank-1.577
 ask-1.562
Token't
Token ID470
Feature activation+3.32e+00

Pos logit contributions
 I+1.858
 give+1.826
 Sorry+1.762
 leave+1.646
 thank+1.641
Neg logit contributions
side-3.888
 sudden-3.704
amed-3.590
Meanwhile-3.536
 distance-3.355
Token want
Token ID765
Feature activation+1.11e+00

Pos logit contributions
 Sorry+4.914
 I+4.742
 we+3.706
 give+3.636
 leave+3.567
Neg logit contributions
side-8.065
amed-7.938
 distance-7.846
J-7.843
 nearby-7.757
Token to
Token ID284
Feature activation+4.62e+00

Pos logit contributions
 I+1.858
 give+1.838
 Sorry+1.784
 thank+1.700
 leave+1.692
Neg logit contributions
side-2.413
 sudden-2.297
amed-2.174
Meanwhile-2.123
 distance-2.064
Token leave
Token ID2666
Feature activation+5.86e-01

Pos logit contributions
 Sorry+8.365
 I+7.975
 thank+6.948
 sorry+6.589
 give+6.559
Neg logit contributions
 sudden-10.093
 nearby-9.767
B-9.704
 distance-9.671
side-9.500
Token her
Token ID607
Feature activation-5.25e-01

Pos logit contributions
 I+0.849
 give+0.832
 Sorry+0.776
 leave+0.753
 thank+0.753
Neg logit contributions
side-1.432
 sudden-1.372
amed-1.318
Meanwhile-1.279
 distance-1.237
Token friend
Token ID1545
Feature activation-2.73e-01

Pos logit contributions
side+1.032
amed+0.938
 sudden+0.932
Meanwhile+0.890
J+0.856
Neg logit contributions
 I-1.054
 give-1.049
 leave-0.995
 thank-0.962
 Sorry-0.949
Token alone
Token ID3436
Feature activation-1.52e-01

Pos logit contributions
side+0.693
 sudden+0.662
amed+0.640
Meanwhile+0.629
J+0.600
Neg logit contributions
 I-0.382
 give-0.381
 leave-0.349
 thank-0.332
 Sorry-0.327
Token,
Token ID11
Feature activation-7.83e-01

Pos logit contributions
side+0.364
 sudden+0.348
amed+0.337
Meanwhile+0.328
 distance+0.312
Neg logit contributions
 I-0.233
 give-0.226
 leave-0.206
 thank-0.202
 Sorry-0.202
TokenI
Token ID40
Feature activation+1.92e+00

Pos logit contributions
 I+1.378
 give+1.357
 Sorry+1.311
 thank+1.280
 leave+1.276
Neg logit contributions
side-0.856
 sudden-0.793
amed-0.744
Meanwhile-0.702
 distance-0.668
Token'm
Token ID1101
Feature activation+3.20e+00

Pos logit contributions
 I+2.837
 give+2.661
 Sorry+2.596
 leave+2.492
 thank+2.442
Neg logit contributions
side-4.587
 sudden-4.505
amed-4.203
J-4.133
Meanwhile-4.116
Token sorry
Token ID7926
Feature activation+4.07e+00

Pos logit contributions
 give+3.111
 Sorry+2.831
 leave+2.669
 I+2.572
 share+2.438
Neg logit contributions
 sudden-9.175
amed-9.096
side-9.072
 nowhere-8.855
Meanwhile-8.812
Token,
Token ID11
Feature activation+2.65e+00

Pos logit contributions
 Sorry+7.648
 give+7.332
 belong+7.252
 leave+6.894
 share+6.762
Neg logit contributions
 sudden-7.977
side-7.296
J-7.206
Meanwhile-7.049
 nearby-7.004
Token mom
Token ID1995
Feature activation+1.75e+00

Pos logit contributions
 Sorry+4.583
 give+4.569
 I+4.350
 leave+4.260
 share+4.201
Neg logit contributions
 sudden-5.125
side-5.114
 nowhere-4.761
Meanwhile-4.723
 distance-4.613
Token.
Token ID13
Feature activation+4.61e+00

Pos logit contributions
 Sorry+3.448
 give+3.441
 I+3.439
 thank+3.233
 leave+3.220
Neg logit contributions
side-3.236
 sudden-3.022
amed-2.873
Meanwhile-2.770
J-2.704
Token I
Token ID314
Feature activation+1.49e+00

Pos logit contributions
 give+4.281
 leave+4.023
 ask+3.175
 share+3.143
 listen+2.835
Neg logit contributions
side-14.219
 sudden-13.638
 corner-13.414
 distance-13.082
 nowhere-13.067
Token wanted
Token ID2227
Feature activation+8.79e-01

Pos logit contributions
 I+3.053
 give+2.955
 Sorry+2.863
 leave+2.802
 thank+2.787
Neg logit contributions
side-2.713
 sudden-2.622
amed-2.438
Meanwhile-2.339
J-2.259
Token to
Token ID284
Feature activation+2.95e+00

Pos logit contributions
 I+1.469
 give+1.453
 Sorry+1.381
 thank+1.335
 leave+1.334
Neg logit contributions
side-1.926
 sudden-1.839
amed-1.743
Meanwhile-1.692
 distance-1.643
Token taste
Token ID6938
Feature activation+9.23e-01

Pos logit contributions
 I+3.733
 Sorry+3.645
 give+3.346
 thank+3.134
 sorry+3.102
Neg logit contributions
side-7.218
 sudden-7.213
 distance-6.840
 nearby-6.667
amed-6.643
Token the
Token ID262
Feature activation+1.77e-01

Pos logit contributions
 I+1.597
 give+1.575
 Sorry+1.506
 leave+1.453
 thank+1.449
Neg logit contributions
side-1.962
 sudden-1.872
amed-1.773
Meanwhile-1.719
 distance-1.667
Token Lily
Token ID20037
Feature activation+1.85e+00

Pos logit contributions
 Sorry+4.191
 give+4.077
 I+3.955
 leave+3.897
 ask+3.800
Neg logit contributions
side-6.203
 sudden-6.167
J-5.818
 nowhere-5.811
 nearby-5.738
Token.
Token ID13
Feature activation+4.50e+00

Pos logit contributions
 give+3.195
 I+3.141
 Sorry+3.104
 leave+2.972
 thank+2.904
Neg logit contributions
side-3.847
 sudden-3.644
amed-3.518
Meanwhile-3.337
J-3.300
Token You
Token ID921
Feature activation+2.71e+00

Pos logit contributions
 give+6.793
 leave+6.614
 ask+6.124
 listen+5.981
 thank+5.926
Neg logit contributions
side-11.283
amed-10.019
J-9.800
 sudden-9.746
 corner-9.609
Token did
Token ID750
Feature activation+2.57e+00

Pos logit contributions
 I+4.734
 Sorry+4.565
 give+4.430
 leave+4.265
 thank+4.198
Neg logit contributions
side-5.646
 sudden-5.493
amed-5.439
Meanwhile-4.934
aming-4.931
Token it
Token ID340
Feature activation+8.62e-01

Pos logit contributions
 Sorry+4.788
 give+4.757
 I+4.753
 leave+4.558
 thank+4.420
Neg logit contributions
side-4.994
 sudden-4.870
 nowhere-4.281
amed-4.226
J-4.198
Token.
Token ID13
Feature activation+4.60e+00

Pos logit contributions
 I+1.517
 give+1.501
 Sorry+1.425
 thank+1.387
 leave+1.383
Neg logit contributions
side-1.824
 sudden-1.724
amed-1.653
Meanwhile-1.591
 distance-1.534
Token You
Token ID921
Feature activation+2.47e+00

Pos logit contributions
 leave+7.435
 give+7.415
 lose+6.642
 share+6.534
 thank+6.487
Neg logit contributions
side-10.752
amed-9.651
J-9.524
M-9.360
Now-9.318
Token ate
Token ID15063
Feature activation-1.60e-02

Pos logit contributions
 I+4.492
 Sorry+4.323
 give+4.217
 leave+4.058
 thank+4.033
Neg logit contributions
side-4.977
 sudden-4.785
amed-4.703
Meanwhile-4.305
 distance-4.249
Token all
Token ID477
Feature activation-1.21e-01

Pos logit contributions
side+0.041
 sudden+0.040
amed+0.039
Meanwhile+0.038
 distance+0.036
Neg logit contributions
 I-0.021
 give-0.020
 Sorry-0.019
 thank-0.018
 leave-0.018
Token your
Token ID534
Feature activation+2.12e-01

Pos logit contributions
side+0.286
 sudden+0.281
amed+0.264
Meanwhile+0.258
 distance+0.245
Neg logit contributions
 I-0.189
 give-0.181
 Sorry-0.166
 thank-0.165
 leave-0.162
Token soup
Token ID17141
Feature activation+3.27e-01

Pos logit contributions
 I+0.377
 give+0.372
 Sorry+0.347
 thank+0.344
 leave+0.340
Neg logit contributions
side-0.449
 sudden-0.421
amed-0.411
Meanwhile-0.397
 distance-0.381
Token sorry
Token ID7926
Feature activation+3.50e+00

Pos logit contributions
 give+2.649
 Sorry+2.505
 I+2.237
 leave+2.177
 thank+2.149
Neg logit contributions
 sudden-8.462
side-8.154
amed-8.095
 nowhere-8.065
Meanwhile-7.935
Token,
Token ID11
Feature activation+1.68e+00

Pos logit contributions
 give+5.835
 Sorry+5.823
 leave+5.389
 share+5.379
 belong+5.247
Neg logit contributions
 sudden-7.362
side-6.733
 nearby-6.675
Meanwhile-6.526
J-6.380
Token Sam
Token ID3409
Feature activation+3.69e-01

Pos logit contributions
 give+2.758
 Sorry+2.715
 I+2.704
 leave+2.526
 share+2.498
Neg logit contributions
side-3.487
 sudden-3.464
Meanwhile-3.212
amed-3.193
 distance-3.125
Token.
Token ID13
Feature activation+3.72e+00

Pos logit contributions
 I+0.507
 give+0.494
 Sorry+0.450
 thank+0.446
 leave+0.441
Neg logit contributions
side-0.937
 sudden-0.896
amed-0.860
Meanwhile-0.835
 distance-0.810
Token I
Token ID314
Feature activation+1.33e+00

Pos logit contributions
 give+5.389
 thank+4.777
 leave+4.774
 share+4.631
 ask+4.521
Neg logit contributions
side-9.083
 sudden-8.905
amed-8.173
 corner-8.152
 distance-8.108
Token should
Token ID815
Feature activation+4.60e+00

Pos logit contributions
 I+2.995
 give+2.925
 Sorry+2.836
 thank+2.784
 leave+2.777
Neg logit contributions
side-2.119
 sudden-2.045
amed-1.890
Meanwhile-1.795
 distance-1.716
Token not
Token ID407
Feature activation+3.07e+00

Pos logit contributions
 Sorry+6.416
 I+5.997
 give+5.886
 thank+5.820
 leave+5.603
Neg logit contributions
 sudden-10.632
side-10.162
The-9.993
 nearby-9.927
 nowhere-9.843
Token have
Token ID423
Feature activation+7.90e-01

Pos logit contributions
 I+3.782
 Sorry+3.709
 thank+3.504
 give+3.385
 sorry+3.177
Neg logit contributions
 sudden-7.421
side-7.391
 nearby-7.234
 distance-7.122
J-7.037
Token taken
Token ID2077
Feature activation+1.80e-01

Pos logit contributions
 I+1.502
 give+1.481
 Sorry+1.412
 thank+1.378
 leave+1.374
Neg logit contributions
side-1.561
 sudden-1.484
amed-1.408
Meanwhile-1.354
 distance-1.305
Token your
Token ID534
Feature activation+1.63e-01

Pos logit contributions
 I+0.274
 give+0.267
 Sorry+0.246
 thank+0.243
 leave+0.243
Neg logit contributions
side-0.433
 sudden-0.412
amed-0.396
Meanwhile-0.385
 distance-0.368
Token truck
Token ID7779
Feature activation-2.69e-01

Pos logit contributions
 give+0.331
 I+0.331
 Sorry+0.312
 thank+0.310
 leave+0.309
Neg logit contributions
side-0.304
 sudden-0.283
amed-0.274
Meanwhile-0.263
J-0.246
Token sorry
Token ID7926
Feature activation+3.42e+00

Pos logit contributions
 Sorry+2.128
 give+2.125
 I+1.983
 leave+1.826
 thank+1.709
Neg logit contributions
 sudden-6.926
side-6.849
amed-6.752
Meanwhile-6.620
 nowhere-6.421
Token,
Token ID11
Feature activation+1.89e+00

Pos logit contributions
 Sorry+5.849
 give+5.687
 leave+5.350
 share+5.212
 belong+5.178
Neg logit contributions
 sudden-7.122
side-6.660
 nearby-6.372
Meanwhile-6.362
J-6.209
Token Anna
Token ID11735
Feature activation+1.60e+00

Pos logit contributions
 Sorry+3.027
 give+3.019
 I+2.944
 leave+2.826
 share+2.733
Neg logit contributions
side-4.008
 sudden-3.969
Meanwhile-3.666
amed-3.580
 distance-3.506
Token.
Token ID13
Feature activation+3.98e+00

Pos logit contributions
 I+2.996
 give+2.995
 Sorry+2.941
 leave+2.806
 thank+2.767
Neg logit contributions
side-3.065
 sudden-2.937
amed-2.766
Meanwhile-2.660
 distance-2.606
Token I
Token ID314
Feature activation+9.74e-01

Pos logit contributions
 give+5.564
 leave+5.097
 share+4.681
 belong+4.629
 thank+4.552
Neg logit contributions
side-10.140
 sudden-9.768
Meanwhile-9.347
The-9.166
Then-9.045
Token should
Token ID815
Feature activation+4.60e+00

Pos logit contributions
 I+2.016
 give+1.978
 Sorry+1.902
 leave+1.856
 thank+1.856
Neg logit contributions
side-1.758
 sudden-1.673
amed-1.572
Meanwhile-1.510
 distance-1.444
Token have
Token ID423
Feature activation+4.28e-01

Pos logit contributions
 Sorry+6.649
 give+6.194
 I+6.027
 thank+5.971
 leave+5.945
Neg logit contributions
 sudden-10.773
side-9.936
The-9.885
 nearby-9.691
Meanwhile-9.629
Token listened
Token ID16399
Feature activation-5.88e-01

Pos logit contributions
 I+0.724
 give+0.716
 Sorry+0.674
 thank+0.655
 leave+0.655
Neg logit contributions
side-0.943
 sudden-0.890
amed-0.849
Meanwhile-0.823
 distance-0.799
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
side+1.446
 sudden+1.423
amed+1.338
Meanwhile+1.291
 distance+1.262
Neg logit contributions
 I-0.890
 give-0.840
 thank-0.767
 leave-0.752
 Sorry-0.738
Token you
Token ID345
Feature activation+4.17e-01

Pos logit contributions
 I+1.408
 give+1.407
 Sorry+1.325
 thank+1.247
 leave+1.245
Neg logit contributions
side-3.203
 sudden-3.110
amed-2.950
Meanwhile-2.903
 distance-2.827
Token,"
Token ID553
Feature activation+1.14e+00

Pos logit contributions
 I+0.659
 give+0.650
 Sorry+0.601
 leave+0.593
 thank+0.591
Neg logit contributions
side-0.963
 sudden-0.923
amed-0.885
Meanwhile-0.854
 distance-0.828
Token spent
Token ID3377
Feature activation-2.09e+00

Pos logit contributions
 I+1.096
 give+1.082
 Sorry+1.026
 thank+1.004
 leave+1.001
Neg logit contributions
side-1.181
 sudden-1.120
amed-1.065
Meanwhile-1.028
 distance-0.989
Token their
Token ID511
Feature activation-1.54e+00

Pos logit contributions
side+5.093
Meanwhile+4.842
 sudden+4.626
amed+4.590
ipt+4.442
Neg logit contributions
 give-3.206
 I-3.158
 share-3.082
 thank-3.049
 leave-2.902
Token time
Token ID640
Feature activation-8.07e-01

Pos logit contributions
side+3.174
Meanwhile+2.924
amed+2.891
 sudden+2.853
J+2.754
Neg logit contributions
 I-2.916
 give-2.901
 thank-2.778
 leave-2.732
 share-2.662
Token together
Token ID1978
Feature activation-6.09e-01

Pos logit contributions
side+1.579
 sudden+1.468
Meanwhile+1.404
amed+1.401
 distance+1.355
Neg logit contributions
 give-1.552
 I-1.533
 share-1.431
 thank-1.417
 leave-1.414
Token.
Token ID13
Feature activation-6.88e-01

Pos logit contributions
side+1.381
 sudden+1.349
amed+1.309
Meanwhile+1.272
J+1.200
Neg logit contributions
 give-0.997
 I-0.990
 thank-0.910
 leave-0.900
 share-0.880
Token\n
Token ID198
Feature activation-4.22e+00

Pos logit contributions
side+1.481
 sudden+1.440
amed+1.387
Meanwhile+1.306
aming+1.275
Neg logit contributions
 I-1.266
 give-1.171
 Sorry-1.137
 thank-1.094
 leave-1.061
Token\n
Token ID198
Feature activation-3.85e+00

Pos logit contributions
side+9.202
 sudden+8.291
amed+8.067
 distance+8.007
eled+7.790
Neg logit contributions
 I-9.732
 thank-8.998
 give-8.985
 share-8.728
 have-8.688
TokenOne
Token ID3198
Feature activation-1.21e+00

Pos logit contributions
side+5.699
 sudden+5.510
awed+5.405
 distance+5.286
eled+5.228
Neg logit contributions
 I-10.555
 share-9.957
 give-9.874
 listen-9.545
 please-9.431
Token day
Token ID1110
Feature activation+1.04e+00

Pos logit contributions
side+2.265
 sudden+2.093
amed+2.075
Meanwhile+1.932
J+1.793
Neg logit contributions
 I-2.624
 give-2.559
 thank-2.418
 leave-2.343
 ask-2.312
Token,
Token ID11
Feature activation-6.44e-01

Pos logit contributions
 I+1.269
 give+1.250
 Sorry+1.165
 leave+1.119
 thank+1.105
Neg logit contributions
side-2.757
 sudden-2.661
amed-2.538
Meanwhile-2.476
 distance-2.414
Token the
Token ID262
Feature activation-1.67e+00

Pos logit contributions
side+1.499
amed+1.423
 sudden+1.396
Meanwhile+1.343
aming+1.303
Neg logit contributions
 I-1.059
 give-1.011
 Sorry-0.948
 thank-0.920
 leave-0.895
Token that
Token ID326
Feature activation+3.31e-01

Pos logit contributions
 I+1.456
 give+1.437
 Sorry+1.368
 thank+1.322
 leave+1.321
Neg logit contributions
side-1.891
 sudden-1.794
amed-1.716
Meanwhile-1.654
 distance-1.610
Token frogs
Token ID37475
Feature activation+6.41e-03

Pos logit contributions
 I+0.586
 give+0.575
 Sorry+0.542
 thank+0.529
 leave+0.527
Neg logit contributions
side-0.703
 sudden-0.673
amed-0.643
Meanwhile-0.622
 distance-0.589
Token talk
Token ID1561
Feature activation-8.05e-01

Pos logit contributions
 I+0.012
 give+0.012
 Sorry+0.011
 thank+0.011
 leave+0.011
Neg logit contributions
side-0.013
 sudden-0.012
amed-0.012
Meanwhile-0.012
 distance-0.011
Token about
Token ID546
Feature activation-9.35e-01

Pos logit contributions
 sudden+1.792
side+1.792
amed+1.653
Meanwhile+1.650
 distance+1.548
Neg logit contributions
 I-1.362
 give-1.324
 thank-1.231
 leave-1.211
 Sorry-1.174
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
side+2.176
amed+2.065
 sudden+2.045
Meanwhile+1.989
 distance+1.854
Neg logit contributions
 I-1.566
 give-1.530
 thank-1.400
 leave-1.377
 Sorry-1.366
Token\n
Token ID198
Feature activation-4.22e+00

Pos logit contributions
side+2.095
 sudden+2.030
amed+1.963
Meanwhile+1.831
aming+1.803
Neg logit contributions
 I-2.010
 give-1.839
 Sorry-1.800
 thank-1.742
 ask-1.681
TokenThe
Token ID464
Feature activation-1.87e+00

Pos logit contributions
side+8.427
 distance+7.638
 sudden+7.514
amed+6.988
eled+6.825
Neg logit contributions
 I-10.077
 thank-9.654
 give-9.383
 share-9.233
 have-8.898
Token two
Token ID734
Feature activation-1.09e+00

Pos logit contributions
side+4.028
amed+3.575
 sudden+3.458
Meanwhile+3.406
J+3.263
Neg logit contributions
 give-3.765
 I-3.734
 thank-3.531
 leave-3.416
 share-3.385
Token were
Token ID547
Feature activation-1.20e+00

Pos logit contributions
side+2.414
amed+2.123
 sudden+2.089
Meanwhile+2.068
J+1.960
Neg logit contributions
 give-1.993
 I-1.971
 leave-1.862
 thank-1.856
 share-1.789
Token very
Token ID845
Feature activation+4.34e-01

Pos logit contributions
side+2.670
 sudden+2.498
Meanwhile+2.363
amed+2.282
J+2.267
Neg logit contributions
 I-2.089
 give-2.065
 thank-1.959
 leave-1.915
 share-1.864
Token happy
Token ID3772
Feature activation+3.96e-01

Pos logit contributions
 I+0.870
 give+0.862
 Sorry+0.819
 thank+0.807
 leave+0.801
Neg logit contributions
side-0.819
 sudden-0.769
amed-0.730
Meanwhile-0.704
 distance-0.675
Token would
Token ID561
Feature activation+4.50e-01

Pos logit contributions
 I+0.750
 give+0.737
 Sorry+0.696
 thank+0.678
 leave+0.676
Neg logit contributions
side-0.962
 sudden-0.919
amed-0.878
Meanwhile-0.851
 distance-0.814
Token last
Token ID938
Feature activation-7.78e-01

Pos logit contributions
 I+0.570
 give+0.562
 Sorry+0.511
 thank+0.502
 leave+0.499
Neg logit contributions
side-1.177
 sudden-1.132
amed-1.094
Meanwhile-1.058
 distance-1.025
Token forever
Token ID8097
Feature activation+2.88e-01

Pos logit contributions
side+1.662
 sudden+1.603
amed+1.575
Meanwhile+1.519
J+1.422
Neg logit contributions
 I-1.396
 give-1.379
 leave-1.273
 thank-1.270
 Sorry-1.241
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
 I+0.518
 give+0.511
 Sorry+0.474
 leave+0.471
 thank+0.470
Neg logit contributions
side-0.601
 sudden-0.576
amed-0.552
Meanwhile-0.531
 distance-0.508
Token\n
Token ID198
Feature activation-4.15e+00

Pos logit contributions
side+1.653
 sudden+1.581
amed+1.549
Meanwhile+1.421
aming+1.401
Neg logit contributions
 I-1.813
 give-1.671
 Sorry-1.650
 thank-1.583
 ask-1.544
Token\n
Token ID198
Feature activation-4.21e+00

Pos logit contributions
side+9.697
amed+8.741
 sudden+8.450
 distance+8.404
aming+7.961
Neg logit contributions
 I-8.955
 thank-8.368
 give-8.161
 ask-7.963
 have-7.784
TokenAt
Token ID2953
Feature activation-1.44e+00

Pos logit contributions
side+7.148
amed+6.774
 distance+6.644
awed+6.634
 sudden+6.475
Neg logit contributions
 I-11.110
 share-10.296
 give-10.274
 please-10.156
 listen-10.118
Token the
Token ID262
Feature activation-2.53e+00

Pos logit contributions
side+3.477
 sudden+3.449
amed+3.327
Meanwhile+3.145
aming+3.032
Neg logit contributions
 I-2.226
 give-2.194
 thank-2.063
 leave-2.022
 share-1.901
Token end
Token ID886
Feature activation-4.73e-01

Pos logit contributions
amed+5.789
 sudden+5.625
side+5.622
awed+5.441
Meanwhile+5.270
Neg logit contributions
 give-4.704
 I-4.590
 thank-4.466
 leave-4.407
 share-4.123
Token of
Token ID286
Feature activation-1.49e+00

Pos logit contributions
side+0.889
 sudden+0.828
amed+0.797
Meanwhile+0.755
 distance+0.734
Neg logit contributions
 I-0.991
 give-0.953
 Sorry-0.908
 thank-0.902
 leave-0.886
Token each
Token ID1123
Feature activation-2.01e+00

Pos logit contributions
side+3.705
 sudden+3.573
amed+3.497
Meanwhile+3.379
Days+3.268
Neg logit contributions
 I-2.274
 give-2.222
 thank-2.054
 leave-1.966
 Sorry-1.924
Token have
Token ID423
Feature activation+7.37e-01

Pos logit contributions
 Sorry+3.042
 I+3.008
 give+2.764
 thank+2.489
 leave+2.466
Neg logit contributions
side-6.412
 sudden-6.137
amed-5.909
 nearby-5.855
 distance-5.810
Token a
Token ID257
Feature activation+6.01e-01

Pos logit contributions
 I+1.238
 give+1.220
 Sorry+1.154
 thank+1.120
 leave+1.119
Neg logit contributions
side-1.624
 sudden-1.549
amed-1.479
Meanwhile-1.428
 distance-1.383
Token friend
Token ID1545
Feature activation+7.46e-01

Pos logit contributions
 I+1.185
 give+1.172
 Sorry+1.114
 thank+1.092
 leave+1.089
Neg logit contributions
side-1.150
 sudden-1.083
amed-1.031
Meanwhile-0.990
 distance-0.949
Token."
Token ID526
Feature activation-3.75e-01

Pos logit contributions
 I+0.981
 give+0.964
 Sorry+0.898
 thank+0.864
 leave+0.862
Neg logit contributions
side-1.903
 sudden-1.825
amed-1.756
Meanwhile-1.705
 distance-1.661
Token\n
Token ID198
Feature activation-3.03e+00

Pos logit contributions
side+0.756
 sudden+0.718
amed+0.687
Meanwhile+0.656
 distance+0.634
Neg logit contributions
 I-0.729
 give-0.702
 Sorry-0.673
 thank-0.663
 leave-0.640
Token\n
Token ID198
Feature activation-4.16e+00

Pos logit contributions
side+7.023
 sudden+6.281
amed+6.236
 distance+5.938
eled+5.768
Neg logit contributions
 I-5.875
 thank-5.708
 give-5.530
 ask-5.207
 share-5.192
TokenThe
Token ID464
Feature activation-8.96e-01

Pos logit contributions
side+7.237
eled+6.587
 doorway+6.569
amed+6.393
 distance+6.307
Neg logit contributions
 I-10.668
 share-10.502
 give-10.070
 listen-9.907
 thank-9.804
Token rain
Token ID6290
Feature activation-1.29e+00

Pos logit contributions
side+2.062
amed+1.840
 sudden+1.822
Meanwhile+1.750
aming+1.659
Neg logit contributions
 I-1.602
 give-1.586
 thank-1.473
 leave-1.431
 Sorry-1.412
Token stopped
Token ID5025
Feature activation+1.29e-01

Pos logit contributions
side+2.557
 sudden+2.544
Meanwhile+2.271
amed+2.230
J+2.191
Neg logit contributions
 give-2.460
 I-2.402
 thank-2.317
 leave-2.273
 share-2.191
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
 I+0.251
 give+0.250
 leave+0.233
 Sorry+0.232
 thank+0.231
Neg logit contributions
side-0.256
 sudden-0.235
amed-0.229
Meanwhile-0.218
 distance-0.209
Token and
Token ID290
Feature activation-3.61e-01

Pos logit contributions
 I+0.325
 give+0.321
 thank+0.300
 Sorry+0.299
 leave+0.298
Neg logit contributions
side-0.290
 sudden-0.272
amed-0.259
Meanwhile-0.246
J-0.235
Token of
Token ID286
Feature activation-8.25e-01

Pos logit contributions
side+1.793
 sudden+1.771
amed+1.672
Meanwhile+1.626
J+1.548
Neg logit contributions
 I-1.568
 give-1.517
 thank-1.399
 leave-1.397
 Sorry-1.355
Token being
Token ID852
Feature activation-9.00e-01

Pos logit contributions
side+1.915
 sudden+1.806
amed+1.798
Meanwhile+1.735
J+1.626
Neg logit contributions
 give-1.376
 I-1.366
 leave-1.260
 thank-1.250
 share-1.209
Token outdoors
Token ID24349
Feature activation-4.67e-01

Pos logit contributions
side+1.857
 sudden+1.732
Meanwhile+1.658
amed+1.601
 distance+1.521
Neg logit contributions
 I-1.695
 give-1.663
 leave-1.545
 Sorry-1.532
 thank-1.519
Token.
Token ID13
Feature activation-9.25e-01

Pos logit contributions
side+1.053
 sudden+1.032
amed+0.987
Meanwhile+0.969
J+0.910
Neg logit contributions
 I-0.770
 give-0.748
 leave-0.690
 Sorry-0.689
 thank-0.682
Token\n
Token ID198
Feature activation-3.20e+00

Pos logit contributions
side+1.843
 sudden+1.748
amed+1.733
Meanwhile+1.605
aming+1.580
Neg logit contributions
 I-1.861
 give-1.720
 Sorry-1.683
 thank-1.608
 ask-1.593
Token\n
Token ID198
Feature activation-4.16e+00

Pos logit contributions
side+7.945
amed+7.398
 sudden+7.028
 distance+6.951
aming+6.825
Neg logit contributions
 I-5.741
 thank-5.127
 give-5.124
 ask-4.877
 please-4.615
TokenAt
Token ID2953
Feature activation-1.31e+00

Pos logit contributions
side+6.514
awed+6.060
 distance+5.986
amed+5.985
eled+5.774
Neg logit contributions
 I-11.339
 give-10.393
 please-10.324
 share-10.313
 listen-10.103
Token the
Token ID262
Feature activation-1.72e+00

Pos logit contributions
side+3.007
 sudden+2.989
amed+2.909
Meanwhile+2.722
aming+2.638
Neg logit contributions
 I-2.142
 give-2.141
 thank-2.001
 leave-1.973
 ask-1.866
Token end
Token ID886
Feature activation-3.39e-01

Pos logit contributions
amed+3.517
side+3.508
 sudden+3.357
Meanwhile+3.147
aming+3.115
Neg logit contributions
 give-3.374
 I-3.256
 leave-3.121
 thank-3.082
 ask-3.029
Token of
Token ID286
Feature activation-1.10e+00

Pos logit contributions
side+0.631
 sudden+0.584
amed+0.565
Meanwhile+0.532
 distance+0.516
Neg logit contributions
 I-0.711
 give-0.689
 Sorry-0.655
 thank-0.650
 leave-0.640
Token the
Token ID262
Feature activation-9.10e-01

Pos logit contributions
side+2.627
 sudden+2.504
amed+2.440
Meanwhile+2.360
Days+2.246
Neg logit contributions
 give-1.732
 I-1.708
 thank-1.576
 leave-1.548
 Sorry-1.506
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
side+2.194
 sudden+2.069
amed+1.995
Meanwhile+1.966
 distance+1.848
Neg logit contributions
 I-1.501
 give-1.478
 thank-1.353
 leave-1.332
 Sorry-1.304
Token clothes
Token ID8242
Feature activation+1.99e-01

Pos logit contributions
side+3.843
amed+3.434
 sudden+3.375
Meanwhile+3.319
 nowhere+3.087
Neg logit contributions
 give-3.088
 I-2.979
 leave-2.811
 thank-2.790
 ask-2.720
Token were
Token ID547
Feature activation-6.24e-01

Pos logit contributions
 I+0.344
 give+0.343
 leave+0.316
 Sorry+0.314
 thank+0.313
Neg logit contributions
side-0.423
 sudden-0.408
amed-0.389
Meanwhile-0.377
 distance-0.358
Token dry
Token ID5894
Feature activation+3.38e-02

Pos logit contributions
side+1.319
 sudden+1.266
amed+1.186
Meanwhile+1.186
J+1.106
Neg logit contributions
 give-1.110
 I-1.097
 leave-1.035
 thank-1.010
 share-0.988
Token!
Token ID0
Feature activation-7.38e-01

Pos logit contributions
 I+0.064
 give+0.063
 leave+0.059
 Sorry+0.058
 thank+0.058
Neg logit contributions
side-0.066
 sudden-0.064
amed-0.060
Meanwhile-0.059
J-0.055
Token
Token ID198
Feature activation-4.16e+00

Pos logit contributions
side+1.445
 sudden+1.356
amed+1.319
Meanwhile+1.225
aming+1.205
Neg logit contributions
 I-1.538
 give-1.417
 Sorry-1.389
 thank-1.342
 ask-1.302
TokenThe
Token ID464
Feature activation-8.56e-01

Pos logit contributions
side+8.648
eled+7.494
 distance+7.218
 sudden+7.112
awed+7.047
Neg logit contributions
 I-10.216
 thank-9.311
 give-9.306
 share-9.077
 please-8.925
Token h
Token ID289
Feature activation+9.02e-01

Pos logit contributions
side+1.840
 sudden+1.624
amed+1.612
Meanwhile+1.535
 distance+1.484
Neg logit contributions
 give-1.627
 I-1.579
 thank-1.487
 leave-1.476
 ask-1.438
Tokenanger
Token ID2564
Feature activation-1.59e+00

Pos logit contributions
 I+1.983
 give+1.961
 Sorry+1.883
 thank+1.839
 leave+1.837
Neg logit contributions
side-1.529
 sudden-1.428
amed-1.356
Meanwhile-1.291
 distance-1.221
Token was
Token ID373
Feature activation-1.41e+00

Pos logit contributions
side+3.182
 sudden+3.068
Meanwhile+2.742
amed+2.721
 distance+2.693
Neg logit contributions
 I-3.152
 give-3.139
 leave-2.929
 thank-2.905
 share-2.744
Token now
Token ID783
Feature activation-2.54e-01

Pos logit contributions
side+3.189
 sudden+3.010
Meanwhile+2.784
J+2.715
amed+2.709
Neg logit contributions
 give-2.397
 I-2.379
 thank-2.245
 leave-2.209
 share-2.092
Token lots
Token ID6041
Feature activation-9.49e-01

Pos logit contributions
side+4.245
 sudden+3.840
Meanwhile+3.810
amed+3.668
eled+3.499
Neg logit contributions
 I-3.774
 give-3.600
 thank-3.411
 share-3.357
 leave-3.280
Token of
Token ID286
Feature activation-8.65e-01

Pos logit contributions
side+1.853
 sudden+1.745
amed+1.687
Meanwhile+1.663
eled+1.535
Neg logit contributions
 I-1.904
 give-1.884
 thank-1.823
 leave-1.720
 share-1.685
Token fun
Token ID1257
Feature activation-1.14e-01

Pos logit contributions
side+2.000
 sudden+1.779
Meanwhile+1.772
amed+1.758
 distance+1.628
Neg logit contributions
 give-1.488
 I-1.481
 thank-1.435
 share-1.354
 leave-1.333
Token.
Token ID13
Feature activation-8.44e-01

Pos logit contributions
side+0.240
 sudden+0.225
amed+0.220
Meanwhile+0.215
 distance+0.204
Neg logit contributions
 I-0.206
 give-0.201
 Sorry-0.187
 thank-0.185
 leave-0.183
Token
Token ID198
Feature activation-3.53e+00

Pos logit contributions
side+1.633
 sudden+1.567
amed+1.515
Meanwhile+1.423
aming+1.367
Neg logit contributions
 I-1.745
 give-1.635
 Sorry-1.589
 thank-1.554
 leave-1.493
Token
Token ID198
Feature activation-4.16e+00

Pos logit contributions
side+6.836
 sudden+6.051
amed+5.870
 distance+5.671
aming+5.273
Neg logit contributions
 I-8.296
 thank-8.161
 give-7.946
 share-7.506
 have-7.311
TokenThe
Token ID464
Feature activation-1.84e+00

Pos logit contributions
side+6.566
awed+6.043
 sudden+5.960
amed+5.941
 distance+5.919
Neg logit contributions
 I-11.105
 give-10.618
 share-10.602
 thank-10.324
 listen-10.137
Token little
Token ID1310
Feature activation-3.90e-02

Pos logit contributions
side+3.881
amed+3.457
Meanwhile+3.249
 sudden+3.249
J+3.197
Neg logit contributions
 give-3.692
 I-3.664
 thank-3.573
 leave-3.376
 sorry-3.245
Token butter
Token ID9215
Feature activation-1.19e+00

Pos logit contributions
side+0.075
 sudden+0.068
amed+0.067
Meanwhile+0.064
J+0.061
Neg logit contributions
 I-0.079
 give-0.078
 Sorry-0.073
 leave-0.073
 thank-0.073
Token is
Token ID318
Feature activation-1.32e+00

Pos logit contributions
side+2.442
 sudden+2.339
amed+2.173
Meanwhile+2.114
J+2.087
Neg logit contributions
 give-2.293
 I-2.284
 thank-2.153
 leave-2.096
 share-2.068
Token no
Token ID645
Feature activation+2.32e-01

Pos logit contributions
side+3.046
 sudden+2.919
Meanwhile+2.726
amed+2.668
J+2.648
Neg logit contributions
 I-2.187
 give-2.174
 thank-2.036
 leave-2.000
 share-1.905
Token that
Token ID326
Feature activation+4.40e-01

Pos logit contributions
side+0.380
 sudden+0.374
amed+0.351
Meanwhile+0.343
 distance+0.326
Neg logit contributions
 I-0.331
 give-0.322
 thank-0.300
 Sorry-0.295
 leave-0.295
Token would
Token ID561
Feature activation+4.50e-01

Pos logit contributions
 I+0.750
 give+0.737
 Sorry+0.696
 thank+0.678
 leave+0.676
Neg logit contributions
side-0.962
 sudden-0.919
amed-0.878
Meanwhile-0.851
 distance-0.814
Token last
Token ID938
Feature activation-7.78e-01

Pos logit contributions
 I+0.570
 give+0.562
 Sorry+0.511
 thank+0.502
 leave+0.499
Neg logit contributions
side-1.177
 sudden-1.132
amed-1.094
Meanwhile-1.058
 distance-1.025
Token forever
Token ID8097
Feature activation+2.88e-01

Pos logit contributions
side+1.662
 sudden+1.603
amed+1.575
Meanwhile+1.519
J+1.422
Neg logit contributions
 I-1.396
 give-1.379
 leave-1.273
 thank-1.270
 Sorry-1.241
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
 I+0.518
 give+0.511
 Sorry+0.474
 leave+0.471
 thank+0.470
Neg logit contributions
side-0.601
 sudden-0.576
amed-0.552
Meanwhile-0.531
 distance-0.508
Token
Token ID198
Feature activation-4.15e+00

Pos logit contributions
side+1.653
 sudden+1.581
amed+1.549
Meanwhile+1.421
aming+1.401
Neg logit contributions
 I-1.813
 give-1.671
 Sorry-1.650
 thank-1.583
 ask-1.544
Token
Token ID198
Feature activation-4.21e+00

Pos logit contributions
side+9.697
amed+8.741
 sudden+8.450
 distance+8.404
aming+7.961
Neg logit contributions
 I-8.955
 thank-8.368
 give-8.161
 ask-7.963
 have-7.784
TokenAt
Token ID2953
Feature activation-1.44e+00

Pos logit contributions
side+7.148
amed+6.774
 distance+6.644
awed+6.634
 sudden+6.475
Neg logit contributions
 I-11.110
 share-10.296
 give-10.274
 please-10.156
 listen-10.118
Token the
Token ID262
Feature activation-2.53e+00

Pos logit contributions
side+3.477
 sudden+3.449
amed+3.327
Meanwhile+3.145
aming+3.032
Neg logit contributions
 I-2.226
 give-2.194
 thank-2.063
 leave-2.022
 share-1.901
Token end
Token ID886
Feature activation-4.73e-01

Pos logit contributions
amed+5.789
 sudden+5.625
side+5.622
awed+5.441
Meanwhile+5.270
Neg logit contributions
 give-4.704
 I-4.590
 thank-4.466
 leave-4.407
 share-4.123
Token of
Token ID286
Feature activation-1.49e+00

Pos logit contributions
side+0.889
 sudden+0.828
amed+0.797
Meanwhile+0.755
 distance+0.734
Neg logit contributions
 I-0.991
 give-0.953
 Sorry-0.908
 thank-0.902
 leave-0.886
Token admire
Token ID21099
Feature activation-6.19e-01

Pos logit contributions
 I+1.479
 give+1.426
 Sorry+1.350
 thank+1.265
 leave+1.255
Neg logit contributions
side-3.579
 sudden-3.427
amed-3.273
Meanwhile-3.210
 distance-3.194
Token its
Token ID663
Feature activation-1.04e+00

Pos logit contributions
side+1.432
 sudden+1.356
amed+1.347
Meanwhile+1.306
 distance+1.220
Neg logit contributions
 I-1.023
 give-0.981
 thank-0.915
 Sorry-0.907
 leave-0.903
Token beauty
Token ID8737
Feature activation-2.78e-01

Pos logit contributions
side+2.104
amed+1.961
 sudden+1.938
Meanwhile+1.890
J+1.823
Neg logit contributions
 give-2.005
 I-1.958
 thank-1.894
 leave-1.869
 Sorry-1.819
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
side+0.595
 sudden+0.589
amed+0.555
Meanwhile+0.545
J+0.514
Neg logit contributions
 I-0.485
 give-0.470
 thank-0.436
 leave-0.429
 Sorry-0.425
Token
Token ID198
Feature activation-4.08e+00

Pos logit contributions
side+2.369
 sudden+2.299
amed+2.271
Meanwhile+2.106
aming+2.078
Neg logit contributions
 I-2.057
 give-1.879
 Sorry-1.817
 thank-1.756
 ask-1.738
Token
Token ID198
Feature activation-4.13e+00

Pos logit contributions
side+11.097
amed+10.760
 distance+9.803
 sudden+9.778
aming+9.754
Neg logit contributions
 I-7.095
 ask-6.235
 give-6.230
 thank-6.097
 please-5.871
TokenOne
Token ID3198
Feature activation-1.02e+00

Pos logit contributions
side+7.217
amed+7.161
awed+7.030
 sudden+6.558
 distance+6.526
Neg logit contributions
 I-10.474
 give-9.741
 please-9.631
 share-9.604
 listen-9.524
Token day
Token ID1110
Feature activation+9.56e-01

Pos logit contributions
side+1.826
 sudden+1.716
amed+1.697
Meanwhile+1.582
awed+1.444
Neg logit contributions
 I-2.227
 give-2.168
 thank-2.042
 leave-2.001
 Sorry-1.989
Token,
Token ID11
Feature activation-8.04e-01

Pos logit contributions
 I+1.106
 give+1.087
 Sorry+1.005
 leave+0.963
 thank+0.956
Neg logit contributions
side-2.594
 sudden-2.504
amed-2.397
Meanwhile-2.338
 distance-2.277
Token a
Token ID257
Feature activation-1.20e+00

Pos logit contributions
side+1.838
amed+1.745
 sudden+1.704
Meanwhile+1.647
aming+1.589
Neg logit contributions
 I-1.348
 give-1.290
 Sorry-1.207
 thank-1.164
 leave-1.145
Token little
Token ID1310
Feature activation-1.23e-01

Pos logit contributions
side+2.465
amed+2.270
Meanwhile+2.139
 sudden+2.093
aming+2.030
Neg logit contributions
 I-2.359
 give-2.306
 thank-2.237
 Sorry-2.171
 leave-2.062
Token continued
Token ID3767
Feature activation-3.36e-01

Pos logit contributions
 I+1.300
 give+1.281
 Sorry+1.216
 thank+1.183
 leave+1.182
Neg logit contributions
side-1.527
 sudden-1.451
amed-1.383
Meanwhile-1.334
 distance-1.289
Token on
Token ID319
Feature activation-6.20e-01

Pos logit contributions
side+0.707
 sudden+0.677
amed+0.651
Meanwhile+0.631
 distance+0.599
Neg logit contributions
 give-0.596
 I-0.595
 leave-0.549
 thank-0.548
 Sorry-0.546
Token his
Token ID465
Feature activation-3.13e-01

Pos logit contributions
side+1.277
 sudden+1.211
amed+1.201
Meanwhile+1.147
aming+1.067
Neg logit contributions
 give-1.139
 I-1.138
 leave-1.060
 thank-1.059
 Sorry-1.045
Token way
Token ID835
Feature activation-3.18e-01

Pos logit contributions
side+0.601
 sudden+0.562
amed+0.557
Meanwhile+0.532
J+0.500
Neg logit contributions
 give-0.617
 I-0.616
 leave-0.582
 thank-0.581
 Sorry-0.570
Token.
Token ID13
Feature activation-7.09e-01

Pos logit contributions
side+0.706
 sudden+0.680
amed+0.656
Meanwhile+0.628
J+0.596
Neg logit contributions
 I-0.545
 give-0.526
 leave-0.490
 thank-0.479
 Sorry-0.474
Token
Token ID198
Feature activation-4.13e+00

Pos logit contributions
side+1.335
 sudden+1.274
amed+1.232
Meanwhile+1.135
aming+1.116
Neg logit contributions
 I-1.495
 give-1.415
 Sorry-1.365
 thank-1.339
 ask-1.297
TokenThe
Token ID464
Feature activation-1.28e+00

Pos logit contributions
side+7.719
 sudden+6.832
 distance+6.532
amed+6.373
awed+6.307
Neg logit contributions
 I-10.245
 thank-10.135
 give-9.927
 share-9.397
 listen-9.135
Token little
Token ID1310
Feature activation+3.24e-01

Pos logit contributions
side+2.569
amed+2.325
 sudden+2.290
Meanwhile+2.189
aming+2.060
Neg logit contributions
 give-2.624
 I-2.606
 thank-2.494
 leave-2.384
 Sorry-2.345
Token boy
Token ID2933
Feature activation-1.44e+00

Pos logit contributions
 I+0.730
 give+0.721
 Sorry+0.686
 leave+0.674
 thank+0.673
Neg logit contributions
side-0.544
 sudden-0.504
amed-0.480
Meanwhile-0.457
 distance-0.429
Token was
Token ID373
Feature activation-1.09e+00

Pos logit contributions
side+2.923
 sudden+2.703
amed+2.527
Meanwhile+2.481
J+2.427
Neg logit contributions
 give-2.864
 I-2.830
 leave-2.683
 thank-2.670
 share-2.545
Token very
Token ID845
Feature activation+6.95e-01

Pos logit contributions
side+2.405
 sudden+2.290
Meanwhile+2.066
amed+2.043
J+2.030
Neg logit contributions
 I-1.917
 give-1.902
 leave-1.768
 thank-1.768
 sorry-1.655
Token to
Token ID284
Feature activation+1.69e+00

Pos logit contributions
side+1.337
 sudden+1.325
amed+1.202
Meanwhile+1.189
 distance+1.126
Neg logit contributions
 I-1.181
 give-1.124
 Sorry-1.036
 thank-1.025
 leave-1.009
Token his
Token ID465
Feature activation-5.97e-02

Pos logit contributions
 I+3.116
 Sorry+3.055
 give+3.001
 leave+2.815
 thank+2.789
Neg logit contributions
side-3.327
 sudden-3.186
amed-2.981
Meanwhile-2.922
 distance-2.843
Token mom
Token ID1995
Feature activation-9.86e-02

Pos logit contributions
side+0.150
 sudden+0.141
amed+0.139
Meanwhile+0.134
 distance+0.128
Neg logit contributions
 I-0.087
 give-0.086
 thank-0.079
 Sorry-0.078
 leave-0.078
Token.
Token ID13
Feature activation-7.86e-01

Pos logit contributions
side+0.214
 sudden+0.210
amed+0.198
Meanwhile+0.193
 distance+0.183
Neg logit contributions
 I-0.170
 give-0.170
 thank-0.155
 leave-0.155
 Sorry-0.153
Token
Token ID198
Feature activation-3.43e+00

Pos logit contributions
side+1.771
 sudden+1.739
amed+1.662
 distance+1.583
Meanwhile+1.578
Neg logit contributions
 I-1.321
 give-1.227
 Sorry-1.214
 thank-1.155
 leave-1.108
Token
Token ID198
Feature activation-4.12e+00

Pos logit contributions
side+6.580
 sudden+6.195
 distance+6.008
amed+5.873
aming+5.420
Neg logit contributions
 I-7.918
 thank-7.500
 give-7.480
 share-7.128
 ask-6.969
TokenMax
Token ID11518
Feature activation-5.40e-01

Pos logit contributions
 distance+7.057
side+6.965
 valleys+6.533
 sudden+6.500
eled+6.258
Neg logit contributions
 I-10.248
 share-10.088
 give-9.972
 listen-9.473
 thank-9.380
Tokeni
Token ID72
Feature activation-4.05e-01

Pos logit contributions
side+1.109
 sudden+1.047
amed+0.979
Meanwhile+0.949
 distance+0.937
Neg logit contributions
 give-1.014
 I-1.014
 thank-0.940
 leave-0.934
 share-0.908
Token loved
Token ID6151
Feature activation-2.82e-01

Pos logit contributions
side+0.803
 sudden+0.758
amed+0.714
Meanwhile+0.692
 distance+0.680
Neg logit contributions
 I-0.782
 give-0.782
 thank-0.724
 leave-0.720
 Sorry-0.709
Token having
Token ID1719
Feature activation-6.72e-01

Pos logit contributions
side+0.639
amed+0.604
 sudden+0.603
Meanwhile+0.573
 distance+0.554
Neg logit contributions
 I-0.470
 give-0.467
 thank-0.432
 leave-0.417
 Sorry-0.417
Token dinner
Token ID8073
Feature activation-7.82e-01

Pos logit contributions
side+1.495
 sudden+1.393
amed+1.384
Meanwhile+1.358
 distance+1.289
Neg logit contributions
 I-1.151
 give-1.121
 thank-1.048
 share-1.011
 leave-1.002
Token happily
Token ID18177
Feature activation+3.32e-01

Pos logit contributions
side+3.827
 sudden+3.558
Meanwhile+3.409
 distance+3.043
amed+3.035
Neg logit contributions
 give-4.524
 share-4.445
 I-4.412
 thank-4.321
 leave-4.206
Token ever
Token ID1683
Feature activation+2.63e-01

Pos logit contributions
 I+0.546
 give+0.538
 Sorry+0.503
 leave+0.492
 thank+0.492
Neg logit contributions
side-0.742
 sudden-0.710
amed-0.676
Meanwhile-0.661
 distance-0.635
Token after
Token ID706
Feature activation-4.60e-01

Pos logit contributions
 I+0.399
 give+0.395
 Sorry+0.363
 leave+0.358
 thank+0.357
Neg logit contributions
side-0.617
 sudden-0.592
amed-0.573
Meanwhile-0.552
 distance-0.529
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
side+1.039
 sudden+1.005
amed+0.971
Meanwhile+0.948
 distance+0.913
Neg logit contributions
 I-0.759
 give-0.742
 thank-0.686
 Sorry-0.680
 leave-0.668
Token
Token ID198
Feature activation-2.70e+00

Pos logit contributions
 I+0.593
 give+0.576
 Sorry+0.555
 thank+0.544
 leave+0.537
Neg logit contributions
side-0.470
 sudden-0.442
amed-0.420
Meanwhile-0.397
 distance-0.379
Token
Token ID198
Feature activation-4.11e+00

Pos logit contributions
side+4.215
 sudden+3.870
amed+3.418
Meanwhile+3.184
 distance+3.088
Neg logit contributions
 thank-6.974
 I-6.957
 give-6.894
 share-6.485
 ask-6.399
TokenThe
Token ID464
Feature activation-1.00e+00

Pos logit contributions
 sudden+6.317
eled+6.122
side+5.998
amed+5.710
awed+5.492
Neg logit contributions
 I-11.206
 thank-10.561
 give-10.468
 share-10.461
 listen-10.323
Token moral
Token ID6573
Feature activation-4.24e-01

Pos logit contributions
side+1.880
 sudden+1.689
amed+1.652
Meanwhile+1.527
J+1.502
Neg logit contributions
 give-2.165
 I-2.132
 thank-2.081
 leave-1.994
 ask-1.980
Token of
Token ID286
Feature activation-1.31e+00

Pos logit contributions
side+0.799
 sudden+0.768
amed+0.708
Meanwhile+0.671
 distance+0.642
Neg logit contributions
 I-0.883
 give-0.855
 thank-0.804
 Sorry-0.799
 leave-0.785
Token the
Token ID262
Feature activation+8.06e-01

Pos logit contributions
side+3.220
 sudden+3.194
amed+3.004
Meanwhile+2.902
Days+2.751
Neg logit contributions
 give-1.987
 I-1.972
 thank-1.791
 share-1.734
 leave-1.719
Token story
Token ID1621
Feature activation+1.42e+00

Pos logit contributions
 I+1.710
 give+1.686
 Sorry+1.619
 leave+1.579
 thank+1.576
Neg logit contributions
side-1.426
 sudden-1.342
amed-1.264
Meanwhile-1.211
 distance-1.165
Token wait
Token ID4043
Feature activation+3.31e-01

Pos logit contributions
 I+0.918
 give+0.842
 Sorry+0.822
 thank+0.695
 leave+0.671
Neg logit contributions
side-3.904
 sudden-3.768
amed-3.623
Meanwhile-3.549
 distance-3.539
Token to
Token ID284
Feature activation+2.29e+00

Pos logit contributions
 I+0.523
 give+0.516
 thank+0.474
 Sorry+0.474
 leave+0.469
Neg logit contributions
side-0.765
 sudden-0.729
amed-0.705
Meanwhile-0.681
 distance-0.658
Token explore
Token ID7301
Feature activation-1.06e+00

Pos logit contributions
 I+2.970
 Sorry+2.890
 give+2.676
 thank+2.456
 leave+2.358
Neg logit contributions
side-5.645
 sudden-5.580
 distance-5.289
amed-5.249
 nearby-5.120
Token more
Token ID517
Feature activation-9.57e-01

Pos logit contributions
side+2.512
amed+2.417
 sudden+2.380
Meanwhile+2.306
aming+2.120
Neg logit contributions
 I-1.737
 give-1.668
 thank-1.551
 leave-1.498
 share-1.457
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
side+2.113
amed+2.044
 sudden+1.955
Meanwhile+1.904
aming+1.763
Neg logit contributions
 I-1.707
 give-1.612
 thank-1.483
 leave-1.479
 ask-1.455
Token
Token ID198
Feature activation-4.11e+00

Pos logit contributions
side+0.592
 sudden+0.567
amed+0.551
Meanwhile+0.514
aming+0.500
Neg logit contributions
 I-0.611
 give-0.585
 Sorry-0.556
 thank-0.547
 leave-0.539
Token
Token ID198
Feature activation-3.15e+00

Pos logit contributions
side+7.325
 sudden+6.806
amed+6.725
awed+6.488
eled+6.393
Neg logit contributions
 I-9.987
 give-9.878
 thank-9.797
 please-9.359
 ask-9.316
TokenLater
Token ID18602
Feature activation+3.15e-01

Pos logit contributions
side+3.995
 sudden+3.876
amed+3.791
awed+3.790
eled+3.657
Neg logit contributions
 I-8.445
 give-8.240
 please-7.960
 share-7.928
 listen-7.849
Token that
Token ID326
Feature activation-7.60e-01

Pos logit contributions
 I+0.428
 give+0.414
 Sorry+0.382
 thank+0.375
 leave+0.373
Neg logit contributions
side-0.800
 sudden-0.766
amed-0.741
Meanwhile-0.715
 distance-0.695
Token day
Token ID1110
Feature activation+9.22e-01

Pos logit contributions
side+1.345
 sudden+1.297
amed+1.281
Meanwhile+1.155
J+1.134
Neg logit contributions
 I-1.644
 give-1.597
 thank-1.507
 Sorry-1.499
 leave-1.472
Token,
Token ID11
Feature activation+1.86e-01

Pos logit contributions
 I+1.065
 give+1.045
 Sorry+0.966
 leave+0.924
 thank+0.918
Neg logit contributions
side-2.516
 sudden-2.426
amed-2.326
Meanwhile-2.269
 distance-2.209
Token park
Token ID3952
Feature activation-9.68e-01

Pos logit contributions
side+3.491
amed+3.425
 sudden+3.339
Meanwhile+3.035
J+3.010
Neg logit contributions
 I-4.437
 give-4.257
 leave-4.047
 Sorry-4.004
 thank-3.994
Token and
Token ID290
Feature activation+8.71e-01

Pos logit contributions
 sudden+2.225
side+2.165
amed+2.086
Meanwhile+2.033
aming+1.894
Neg logit contributions
 I-1.620
 give-1.562
 leave-1.431
 thank-1.393
 share-1.374
Token play
Token ID711
Feature activation-1.20e+00

Pos logit contributions
 I+1.225
 give+1.202
 Sorry+1.130
 thank+1.090
 leave+1.084
Neg logit contributions
side-2.139
 sudden-2.047
amed-1.960
Meanwhile-1.906
 distance-1.862
Token together
Token ID1978
Feature activation-1.26e+00

Pos logit contributions
side+2.573
 sudden+2.477
Meanwhile+2.349
amed+2.323
 distance+2.200
Neg logit contributions
 I-2.238
 give-2.171
 share-2.029
 thank-2.014
 leave-1.941
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
side+2.991
 sudden+2.989
amed+2.913
Meanwhile+2.839
aming+2.670
Neg logit contributions
 give-1.952
 I-1.943
 thank-1.759
 leave-1.758
 share-1.710
Token
Token ID198
Feature activation-4.10e+00

Pos logit contributions
side+2.617
 sudden+2.519
amed+2.505
Meanwhile+2.312
aming+2.309
Neg logit contributions
 I-1.936
 give-1.743
 Sorry-1.695
 ask-1.603
 thank-1.578
Token
Token ID198
Feature activation-4.00e+00

Pos logit contributions
side+11.024
amed+10.505
 sudden+10.053
 distance+9.789
awed+9.701
Neg logit contributions
 I-7.271
 ask-6.354
 give-6.084
 please-5.915
 thank-5.769
TokenOne
Token ID3198
Feature activation-1.43e+00

Pos logit contributions
side+6.459
awed+6.125
amed+6.003
 distance+5.967
 sudden+5.934
Neg logit contributions
 I-10.504
 please-9.580
 give-9.431
 share-9.418
 ask-9.311
Token day
Token ID1110
Feature activation+7.61e-01

Pos logit contributions
side+2.738
 sudden+2.544
amed+2.508
Meanwhile+2.300
 nowhere+2.165
Neg logit contributions
 I-3.123
 give-2.996
 thank-2.842
 leave-2.776
 ask-2.738
Token,
Token ID11
Feature activation-7.17e-01

Pos logit contributions
 I+0.909
 give+0.891
 Sorry+0.822
 leave+0.788
 thank+0.788
Neg logit contributions
side-2.041
 sudden-1.964
amed-1.889
Meanwhile-1.839
 distance-1.791
Token they
Token ID484
Feature activation-2.23e-01

Pos logit contributions
side+1.780
amed+1.689
 sudden+1.663
Meanwhile+1.604
 distance+1.555
Neg logit contributions
 I-1.062
 give-1.008
 Sorry-0.942
 thank-0.894
 leave-0.882
Token'm
Token ID1101
Feature activation+1.92e+00

Pos logit contributions
 I+1.281
 give+1.230
 Sorry+1.170
 thank+1.101
 leave+1.097
Neg logit contributions
side-2.874
 sudden-2.762
amed-2.656
Meanwhile-2.590
 distance-2.534
Token still
Token ID991
Feature activation+1.07e+00

Pos logit contributions
 Sorry+3.309
 I+3.283
 give+3.235
 leave+3.034
 thank+2.989
Neg logit contributions
side-3.942
 sudden-3.867
amed-3.671
Meanwhile-3.555
 nowhere-3.466
Token hungry
Token ID14720
Feature activation+9.15e-01

Pos logit contributions
 I+1.912
 give+1.893
 Sorry+1.823
 thank+1.757
 leave+1.755
Neg logit contributions
side-2.201
 sudden-2.108
amed-1.999
Meanwhile-1.921
 distance-1.870
Token".
Token ID1911
Feature activation-6.76e-01

Pos logit contributions
 I+1.308
 give+1.294
 Sorry+1.229
 thank+1.175
 leave+1.169
Neg logit contributions
side-2.217
 sudden-2.114
amed-2.024
Meanwhile-1.967
 distance-1.922
Token
Token ID198
Feature activation-3.21e+00

Pos logit contributions
side+1.363
 sudden+1.276
amed+1.255
Meanwhile+1.166
aming+1.142
Neg logit contributions
 I-1.340
 give-1.274
 Sorry-1.219
 thank-1.186
 ask-1.164
Token
Token ID198
Feature activation-4.09e+00

Pos logit contributions
side+7.959
amed+7.541
 distance+7.080
 sudden+7.063
eled+7.015
Neg logit contributions
 I-5.657
 give-5.093
 thank-5.082
 ask-4.955
 Sorry-4.616
TokenThe
Token ID464
Feature activation-1.50e+00

Pos logit contributions
side+6.113
eled+5.783
amed+5.771
 doorway+5.574
 distance+5.325
Neg logit contributions
 I-11.403
 share-10.372
 give-10.347
 ask-10.138
 please-10.053
Token cone
Token ID27763
Feature activation-2.08e+00

Pos logit contributions
side+3.178
amed+2.851
 sudden+2.718
J+2.558
Meanwhile+2.551
Neg logit contributions
 I-3.078
 give-2.992
 thank-2.927
 leave-2.731
 ask-2.653
Token was
Token ID373
Feature activation-1.64e+00

Pos logit contributions
side+4.577
 sudden+4.407
amed+4.164
J+3.961
Meanwhile+3.952
Neg logit contributions
 give-3.808
 I-3.677
 thank-3.512
 leave-3.428
 share-3.352
Token angry
Token ID7954
Feature activation+1.80e-01

Pos logit contributions
side+4.023
 sudden+3.694
J+3.539
amed+3.412
Meanwhile+3.404
Neg logit contributions
 give-2.624
 I-2.520
 thank-2.443
 leave-2.417
 sorry-2.397
Token and
Token ID290
Feature activation+4.31e-02

Pos logit contributions
 I+0.349
 give+0.339
 Sorry+0.314
 thank+0.314
 leave+0.313
Neg logit contributions
side-0.360
 sudden-0.341
amed-0.325
Meanwhile-0.310
 distance-0.302
Token having
Token ID1719
Feature activation-1.91e+00

Pos logit contributions
 I+0.488
 give+0.480
 Sorry+0.452
 leave+0.441
 thank+0.440
Neg logit contributions
side-0.580
 sudden-0.550
amed-0.527
Meanwhile-0.510
 distance-0.488
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
side+4.486
Meanwhile+4.144
amed+4.088
 sudden+4.032
eled+3.897
Neg logit contributions
 I-3.330
 give-3.138
 thank-2.967
 share-2.884
 leave-2.838
Token best
Token ID1266
Feature activation-7.50e-01

Pos logit contributions
side+3.769
 sudden+3.479
amed+3.382
Meanwhile+3.381
eled+3.304
Neg logit contributions
 I-2.958
 give-2.899
 thank-2.731
 Sorry-2.605
 share-2.600
Token time
Token ID640
Feature activation-4.11e-01

Pos logit contributions
side+1.292
amed+1.162
 sudden+1.158
Meanwhile+1.115
aming+1.019
Neg logit contributions
 I-1.700
 give-1.671
 thank-1.579
 Sorry-1.561
 leave-1.561
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
side+0.914
 sudden+0.868
amed+0.843
Meanwhile+0.820
 distance+0.790
Neg logit contributions
 I-0.716
 give-0.680
 Sorry-0.629
 thank-0.625
 leave-0.619
Token
Token ID198
Feature activation-4.09e+00

Pos logit contributions
side+0.917
 sudden+0.875
amed+0.853
Meanwhile+0.801
 distance+0.784
Neg logit contributions
 I-0.770
 give-0.717
 Sorry-0.698
 thank-0.665
 leave-0.650
Token
Token ID198
Feature activation-3.57e+00

Pos logit contributions
side+10.513
amed+9.733
 distance+9.257
 sudden+9.178
eled+8.859
Neg logit contributions
 I-8.008
 ask-7.055
 give-7.038
 thank-6.925
 don-6.696
TokenThe
Token ID464
Feature activation-2.08e+00

Pos logit contributions
side+5.619
amed+5.188
 distance+5.005
awed+4.984
 sudden+4.799
Neg logit contributions
 I-9.553
 give-8.856
 share-8.803
 please-8.745
 Sorry-8.663
Token raven
Token ID37735
Feature activation-1.65e+00

Pos logit contributions
side+4.595
amed+4.148
 sudden+3.999
Meanwhile+3.899
eled+3.853
Neg logit contributions
 I-4.177
 give-4.107
 thank-3.936
 Sorry-3.698
 leave-3.626
Token was
Token ID373
Feature activation-1.17e+00

Pos logit contributions
side+3.451
 sudden+3.121
amed+3.083
Meanwhile+2.946
 distance+2.867
Neg logit contributions
 give-3.223
 I-3.203
 thank-2.942
 leave-2.933
 share-2.874
Token so
Token ID523
Feature activation-4.42e-01

Pos logit contributions
side+2.591
 sudden+2.447
Meanwhile+2.235
amed+2.182
J+2.170
Neg logit contributions
 give-2.065
 I-2.062
 leave-1.916
 thank-1.904
 share-1.850
Token and
Token ID290
Feature activation+5.29e-01

Pos logit contributions
 I+0.485
 give+0.475
 Sorry+0.442
 thank+0.436
 leave+0.434
Neg logit contributions
side-0.668
 sudden-0.640
amed-0.608
Meanwhile-0.592
 distance-0.568
Token always
Token ID1464
Feature activation+1.51e+00

Pos logit contributions
 I+0.887
 give+0.874
 Sorry+0.823
 thank+0.804
 leave+0.801
Neg logit contributions
side-1.166
 sudden-1.110
amed-1.059
Meanwhile-1.027
 distance-0.990
Token played
Token ID2826
Feature activation-6.21e-01

Pos logit contributions
 I+2.678
 give+2.621
 Sorry+2.528
 thank+2.439
 leave+2.427
Neg logit contributions
side-3.092
 sudden-2.954
amed-2.839
Meanwhile-2.684
 distance-2.648
Token together
Token ID1978
Feature activation-1.46e+00

Pos logit contributions
side+1.187
 sudden+1.126
Meanwhile+1.055
amed+1.050
 distance+1.014
Neg logit contributions
 I-1.257
 give-1.230
 share-1.146
 Sorry-1.143
 thank-1.139
Token.
Token ID13
Feature activation-6.69e-01

Pos logit contributions
side+3.215
 sudden+3.204
amed+3.136
Meanwhile+3.046
J+2.859
Neg logit contributions
 I-2.524
 give-2.493
 leave-2.264
 thank-2.252
 share-2.233
Token
Token ID198
Feature activation-4.09e+00

Pos logit contributions
side+1.386
 sudden+1.339
amed+1.303
Meanwhile+1.210
aming+1.185
Neg logit contributions
 I-1.297
 give-1.197
 Sorry-1.168
 thank-1.122
 leave-1.099
Token
Token ID198
Feature activation-3.75e+00

Pos logit contributions
side+8.777
amed+7.996
 sudden+7.823
 distance+7.336
aming+7.182
Neg logit contributions
 I-9.518
 thank-8.716
 give-8.658
 ask-8.562
 share-8.275
TokenOne
Token ID3198
Feature activation-1.19e+00

Pos logit contributions
side+6.071
amed+5.783
 sudden+5.637
awed+5.575
 distance+5.520
Neg logit contributions
 I-9.805
 share-9.033
 give-8.950
 please-8.935
 Sorry-8.792
Token day
Token ID1110
Feature activation+1.09e+00

Pos logit contributions
side+2.231
 sudden+2.093
amed+2.037
Meanwhile+1.918
J+1.748
Neg logit contributions
 I-2.596
 give-2.498
 thank-2.352
 Sorry-2.303
 leave-2.302
Token,
Token ID11
Feature activation-4.98e-01

Pos logit contributions
 I+1.326
 give+1.312
 Sorry+1.214
 leave+1.176
 thank+1.158
Neg logit contributions
side-2.893
 sudden-2.794
amed-2.663
Meanwhile-2.599
 distance-2.528
Token Lucy
Token ID22162
Feature activation-4.05e-01

Pos logit contributions
side+1.218
amed+1.151
 sudden+1.143
Meanwhile+1.094
aming+1.060
Neg logit contributions
 I-0.756
 give-0.714
 Sorry-0.676
 thank-0.642
 leave-0.627
Token nice
Token ID3621
Feature activation+1.05e-01

Pos logit contributions
 I+0.088
 give+0.086
 Sorry+0.081
 leave+0.080
 thank+0.080
Neg logit contributions
side-0.086
 sudden-0.080
amed-0.075
Meanwhile-0.074
 distance-0.070
Token to
Token ID284
Feature activation+2.02e+00

Pos logit contributions
 I+0.186
 give+0.183
 Sorry+0.170
 leave+0.169
 thank+0.169
Neg logit contributions
side-0.225
 sudden-0.214
amed-0.206
Meanwhile-0.201
 distance-0.190
Token one
Token ID530
Feature activation-9.52e-01

Pos logit contributions
 I+3.736
 give+3.604
 Sorry+3.572
 thank+3.358
 leave+3.323
Neg logit contributions
side-3.963
 sudden-3.838
amed-3.525
 distance-3.375
 nearby-3.360
Token another
Token ID1194
Feature activation-1.38e+00

Pos logit contributions
side+2.019
amed+1.923
 sudden+1.878
Meanwhile+1.812
J+1.697
Neg logit contributions
 I-1.813
 give-1.786
 thank-1.673
 leave-1.660
 Sorry-1.636
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
side+2.988
amed+2.872
 sudden+2.664
Meanwhile+2.626
aming+2.554
Neg logit contributions
 give-2.620
 I-2.604
 thank-2.478
 ask-2.360
 leave-2.348
Token
Token ID198
Feature activation-4.09e+00

Pos logit contributions
side+1.099
 sudden+1.062
amed+1.029
Meanwhile+0.974
aming+0.942
Neg logit contributions
 I-0.943
 give-0.892
 Sorry-0.866
 thank-0.837
 leave-0.816
TokenThe
Token ID464
Feature activation-1.72e+00

Pos logit contributions
side+7.366
 sudden+6.678
 distance+6.633
amed+6.269
awed+5.978
Neg logit contributions
 thank-10.184
 I-10.106
 give-9.828
 share-9.594
 listen-9.210
Token End
Token ID5268
Feature activation-4.50e-01

Pos logit contributions
side+3.600
amed+3.210
 sudden+3.048
Meanwhile+2.998
J+2.936
Neg logit contributions
 I-3.549
 give-3.523
 thank-3.474
 leave-3.222
 Sorry-3.215
Token.
Token ID13
Feature activation-6.35e-01

Pos logit contributions
side+1.005
 sudden+0.947
amed+0.940
Meanwhile+0.896
 distance+0.859
Neg logit contributions
 I-0.772
 give-0.737
 Sorry-0.697
 thank-0.686
 leave-0.655
Token<|endoftext|>
Token ID50256
Feature activation-1.71e+00

Pos logit contributions
side+1.145
 sudden+1.093
amed+1.039
Meanwhile+0.970
 distance+0.935
Neg logit contributions
 I-1.392
 give-1.320
 Sorry-1.274
 thank-1.246
 leave-1.214
TokenOnce
Token ID7454
Feature activation-2.49e-01

Pos logit contributions
side+3.302
 sudden+3.251
amed+3.046
Meanwhile+2.989
Days+2.929
Neg logit contributions
 I-3.644
 give-3.278
 Sorry-3.122
 thank-3.025
 ask-3.000
Token really
Token ID1107
Feature activation-2.24e-01

Pos logit contributions
side+3.804
 sudden+3.521
Meanwhile+3.429
amed+3.410
J+3.266
Neg logit contributions
 give-3.432
 I-3.368
 leave-3.176
 thank-3.170
 share-3.072
Token enjoyed
Token ID8359
Feature activation-1.46e+00

Pos logit contributions
side+0.444
 sudden+0.421
amed+0.404
Meanwhile+0.396
J+0.367
Neg logit contributions
 I-0.426
 give-0.424
 thank-0.397
 Sorry-0.394
 leave-0.391
Token themselves
Token ID2405
Feature activation-1.18e+00

Pos logit contributions
side+3.470
amed+3.316
 sudden+3.281
Meanwhile+3.258
 distance+3.024
Neg logit contributions
 I-2.315
 give-2.263
 thank-2.134
 share-2.051
 leave-2.015
Token.
Token ID13
Feature activation-7.32e-01

Pos logit contributions
side+2.651
 sudden+2.532
amed+2.473
Meanwhile+2.457
J+2.292
Neg logit contributions
 give-2.028
 I-2.019
 leave-1.853
 thank-1.852
 share-1.803
Token\n
Token ID198
Feature activation-3.01e+00

Pos logit contributions
side+1.536
 sudden+1.470
amed+1.414
Meanwhile+1.339
 distance+1.304
Neg logit contributions
 I-1.386
 give-1.290
 Sorry-1.256
 thank-1.214
 leave-1.170
Token\n
Token ID198
Feature activation-4.09e+00

Pos logit contributions
side+7.304
amed+6.542
 sudden+6.457
 distance+6.284
aming+6.038
Neg logit contributions
 I-5.431
 give-5.071
 thank-5.069
 ask-4.744
 share-4.664
TokenFinally
Token ID11158
Feature activation+1.64e+00

Pos logit contributions
side+6.460
 doorway+5.821
awed+5.750
amed+5.738
 distance+5.701
Neg logit contributions
 I-10.851
 share-10.307
 give-10.295
 listen-9.849
 thank-9.834
Token the
Token ID262
Feature activation-5.79e-01

Pos logit contributions
 I+2.083
 give+2.066
 Sorry+1.977
 leave+1.843
 thank+1.807
Neg logit contributions
side-4.215
 sudden-4.086
amed-3.805
Meanwhile-3.755
 distance-3.669
Token frog
Token ID21264
Feature activation-1.27e+00

Pos logit contributions
side+1.220
amed+1.111
 sudden+1.096
Meanwhile+1.056
J+1.017
Neg logit contributions
 I-1.095
 give-1.087
 thank-1.024
 Sorry-1.010
 leave-0.996
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
side+2.660
 sudden+2.483
amed+2.402
Meanwhile+2.285
J+2.199
Neg logit contributions
 give-2.450
 I-2.406
 leave-2.271
 thank-2.251
 ask-2.129
Token the
Token ID262
Feature activation-8.48e-01

Pos logit contributions
side+4.067
 sudden+3.770
amed+3.761
Meanwhile+3.553
aming+3.516
Neg logit contributions
 I-2.475
 give-2.465
 thank-2.242
 leave-2.238
 Sorry-2.201
Token that
Token ID326
Feature activation+5.12e-01

Pos logit contributions
 I+1.458
 give+1.438
 Sorry+1.366
 thank+1.318
 leave+1.315
Neg logit contributions
side-1.999
 sudden-1.905
amed-1.815
Meanwhile-1.760
 distance-1.709
Token everyone
Token ID2506
Feature activation+1.19e+00

Pos logit contributions
 I+0.821
 give+0.806
 Sorry+0.756
 thank+0.737
 leave+0.735
Neg logit contributions
side-1.167
 sudden-1.113
amed-1.070
Meanwhile-1.033
 distance-0.999
Token was
Token ID373
Feature activation+1.42e-01

Pos logit contributions
 I+2.341
 give+2.315
 Sorry+2.238
 thank+2.164
 leave+2.162
Neg logit contributions
side-2.231
 sudden-2.112
amed-2.003
Meanwhile-1.911
 distance-1.855
Token happy
Token ID3772
Feature activation+8.27e-01

Pos logit contributions
 I+0.254
 give+0.253
 thank+0.234
 Sorry+0.233
 leave+0.233
Neg logit contributions
side-0.297
 sudden-0.280
amed-0.265
Meanwhile-0.259
 distance-0.250
Token.
Token ID13
Feature activation-3.82e-01

Pos logit contributions
 I+1.541
 give+1.524
 Sorry+1.454
 thank+1.413
 leave+1.412
Neg logit contributions
side-1.655
 sudden-1.569
amed-1.490
Meanwhile-1.434
 distance-1.386
Token\n
Token ID198
Feature activation-4.08e+00

Pos logit contributions
side+0.805
 sudden+0.775
amed+0.749
Meanwhile+0.699
 distance+0.688
Neg logit contributions
 I-0.711
 give-0.660
 Sorry-0.648
 thank-0.618
 leave-0.603
Token\n
Token ID198
Feature activation-3.77e+00

Pos logit contributions
side+9.494
 sudden+8.516
amed+8.494
 distance+8.231
awed+8.052
Neg logit contributions
 I-8.908
 thank-8.177
 give-7.945
 ask-7.867
 please-7.520
TokenOne
Token ID3198
Feature activation-1.28e+00

Pos logit contributions
side+5.830
awed+5.412
 sudden+5.374
 distance+5.265
amed+5.183
Neg logit contributions
 I-10.288
 give-9.422
 please-9.260
 share-9.252
 Sorry-9.199
Token day
Token ID1110
Feature activation+8.43e-01

Pos logit contributions
side+2.382
amed+2.200
 sudden+2.138
Meanwhile+2.032
awed+1.869
Neg logit contributions
 I-2.834
 give-2.731
 thank-2.569
 Sorry-2.523
 leave-2.491
Token,
Token ID11
Feature activation-5.94e-01

Pos logit contributions
 I+1.070
 give+1.051
 Sorry+0.977
 leave+0.940
 thank+0.936
Neg logit contributions
side-2.192
 sudden-2.107
amed-2.021
Meanwhile-1.967
 distance-1.916
Token he
Token ID339
Feature activation-2.25e-01

Pos logit contributions
side+1.370
amed+1.294
 sudden+1.286
Meanwhile+1.226
aming+1.185
Neg logit contributions
 I-0.980
 give-0.937
 Sorry-0.882
 thank-0.854
 leave-0.833
Token and
Token ID290
Feature activation+4.74e-01

Pos logit contributions
side+0.370
 sudden+0.347
amed+0.341
Meanwhile+0.325
J+0.307
Neg logit contributions
 I-0.343
 give-0.334
 thank-0.310
 leave-0.310
 Sorry-0.303
Token asked
Token ID1965
Feature activation-1.73e-01

Pos logit contributions
 I+0.821
 give+0.808
 Sorry+0.759
 thank+0.743
 leave+0.739
Neg logit contributions
side-1.027
 sudden-0.971
amed-0.930
Meanwhile-0.894
 distance-0.869
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
side+0.448
 sudden+0.431
amed+0.414
Meanwhile+0.402
 distance+0.395
Neg logit contributions
 I-0.238
 give-0.233
 thank-0.211
 leave-0.207
 Sorry-0.207
Token first
Token ID717
Feature activation-5.46e-01

Pos logit contributions
side+3.044
amed+2.710
 sudden+2.680
Meanwhile+2.523
aming+2.517
Neg logit contributions
 give-2.654
 I-2.639
 thank-2.423
 ask-2.399
 leave-2.394
Token,
Token ID11
Feature activation-2.67e-01

Pos logit contributions
side+1.110
amed+1.017
 sudden+1.014
Meanwhile+0.949
aming+0.928
Neg logit contributions
 give-1.067
 I-1.067
 leave-0.977
 thank-0.972
 Sorry-0.972
Token �
Token ID6184
Feature activation+2.88e+00

Pos logit contributions
side+0.660
 sudden+0.616
amed+0.609
Meanwhile+0.572
 distance+0.570
Neg logit contributions
 I-0.405
 give-0.388
 thank-0.353
 ask-0.341
 Sorry-0.340
Token�
Token ID95
Feature activation+1.35e+00

Pos logit contributions
 give+6.136
 I+6.131
 Sorry+5.952
 leave+5.860
 thank+5.828
Neg logit contributions
side-5.159
 sudden-4.584
amed-4.358
J-4.248
 nearby-4.248
Token€
Token ID26391
Feature activation+1.51e+00

Pos logit contributions
 I+2.741
 give+2.731
 Sorry+2.628
 thank+2.575
 leave+2.563
Neg logit contributions
side-2.512
 sudden-2.296
amed-2.256
Meanwhile-2.147
 distance-2.045
Token�
Token ID129
Feature activation+8.69e-01

Pos logit contributions
 give+3.196
 I+3.148
 Sorry+3.011
 leave+2.997
 thank+2.987
Neg logit contributions
side-2.682
 sudden-2.512
amed-2.361
Meanwhile-2.237
J-2.161
Token�
Token ID241
Feature activation-1.84e+00

Pos logit contributions
 I+1.706
 give+1.680
 Sorry+1.610
 thank+1.567
 leave+1.560
Neg logit contributions
side-1.661
 sudden-1.564
amed-1.492
Meanwhile-1.433
 distance-1.375
Tokenwhat
Token ID10919
Feature activation+1.18e+00

Pos logit contributions
side+3.750
amed+3.589
 sudden+3.437
Meanwhile+3.193
 distance+3.151
Neg logit contributions
 I-3.897
 give-3.446
 Sorry-3.383
 ask-3.190
 thank-3.186
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
 sudden+2.731
side+2.669
amed+2.514
Meanwhile+2.509
Days+2.331
Neg logit contributions
 I-1.878
 give-1.766
 leave-1.614
 Sorry-1.600
 share-1.595
Token Tim
Token ID5045
Feature activation-1.50e+00

Pos logit contributions
 I+1.245
 give+1.227
 Sorry+1.166
 thank+1.137
 leave+1.134
Neg logit contributions
side-1.354
 sudden-1.283
amed-1.222
Meanwhile-1.176
 distance-1.135
Token was
Token ID373
Feature activation+2.22e-01

Pos logit contributions
side+3.097
 sudden+2.769
amed+2.610
Meanwhile+2.595
J+2.508
Neg logit contributions
 I-2.971
 give-2.946
 ask-2.776
 leave-2.768
 share-2.715
Token so
Token ID523
Feature activation+8.98e-01

Pos logit contributions
 I+0.434
 give+0.429
 Sorry+0.401
 thank+0.397
 leave+0.397
Neg logit contributions
side-0.436
 sudden-0.407
amed-0.382
Meanwhile-0.373
 distance-0.357
Token happy
Token ID3772
Feature activation+1.08e+00

Pos logit contributions
 I+1.912
 give+1.892
 Sorry+1.807
 leave+1.773
 thank+1.766
Neg logit contributions
side-1.579
 sudden-1.494
amed-1.412
Meanwhile-1.343
 distance-1.289
Token to
Token ID284
Feature activation+2.79e+00

Pos logit contributions
 I+2.072
 give+2.069
 Sorry+1.994
 leave+1.935
 thank+1.933
Neg logit contributions
side-2.113
 sudden-2.004
amed-1.876
Meanwhile-1.830
 distance-1.761
Token be
Token ID307
Feature activation+1.25e+00

Pos logit contributions
 I+3.482
 Sorry+3.401
 give+3.019
 thank+2.779
 sorry+2.708
Neg logit contributions
side-7.250
 sudden-6.912
 distance-6.729
amed-6.713
 corner-6.556
Token on
Token ID319
Feature activation+2.52e-01

Pos logit contributions
 I+2.591
 give+2.549
 Sorry+2.481
 thank+2.407
 leave+2.385
Neg logit contributions
side-2.279
 sudden-2.146
amed-2.094
Meanwhile-1.939
 distance-1.888
Token the
Token ID262
Feature activation-5.79e-01

Pos logit contributions
 I+0.447
 give+0.441
 Sorry+0.411
 leave+0.406
 thank+0.406
Neg logit contributions
side-0.537
 sudden-0.509
amed-0.495
Meanwhile-0.472
 distance-0.454
Token boat
Token ID8848
Feature activation-3.82e-01

Pos logit contributions
side+1.181
amed+1.091
 sudden+1.082
Meanwhile+1.029
J+1.005
Neg logit contributions
 I-1.098
 give-1.095
 thank-1.020
 leave-1.016
 Sorry-1.005
Token.
Token ID13
Feature activation+9.12e-01

Pos logit contributions
side+0.850
 sudden+0.838
amed+0.806
Meanwhile+0.774
J+0.748
Neg logit contributions
 I-0.633
 give-0.631
 leave-0.580
 thank-0.564
 Sorry-0.556
Token her
Token ID607
Feature activation+9.59e-01

Pos logit contributions
side+0.002
 sudden+0.002
amed+0.002
Meanwhile+0.002
 distance+0.002
Neg logit contributions
 I-0.002
 give-0.002
 leave-0.001
 thank-0.001
 Sorry-0.001
Token head
Token ID1182
Feature activation-1.01e+00

Pos logit contributions
 I+1.408
 give+1.376
 Sorry+1.308
 thank+1.246
 leave+1.240
Neg logit contributions
side-2.328
 sudden-2.241
amed-2.142
Meanwhile-2.078
 distance-2.027
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
 sudden+2.278
side+2.224
amed+2.100
Meanwhile+2.059
 distance+1.956
Neg logit contributions
 give-1.685
 I-1.678
 leave-1.549
 thank-1.532
 share-1.445
Token She
Token ID1375
Feature activation-1.40e-01

Pos logit contributions
 give+2.398
 I+2.336
 Sorry+2.243
 leave+2.196
 thank+2.181
Neg logit contributions
side-3.523
 sudden-3.341
amed-3.176
Meanwhile-3.111
J-2.973
Token did
Token ID750
Feature activation+1.40e+00

Pos logit contributions
side+0.306
 sudden+0.279
amed+0.271
Meanwhile+0.260
 distance+0.258
Neg logit contributions
 I-0.249
 give-0.246
 Sorry-0.225
 thank-0.224
 leave-0.224
Token not
Token ID407
Feature activation+2.85e+00

Pos logit contributions
 I+1.640
 give+1.579
 Sorry+1.566
 leave+1.412
 thank+1.400
Neg logit contributions
side-3.786
 sudden-3.631
amed-3.441
Meanwhile-3.413
 distance-3.305
Token want
Token ID765
Feature activation+7.25e-01

Pos logit contributions
 Sorry+3.464
 I+3.277
 give+2.674
 thank+2.532
 leave+2.488
Neg logit contributions
side-7.562
 distance-7.283
 sudden-7.256
amed-7.147
 nearby-7.057
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
 I+1.195
 give+1.177
 Sorry+1.113
 thank+1.080
 leave+1.078
Neg logit contributions
side-1.620
 sudden-1.544
amed-1.475
Meanwhile-1.427
 distance-1.383
Token pill
Token ID9582
Feature activation+2.43e-01

Pos logit contributions
 I+0.262
 give+0.262
 thank+0.243
 Sorry+0.243
 leave+0.242
Neg logit contributions
side-0.240
 sudden-0.225
amed-0.219
Meanwhile-0.206
aming-0.195
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 I+0.418
 give+0.413
 Sorry+0.382
 leave+0.382
 thank+0.379
Neg logit contributions
side-0.521
 sudden-0.507
amed-0.479
Meanwhile-0.466
 distance-0.446
Token She
Token ID1375
Feature activation-4.18e-01

Pos logit contributions
 give+1.997
 I+1.944
 Sorry+1.847
 leave+1.816
 thank+1.798
Neg logit contributions
side-3.271
 sudden-3.127
amed-2.968
Meanwhile-2.912
J-2.796
Token not
Token ID407
Feature activation+9.04e-01

Pos logit contributions
 I+1.129
 give+1.113
 Sorry+1.051
 leave+1.027
 thank+1.027
Neg logit contributions
side-1.332
 sudden-1.265
amed-1.203
Meanwhile-1.162
 distance-1.123
Token happy
Token ID3772
Feature activation+7.21e-01

Pos logit contributions
 I+1.779
 give+1.753
 Sorry+1.688
 thank+1.636
 leave+1.631
Neg logit contributions
side-1.724
 sudden-1.636
amed-1.557
Meanwhile-1.488
 distance-1.438
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
 I+1.295
 give+1.277
 Sorry+1.212
 thank+1.180
 leave+1.178
Neg logit contributions
side-1.508
 sudden-1.433
amed-1.364
Meanwhile-1.316
 distance-1.272
Token They
Token ID1119
Feature activation-4.30e-01

Pos logit contributions
 I+0.404
 give+0.390
 Sorry+0.372
 thank+0.358
 leave+0.353
Neg logit contributions
side-0.605
 sudden-0.578
amed-0.558
Meanwhile-0.536
 distance-0.522
Token wanted
Token ID2227
Feature activation-8.97e-02

Pos logit contributions
side+0.954
 sudden+0.887
amed+0.853
Meanwhile+0.826
 distance+0.795
Neg logit contributions
 give-0.752
 I-0.732
 leave-0.688
 thank-0.686
 Sorry-0.672
Token to
Token ID284
Feature activation+2.73e+00

Pos logit contributions
side+0.214
 sudden+0.205
amed+0.200
Meanwhile+0.192
aming+0.183
Neg logit contributions
 give-0.138
 I-0.137
 leave-0.126
 thank-0.123
 Sorry-0.122
Token stay
Token ID2652
Feature activation+9.35e-01

Pos logit contributions
 I+3.352
 Sorry+3.312
 give+2.945
 thank+2.752
 sorry+2.732
Neg logit contributions
side-6.966
 sudden-6.787
 distance-6.552
amed-6.383
 corner-6.279
Token home
Token ID1363
Feature activation+1.41e-01

Pos logit contributions
 I+1.909
 give+1.877
 Sorry+1.807
 thank+1.758
 leave+1.743
Neg logit contributions
side-1.729
 sudden-1.639
amed-1.553
Meanwhile-1.477
 distance-1.431
Token and
Token ID290
Feature activation+2.07e+00

Pos logit contributions
 I+0.262
 give+0.261
 leave+0.243
 Sorry+0.241
 thank+0.240
Neg logit contributions
side-0.280
 sudden-0.271
amed-0.257
Meanwhile-0.248
 distance-0.234
Token play
Token ID711
Feature activation-5.45e-02

Pos logit contributions
 I+2.826
 give+2.687
 Sorry+2.675
 thank+2.453
 leave+2.391
Neg logit contributions
side-5.135
 sudden-4.915
amed-4.627
 distance-4.621
Meanwhile-4.519
Token with
Token ID351
Feature activation-5.96e-01

Pos logit contributions
side+0.110
 sudden+0.106
amed+0.100
Meanwhile+0.098
 distance+0.093
Neg logit contributions
 I-0.102
 give-0.101
 thank-0.093
 leave-0.093
 Sorry-0.093
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 I+0.719
 give+0.710
 Sorry+0.668
 leave+0.660
 thank+0.660
Neg logit contributions
side-0.697
 sudden-0.666
amed-0.627
Meanwhile-0.605
 distance-0.580
Token\n
Token ID198
Feature activation-2.67e+00

Pos logit contributions
 give+1.798
 I+1.790
 Sorry+1.702
 leave+1.666
 thank+1.654
Neg logit contributions
side-2.103
 sudden-2.004
amed-1.895
Meanwhile-1.846
 distance-1.777
Token\n
Token ID198
Feature activation-1.87e+00

Pos logit contributions
side+6.163
amed+5.713
 sudden+5.642
aming+5.088
eled+5.071
Neg logit contributions
 I-5.344
 ask-4.686
 give-4.659
 thank-4.499
 please-4.297
Token"
Token ID1
Feature activation-1.27e+00

Pos logit contributions
side+3.554
 sudden+3.297
amed+3.223
eled+2.946
 distance+2.944
Neg logit contributions
 I-4.127
 give-3.720
 Sorry-3.655
 ask-3.614
 share-3.517
TokenWhat
Token ID2061
Feature activation+2.54e+00

Pos logit contributions
side+2.363
 sudden+2.236
amed+2.088
Meanwhile+1.951
 distance+1.923
Neg logit contributions
 I-2.730
 give-2.628
 Sorry-2.514
 thank-2.473
 ask-2.401
Token are
Token ID389
Feature activation+2.83e+00

Pos logit contributions
 give+4.203
 I+4.091
 Sorry+4.076
 leave+4.016
 thank+3.830
Neg logit contributions
side-5.165
 sudden-5.001
amed-4.799
J-4.717
 nearby-4.637
Token you
Token ID345
Feature activation+1.66e+00

Pos logit contributions
 give+3.122
 leave+2.914
 Sorry+2.610
 I+2.597
 thank+2.543
Neg logit contributions
 sudden-7.833
side-7.760
Meanwhile-7.196
amed-7.171
J-7.120
Token doing
Token ID1804
Feature activation+1.47e+00

Pos logit contributions
 I+2.702
 give+2.653
 Sorry+2.579
 thank+2.463
 leave+2.456
Neg logit contributions
side-3.670
 sudden-3.570
amed-3.459
Meanwhile-3.286
aming-3.196
Token here
Token ID994
Feature activation+1.04e+00

Pos logit contributions
 I+1.884
 give+1.870
 Sorry+1.790
 leave+1.690
 thank+1.685
Neg logit contributions
side-3.734
 sudden-3.597
amed-3.456
Meanwhile-3.346
 nearby-3.255
Token,
Token ID11
Feature activation+5.59e-01

Pos logit contributions
 I+0.747
 give+0.729
 Sorry+0.657
 thank+0.592
 leave+0.590
Neg logit contributions
side-3.259
 sudden-3.148
amed-3.057
Meanwhile-2.983
 distance-2.925
Token children
Token ID1751
Feature activation+1.50e+00

Pos logit contributions
 I+0.893
 give+0.876
 Sorry+0.821
 thank+0.802
 leave+0.799
Neg logit contributions
side-1.287
 sudden-1.221
amed-1.175
Meanwhile-1.131
 distance-1.098
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
 give+0.182
 I+0.181
 leave+0.168
 Sorry+0.166
 thank+0.163
Neg logit contributions
side-0.223
 sudden-0.212
amed-0.204
Meanwhile-0.197
 distance-0.189
Token She
Token ID1375
Feature activation-1.78e-01

Pos logit contributions
 give+2.228
 I+2.184
 Sorry+2.077
 leave+2.063
 thank+2.033
Neg logit contributions
side-3.454
 sudden-3.296
amed-3.117
Meanwhile-3.078
 distance-2.904
Token told
Token ID1297
Feature activation+1.37e+00

Pos logit contributions
side+0.380
 sudden+0.342
amed+0.327
 distance+0.321
Meanwhile+0.314
Neg logit contributions
 give-0.326
 I-0.325
 thank-0.298
 Sorry-0.295
 leave-0.293
Token Lily
Token ID20037
Feature activation+1.30e+00

Pos logit contributions
 I+2.190
 give+2.113
 Sorry+2.088
 thank+1.944
 leave+1.942
Neg logit contributions
side-3.091
 sudden-2.980
amed-2.781
Meanwhile-2.740
J-2.635
Token that
Token ID326
Feature activation+1.02e+00

Pos logit contributions
 I+2.080
 give+2.034
 Sorry+2.005
 thank+1.874
 leave+1.870
Neg logit contributions
side-2.941
 sudden-2.804
amed-2.671
Meanwhile-2.614
 distance-2.485
Token they
Token ID484
Feature activation+1.62e+00

Pos logit contributions
 I+1.593
 give+1.577
 Sorry+1.506
 thank+1.451
 leave+1.450
Neg logit contributions
side-2.321
 sudden-2.224
amed-2.103
Meanwhile-2.061
 distance-1.989
Token needed
Token ID2622
Feature activation+6.75e-01

Pos logit contributions
 I+3.183
 give+3.114
 Sorry+3.076
 thank+2.973
 leave+2.898
Neg logit contributions
side-3.051
 sudden-2.945
amed-2.780
Meanwhile-2.631
 nowhere-2.509
Token to
Token ID284
Feature activation+2.65e+00

Pos logit contributions
 I+1.066
 give+1.049
 Sorry+0.986
 thank+0.958
 leave+0.956
Neg logit contributions
side-1.556
 sudden-1.486
amed-1.423
Meanwhile-1.377
 distance-1.336
Token use
Token ID779
Feature activation-3.33e-02

Pos logit contributions
 I+3.805
 Sorry+3.680
 give+3.292
 thank+3.259
 sorry+2.940
Neg logit contributions
side-6.393
 sudden-6.190
amed-5.766
Meanwhile-5.730
 nearby-5.631
Token a
Token ID257
Feature activation+8.47e-03

Pos logit contributions
side+0.074
 sudden+0.070
amed+0.068
Meanwhile+0.066
 distance+0.063
Neg logit contributions
 I-0.056
 give-0.055
 leave-0.051
 Sorry-0.050
 thank-0.050
Token needle
Token ID17598
Feature activation-1.27e-01

Pos logit contributions
 I+0.018
 give+0.018
 Sorry+0.016
 thank+0.016
 leave+0.016
Neg logit contributions
side-0.015
 sudden-0.014
amed-0.014
Meanwhile-0.013
aming-0.013
Token not
Token ID407
Feature activation+2.49e+00

Pos logit contributions
 I+4.168
 Sorry+4.078
 give+3.955
 leave+3.796
 thank+3.655
Neg logit contributions
side-5.002
 sudden-4.956
amed-4.668
Meanwhile-4.577
 nearby-4.450
Token be
Token ID307
Feature activation+2.99e-01

Pos logit contributions
 I+4.293
 Sorry+4.210
 give+3.961
 thank+3.931
 leave+3.730
Neg logit contributions
side-5.211
 sudden-5.175
 distance-4.757
 nearby-4.697
amed-4.683
Token afraid
Token ID7787
Feature activation+8.98e-01

Pos logit contributions
 I+0.518
 give+0.515
 Sorry+0.474
 leave+0.472
 thank+0.466
Neg logit contributions
side-0.650
 sudden-0.609
amed-0.578
Meanwhile-0.564
 distance-0.546
Token of
Token ID286
Feature activation+7.27e-02

Pos logit contributions
 I+1.668
 give+1.655
 Sorry+1.587
 thank+1.536
 leave+1.531
Neg logit contributions
side-1.792
 sudden-1.708
amed-1.607
Meanwhile-1.556
 distance-1.506
Token it
Token ID340
Feature activation+7.63e-01

Pos logit contributions
 I+0.128
 give+0.125
 leave+0.115
 thank+0.114
 Sorry+0.114
Neg logit contributions
side-0.159
 sudden-0.148
amed-0.147
Meanwhile-0.139
 distance-0.133
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
 I+1.390
 give+1.369
 Sorry+1.304
 thank+1.269
 leave+1.265
Neg logit contributions
side-1.571
 sudden-1.490
amed-1.418
Meanwhile-1.367
 distance-1.321
Token Lily
Token ID20037
Feature activation-5.25e-01

Pos logit contributions
 give+2.063
 I+2.032
 Sorry+1.939
 leave+1.883
 thank+1.876
Neg logit contributions
side-3.512
 sudden-3.346
amed-3.179
Meanwhile-3.149
 distance-2.967
Token listened
Token ID16399
Feature activation-3.81e-01

Pos logit contributions
side+1.139
 sudden+1.040
amed+0.987
 distance+0.971
Meanwhile+0.947
Neg logit contributions
 give-0.937
 I-0.922
 leave-0.851
 ask-0.841
 thank-0.838
Token to
Token ID284
Feature activation+8.52e-01

Pos logit contributions
side+0.798
 sudden+0.773
amed+0.739
 distance+0.707
Meanwhile+0.700
Neg logit contributions
 I-0.666
 give-0.662
 thank-0.608
 leave-0.592
 ask-0.588
Token her
Token ID607
Feature activation+4.20e-01

Pos logit contributions
 I+1.535
 give+1.510
 Sorry+1.447
 thank+1.395
 leave+1.395
Neg logit contributions
side-1.777
 sudden-1.695
amed-1.608
Meanwhile-1.556
 distance-1.505
Token mom
Token ID1995
Feature activation-4.77e-02

Pos logit contributions
 I+0.614
 give+0.612
 Sorry+0.562
 thank+0.555
 leave+0.553
Neg logit contributions
side-1.034
 sudden-0.983
amed-0.952
Meanwhile-0.919
 distance-0.893
TokenCan
Token ID6090
Feature activation+2.40e+00

Pos logit contributions
side+3.283
 sudden+2.932
amed+2.863
Meanwhile+2.574
 distance+2.553
Neg logit contributions
 I-4.934
 give-4.707
 Sorry-4.567
 thank-4.378
 ask-4.343
Token we
Token ID356
Feature activation+3.41e+00

Pos logit contributions
 give+1.047
 Sorry+0.808
 leave+0.749
 thank+0.721
 I+0.662
Neg logit contributions
side-7.969
 sudden-7.799
Meanwhile-7.421
 nowhere-7.407
amed-7.251
Token have
Token ID423
Feature activation+1.76e+00

Pos logit contributions
 Sorry+5.122
 I+4.942
 give+4.557
 sorry+4.516
 thank+4.438
Neg logit contributions
 sudden-7.568
side-7.461
 nearby-7.183
 distance-7.119
 corner-6.859
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
 I+3.525
 give+3.498
 Sorry+3.449
 leave+3.294
 thank+3.248
Neg logit contributions
side-3.175
 sudden-3.114
amed-2.782
Meanwhile-2.702
 distance-2.623
Token rabbit
Token ID22746
Feature activation+1.01e+00

Pos logit contributions
side+0.251
 sudden+0.235
amed+0.230
Meanwhile+0.219
J+0.209
Neg logit contributions
 I-0.234
 give-0.233
 thank-0.217
 Sorry-0.215
 leave-0.215
Token,
Token ID11
Feature activation+1.20e+00

Pos logit contributions
 I+1.070
 give+1.045
 Sorry+0.977
 thank+0.916
 leave+0.907
Neg logit contributions
side-2.810
 sudden-2.700
amed-2.614
Meanwhile-2.539
 distance-2.492
Token Mom
Token ID11254
Feature activation+2.00e+00

Pos logit contributions
 I+1.516
 give+1.507
 Sorry+1.435
 leave+1.349
 thank+1.339
Neg logit contributions
side-3.025
 sudden-2.936
amed-2.798
Meanwhile-2.743
 distance-2.680
Token?"
Token ID1701
Feature activation+5.22e-01

Pos logit contributions
 Sorry+3.148
 give+3.118
 I+3.111
 thank+2.954
 leave+2.851
Neg logit contributions
side-4.488
 sudden-4.189
amed-4.100
Meanwhile-3.967
 nearby-3.897
Token they
Token ID484
Feature activation+5.61e-01

Pos logit contributions
 I+0.689
 give+0.670
 Sorry+0.622
 thank+0.601
 leave+0.595
Neg logit contributions
side-1.352
 sudden-1.290
amed-1.251
Meanwhile-1.207
 distance-1.177
Token asked
Token ID1965
Feature activation-3.32e-01

Pos logit contributions
 I+0.979
 give+0.966
 Sorry+0.911
 thank+0.890
 leave+0.888
Neg logit contributions
side-1.201
 sudden-1.134
amed-1.086
Meanwhile-1.046
 distance-1.014
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
side+0.759
 sudden+0.717
amed+0.693
Meanwhile+0.681
 distance+0.653
Neg logit contributions
 give-0.544
 I-0.544
 leave-0.498
 thank-0.495
 Sorry-0.494
Token made
Token ID925
Feature activation-1.61e+00

Pos logit contributions
side+2.104
 sudden+1.834
 distance+1.802
amed+1.773
Meanwhile+1.707
Neg logit contributions
 give-1.593
 I-1.584
 thank-1.432
 ask-1.409
 leave-1.392
Token the
Token ID262
Feature activation-9.65e-01

Pos logit contributions
side+4.069
 sudden+3.709
amed+3.647
 distance+3.617
Meanwhile+3.548
Neg logit contributions
 give-2.528
 I-2.452
 thank-2.301
 leave-2.234
 share-2.169
Token rope
Token ID17182
Feature activation-1.67e-01

Pos logit contributions
side+2.021
 sudden+1.811
amed+1.801
J+1.672
Meanwhile+1.668
Neg logit contributions
 I-1.864
 give-1.829
 thank-1.715
 leave-1.675
 ask-1.661
Token go
Token ID467
Feature activation-6.59e-01

Pos logit contributions
side+0.379
 sudden+0.365
amed+0.344
Meanwhile+0.334
 distance+0.322
Neg logit contributions
 give-0.277
 I-0.273
 leave-0.254
 thank-0.249
 Sorry-0.245
Token over
Token ID625
Feature activation-2.58e+00

Pos logit contributions
side+1.191
 sudden+1.156
amed+1.072
Meanwhile+1.036
aming+0.985
Neg logit contributions
 I-1.384
 give-1.383
 leave-1.281
 thank-1.249
 Sorry-1.247
Token and
Token ID290
Feature activation+7.13e-01

Pos logit contributions
side+6.899
amed+6.810
 sudden+6.502
Meanwhile+6.442
 distance+6.171
Neg logit contributions
 I-3.850
 give-3.707
 share-3.190
 leave-3.156
 thank-3.097
Token under
Token ID739
Feature activation-2.14e+00

Pos logit contributions
 I+1.310
 give+1.292
 Sorry+1.228
 thank+1.196
 leave+1.194
Neg logit contributions
side-1.458
 sudden-1.384
amed-1.317
Meanwhile-1.269
 distance-1.225
Token and
Token ID290
Feature activation-4.28e-02

Pos logit contributions
amed+5.798
side+5.643
 sudden+5.320
Meanwhile+5.278
aming+5.216
Neg logit contributions
 I-3.178
 give-2.943
 thank-2.616
 leave-2.612
 share-2.364
Token around
Token ID1088
Feature activation-9.23e-01

Pos logit contributions
side+0.091
 sudden+0.087
amed+0.083
Meanwhile+0.080
 distance+0.077
Neg logit contributions
 I-0.076
 give-0.076
 Sorry-0.070
 leave-0.069
 thank-0.069
Token,
Token ID11
Feature activation-6.38e-01

Pos logit contributions
side+2.174
 sudden+2.095
amed+2.081
Meanwhile+2.002
aming+1.890
Neg logit contributions
 I-1.487
 give-1.471
 thank-1.335
 leave-1.329
 share-1.265
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
side+1.391
 sudden+1.323
amed+1.282
Meanwhile+1.222
J+1.190
Neg logit contributions
 I-1.123
 give-1.109
 leave-1.011
 thank-1.009
 Sorry-0.985
Token\n
Token ID198
Feature activation-1.77e+00

Pos logit contributions
side+0.156
 sudden+0.147
amed+0.137
Meanwhile+0.130
 distance+0.128
Neg logit contributions
 I-0.202
 give-0.196
 Sorry-0.188
 thank-0.184
 leave-0.182
Token\n
Token ID198
Feature activation-1.63e+00

Pos logit contributions
side+3.651
amed+3.451
 sudden+3.400
aming+3.187
 distance+3.119
Neg logit contributions
 I-3.432
 give-3.221
 ask-3.075
 thank-2.993
 Sorry-2.987
TokenL
Token ID43
Feature activation+2.65e-01

Pos logit contributions
side+1.990
amed+1.822
 sudden+1.777
 distance+1.691
 valleys+1.554
Neg logit contributions
 I-4.371
 give-4.239
 Sorry-4.115
 ask-4.088
 share-4.086
Tokenily
Token ID813
Feature activation-1.45e-01

Pos logit contributions
 I+0.549
 give+0.542
 Sorry+0.511
 thank+0.504
 leave+0.504
Neg logit contributions
side-0.487
 sudden-0.463
amed-0.423
Meanwhile-0.412
 distance-0.406
Token decided
Token ID3066
Feature activation+1.40e+00

Pos logit contributions
side+0.301
 sudden+0.282
amed+0.266
Meanwhile+0.258
 distance+0.257
Neg logit contributions
 give-0.265
 I-0.263
 leave-0.243
 thank-0.242
 Sorry-0.242
Token to
Token ID284
Feature activation+2.58e+00

Pos logit contributions
 I+2.423
 give+2.402
 Sorry+2.361
 leave+2.251
 thank+2.222
Neg logit contributions
side-2.977
 sudden-2.805
amed-2.642
Meanwhile-2.607
 distance-2.493
Token show
Token ID905
Feature activation+1.76e+00

Pos logit contributions
 I+3.396
 Sorry+3.233
 give+2.970
 sorry+2.892
 thank+2.860
Neg logit contributions
side-6.428
 sudden-6.131
amed-5.883
 distance-5.713
Meanwhile-5.705
Token her
Token ID607
Feature activation+1.76e-01

Pos logit contributions
 I+3.354
 Sorry+3.260
 give+3.252
 thank+3.045
 leave+3.024
Neg logit contributions
side-3.383
 sudden-3.240
amed-3.092
Meanwhile-2.950
J-2.846
Token coin
Token ID10752
Feature activation+3.37e-01

Pos logit contributions
 I+0.261
 give+0.260
 Sorry+0.238
 thank+0.236
 leave+0.234
Neg logit contributions
side-0.431
 sudden-0.405
amed-0.395
Meanwhile-0.383
 distance-0.371
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
 I+0.556
 give+0.548
 Sorry+0.504
 leave+0.499
 thank+0.498
Neg logit contributions
side-0.763
 sudden-0.735
amed-0.695
Meanwhile-0.676
 distance-0.652
Token her
Token ID607
Feature activation-2.36e-02

Pos logit contributions
 I+3.014
 give+2.948
 Sorry+2.896
 leave+2.767
 thank+2.744
Neg logit contributions
side-3.091
 sudden-2.955
amed-2.837
Meanwhile-2.700
 distance-2.602
Token it
Token ID340
Feature activation-1.75e+00

Pos logit contributions
side+1.571
 sudden+1.497
amed+1.439
Meanwhile+1.414
 distance+1.297
Neg logit contributions
 I-1.523
 give-1.500
 leave-1.406
 thank-1.357
 share-1.342
Token under
Token ID739
Feature activation-1.77e+00

Pos logit contributions
 sudden+3.840
side+3.723
amed+3.530
Meanwhile+3.427
J+3.337
Neg logit contributions
 give-3.204
 I-3.127
 leave-2.967
 share-2.791
 thank-2.783
Token a
Token ID257
Feature activation-1.32e+00

Pos logit contributions
side+4.431
amed+4.370
 sudden+4.273
Meanwhile+4.206
eh+3.944
Neg logit contributions
 I-2.736
 give-2.661
 leave-2.420
 thank-2.270
 share-2.253
Token bush
Token ID24484
Feature activation-1.19e+00

Pos logit contributions
amed+2.403
side+2.363
 sudden+2.236
Meanwhile+2.111
aming+2.104
Neg logit contributions
 I-2.951
 give-2.862
 thank-2.687
 leave-2.617
 ask-2.565
Token and
Token ID290
Feature activation+1.28e+00

Pos logit contributions
 sudden+2.884
amed+2.778
side+2.767
Meanwhile+2.593
aming+2.513
Neg logit contributions
 give-1.838
 I-1.827
 leave-1.673
 thank-1.599
 ask-1.550
Token try
Token ID1949
Feature activation-7.20e-01

Pos logit contributions
 I+1.724
 give+1.667
 Sorry+1.614
 thank+1.527
 leave+1.498
Neg logit contributions
side-3.195
 sudden-3.070
amed-2.930
Meanwhile-2.855
 distance-2.821
Token again
Token ID757
Feature activation-2.89e-01

Pos logit contributions
side+1.799
 sudden+1.736
amed+1.709
Meanwhile+1.638
aming+1.588
Neg logit contributions
 give-1.031
 I-1.020
 leave-0.918
 ask-0.891
 share-0.885
Token later
Token ID1568
Feature activation+2.31e-02

Pos logit contributions
side+0.649
 sudden+0.620
amed+0.611
Meanwhile+0.584
 distance+0.549
Neg logit contributions
 I-0.485
 give-0.481
 leave-0.439
 thank-0.429
 ask-0.428
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
 I+0.039
 give+0.038
 Sorry+0.035
 thank+0.035
 leave+0.035
Neg logit contributions
side-0.051
 sudden-0.049
amed-0.048
Meanwhile-0.046
 distance-0.044
Token\n
Token ID198
Feature activation-2.82e+00

Pos logit contributions
side+0.038
 sudden+0.037
amed+0.036
Meanwhile+0.034
 distance+0.034
Neg logit contributions
 I-0.020
 give-0.019
 Sorry-0.018
 thank-0.017
 leave-0.017
Token\n
Token ID198
Feature activation-2.79e+00

Pos logit contributions
side+6.761
amed+6.325
 sudden+6.148
 distance+5.853
aming+5.820
Neg logit contributions
 I-5.053
 give-4.705
 ask-4.556
 thank-4.396
 share-4.235
Token and
Token ID290
Feature activation+1.58e+00

Pos logit contributions
side+0.211
 sudden+0.198
amed+0.194
Meanwhile+0.183
J+0.172
Neg logit contributions
 I-0.223
 give-0.215
 leave-0.203
 Sorry-0.202
 thank-0.198
Token she
Token ID673
Feature activation-1.38e-01

Pos logit contributions
 give+2.978
 I+2.930
 Sorry+2.850
 thank+2.788
 leave+2.726
Neg logit contributions
side-3.090
 sudden-2.986
amed-2.764
Meanwhile-2.710
 distance-2.661
Token wanted
Token ID2227
Feature activation+6.28e-02

Pos logit contributions
side+0.297
 sudden+0.270
amed+0.266
Meanwhile+0.254
 distance+0.246
Neg logit contributions
 I-0.244
 give-0.240
 leave-0.223
 Sorry-0.222
 thank-0.217
Token her
Token ID607
Feature activation+6.98e-01

Pos logit contributions
 I+0.098
 give+0.096
 leave+0.088
 thank+0.086
 Sorry+0.086
Neg logit contributions
side-0.148
 sudden-0.142
amed-0.138
Meanwhile-0.131
aming-0.126
Token mom
Token ID1995
Feature activation+2.35e-01

Pos logit contributions
 I+1.217
 give+1.200
 Sorry+1.136
 thank+1.106
 leave+1.104
Neg logit contributions
side-1.493
 sudden-1.420
amed-1.355
Meanwhile-1.308
 distance-1.265
Tokenmy
Token ID1820
Feature activation+2.75e-01

Pos logit contributions
 I+0.477
 give+0.475
 leave+0.443
 Sorry+0.437
 thank+0.435
Neg logit contributions
side-0.437
 sudden-0.420
amed-0.394
Meanwhile-0.380
 distance-0.356
Token to
Token ID284
Feature activation+1.91e+00

Pos logit contributions
 I+0.435
 give+0.432
 leave+0.394
 Sorry+0.389
 thank+0.387
Neg logit contributions
side-0.633
 sudden-0.615
amed-0.585
Meanwhile-0.567
 distance-0.542
Token bring
Token ID2222
Feature activation+1.21e+00

Pos logit contributions
 I+2.366
 Sorry+2.312
 give+2.181
 thank+1.998
 leave+1.880
Neg logit contributions
side-4.914
 sudden-4.732
 distance-4.569
amed-4.453
Meanwhile-4.437
Token her
Token ID607
Feature activation-1.32e-01

Pos logit contributions
 I+2.255
 give+2.205
 Sorry+2.154
 thank+2.059
 leave+2.046
Neg logit contributions
side-2.424
 sudden-2.303
amed-2.173
Meanwhile-2.078
 distance-2.048
Token a
Token ID257
Feature activation-1.05e-01

Pos logit contributions
side+0.274
 sudden+0.261
amed+0.250
Meanwhile+0.245
J+0.230
Neg logit contributions
 give-0.241
 I-0.240
 leave-0.224
 thank-0.220
 Sorry-0.219
Token night
Token ID1755
Feature activation+1.14e-01

Pos logit contributions
side+0.206
 sudden+0.190
amed+0.189
Meanwhile+0.178
aming+0.170
Neg logit contributions
 give-0.207
 I-0.205
 thank-0.193
 Sorry-0.192
 leave-0.191
Token a
Token ID257
Feature activation+2.71e-01

Pos logit contributions
 I+3.000
 give+2.972
 Sorry+2.893
 leave+2.787
 thank+2.747
Neg logit contributions
side-2.804
 sudden-2.692
amed-2.464
Meanwhile-2.378
 distance-2.292
Token bath
Token ID7837
Feature activation+4.99e-01

Pos logit contributions
 I+0.490
 give+0.488
 Sorry+0.458
 thank+0.455
 leave+0.446
Neg logit contributions
side-0.563
 sudden-0.528
amed-0.516
Meanwhile-0.495
 distance-0.471
Token first
Token ID717
Feature activation+1.41e+00

Pos logit contributions
 I+0.806
 give+0.791
 Sorry+0.745
 thank+0.724
 leave+0.724
Neg logit contributions
side-1.124
 sudden-1.081
amed-1.027
Meanwhile-1.000
 distance-0.966
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 I+1.492
 give+1.473
 Sorry+1.393
 thank+1.307
 leave+1.276
Neg logit contributions
side-3.921
 sudden-3.750
amed-3.610
Meanwhile-3.516
 distance-3.455
Token mom
Token ID1995
Feature activation+2.03e+00

Pos logit contributions
 I+1.740
 give+1.734
 Sorry+1.650
 leave+1.554
 thank+1.535
Neg logit contributions
side-3.232
 sudden-3.121
amed-2.970
Meanwhile-2.922
 distance-2.835
Token?"
Token ID1701
Feature activation-8.19e-02

Pos logit contributions
 Sorry+2.893
 give+2.891
 I+2.864
 thank+2.710
 leave+2.643
Neg logit contributions
side-4.863
 sudden-4.507
amed-4.395
Meanwhile-4.267
 nearby-4.198
Token Their
Token ID5334
Feature activation-2.30e-01

Pos logit contributions
side+0.152
 sudden+0.143
amed+0.141
Meanwhile+0.129
aming+0.126
Neg logit contributions
 I-0.172
 give-0.166
 Sorry-0.160
 thank-0.157
 leave-0.154
Token mom
Token ID1995
Feature activation+9.24e-01

Pos logit contributions
side+0.602
 sudden+0.563
amed+0.556
Meanwhile+0.537
 distance+0.520
Neg logit contributions
 I-0.305
 give-0.304
 Sorry-0.284
 leave-0.279
 thank-0.278
Token laughs
Token ID22051
Feature activation+1.09e-01

Pos logit contributions
 I+1.420
 give+1.389
 Sorry+1.328
 leave+1.269
 thank+1.265
Neg logit contributions
side-2.148
 sudden-2.056
amed-1.974
Meanwhile-1.915
 distance-1.851
Token and
Token ID290
Feature activation+2.73e+00

Pos logit contributions
 I+0.216
 give+0.214
 thank+0.200
 leave+0.199
 Sorry+0.198
Neg logit contributions
side-0.211
 sudden-0.201
amed-0.188
Meanwhile-0.182
 distance-0.176
Token says
Token ID1139
Feature activation+2.00e+00

Pos logit contributions
 Sorry+4.258
 I+4.246
 give+3.934
 leave+3.788
 belong+3.734
Neg logit contributions
side-5.942
 sudden-5.814
amed-5.632
Meanwhile-5.576
 nearby-5.332
Token�
Token ID250
Feature activation-2.27e+00

Pos logit contributions
 I+0.296
 give+0.286
 Sorry+0.275
 thank+0.263
 leave+0.256
Neg logit contributions
side-0.496
 sudden-0.474
amed-0.456
Meanwhile-0.446
 distance-0.432
Token it
Token ID340
Feature activation+1.65e+00

Pos logit contributions
 sudden+3.645
side+3.587
Meanwhile+3.111
amed+3.105
 distance+3.041
Neg logit contributions
 I-5.497
 give-5.098
 Sorry-4.984
 share-4.746
 please-4.598
Token was
Token ID373
Feature activation+6.28e-01

Pos logit contributions
 I+2.812
 give+2.803
 Sorry+2.743
 leave+2.624
 thank+2.622
Neg logit contributions
side-3.426
 sudden-3.259
amed-3.155
Meanwhile-3.009
 distance-2.917
Token loaded
Token ID9639
Feature activation-8.47e-01

Pos logit contributions
 I+1.165
 give+1.148
 Sorry+1.086
 thank+1.063
 leave+1.061
Neg logit contributions
side-1.274
 sudden-1.209
amed-1.149
Meanwhile-1.108
 distance-1.067
Token with
Token ID351
Feature activation-4.21e-01

Pos logit contributions
side+2.154
 sudden+2.154
Meanwhile+1.992
amed+1.988
 distance+1.903
Neg logit contributions
 I-1.184
 give-1.125
 leave-1.000
 thank-0.982
 Sorry-0.977
Token lots
Token ID6041
Feature activation-2.17e-01

Pos logit contributions
side+1.022
 sudden+0.963
amed+0.934
Meanwhile+0.920
 distance+0.874
Neg logit contributions
 I-0.651
 give-0.634
 Sorry-0.578
 leave-0.576
 thank-0.576
Token of
Token ID286
Feature activation+2.72e-01

Pos logit contributions
side+0.393
 sudden+0.370
amed+0.354
Meanwhile+0.339
 distance+0.319
Neg logit contributions
 I-0.460
 give-0.450
 thank-0.425
 leave-0.415
 Sorry-0.414
Token snacks
Token ID28487
Feature activation+1.65e-01

Pos logit contributions
 I+0.515
 give+0.511
 Sorry+0.478
 thank+0.478
 leave+0.470
Neg logit contributions
side-0.550
 sudden-0.509
amed-0.493
Meanwhile-0.476
 distance-0.452
Token and
Token ID290
Feature activation+8.68e-01

Pos logit contributions
 I+0.244
 give+0.235
 Sorry+0.216
 thank+0.213
 leave+0.212
Neg logit contributions
side-0.397
 sudden-0.388
amed-0.366
Meanwhile-0.357
 distance-0.345
Token drinks
Token ID11758
Feature activation+2.50e-01

Pos logit contributions
 I+1.701
 give+1.683
 Sorry+1.610
 thank+1.567
 leave+1.567
Neg logit contributions
side-1.645
 sudden-1.561
amed-1.473
Meanwhile-1.414
 distance-1.365
Token for
Token ID329
Feature activation+5.27e-01

Pos logit contributions
 I+0.366
 give+0.357
 Sorry+0.328
 thank+0.325
 leave+0.320
Neg logit contributions
side-0.607
 sudden-0.589
amed-0.560
Meanwhile-0.547
 distance-0.526
Token't
Token ID470
Feature activation+2.34e+00

Pos logit contributions
 give+1.801
 I+1.790
 Sorry+1.697
 leave+1.631
 thank+1.625
Neg logit contributions
side-3.803
 sudden-3.661
amed-3.555
Meanwhile-3.473
 distance-3.300
Token need
Token ID761
Feature activation+8.44e-01

Pos logit contributions
 I+2.570
 Sorry+2.552
 give+2.266
 thank+2.196
 leave+2.028
Neg logit contributions
side-6.208
 sudden-6.110
amed-5.835
 distance-5.833
J-5.727
Token you
Token ID345
Feature activation+2.32e-01

Pos logit contributions
 I+1.166
 give+1.149
 Sorry+1.082
 thank+1.036
 leave+1.036
Neg logit contributions
side-2.091
 sudden-2.010
amed-1.921
Meanwhile-1.869
 distance-1.819
Token!"
Token ID2474
Feature activation+2.78e-01

Pos logit contributions
 I+0.369
 give+0.363
 Sorry+0.333
 leave+0.333
 thank+0.329
Neg logit contributions
side-0.535
 sudden-0.517
amed-0.493
Meanwhile-0.479
 distance-0.459
Token\n
Token ID198
Feature activation-1.77e+00

Pos logit contributions
 I+0.588
 give+0.576
 Sorry+0.552
 thank+0.541
 leave+0.535
Neg logit contributions
side-0.501
 sudden-0.470
amed-0.454
Meanwhile-0.427
 distance-0.410
Token\n
Token ID198
Feature activation-1.66e+00

Pos logit contributions
side+3.737
amed+3.388
 sudden+3.329
eled+3.118
 distance+3.115
Neg logit contributions
 I-3.552
 give-3.364
 thank-3.226
 Sorry-3.127
 ask-3.110
TokenHer
Token ID9360
Feature activation+3.66e-01

Pos logit contributions
side+2.732
amed+2.488
 sudden+2.340
 distance+2.252
eled+2.233
Neg logit contributions
 I-3.949
 give-3.783
 Sorry-3.672
 share-3.601
 ask-3.558
Token mom
Token ID1995
Feature activation-1.18e+00

Pos logit contributions
 I+0.524
 give+0.521
 Sorry+0.481
 thank+0.474
 leave+0.471
Neg logit contributions
side-0.911
 sudden-0.863
amed-0.834
Meanwhile-0.809
 distance-0.786
Token sighed
Token ID21893
Feature activation-1.78e-01

Pos logit contributions
side+2.697
 sudden+2.400
amed+2.304
 distance+2.232
Meanwhile+2.232
Neg logit contributions
 give-2.137
 I-2.063
 leave-1.925
 thank-1.920
 ask-1.842
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
side+0.341
 sudden+0.324
amed+0.309
Meanwhile+0.293
 distance+0.283
Neg logit contributions
 give-0.357
 I-0.354
 leave-0.331
 thank-0.330
 Sorry-0.322
Token left
Token ID1364
Feature activation-7.68e-01

Pos logit contributions
 I+2.698
 give+2.624
 Sorry+2.581
 leave+2.461
 thank+2.436
Neg logit contributions
side-2.678
 sudden-2.617
amed-2.476
Meanwhile-2.390
 distance-2.292
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
 I+0.689
 give+0.678
 Sorry+0.630
 thank+0.618
 leave+0.613
Neg logit contributions
side-1.145
 sudden-1.095
amed-1.049
Meanwhile-1.021
 distance-0.998
Token The
Token ID383
Feature activation-1.44e+00

Pos logit contributions
side+1.438
 sudden+1.371
amed+1.363
Meanwhile+1.299
 distance+1.294
Neg logit contributions
 I-0.711
 give-0.673
 Sorry-0.647
 thank-0.613
 leave-0.589
Token block
Token ID2512
Feature activation-1.91e+00

Pos logit contributions
side+3.114
amed+2.842
 sudden+2.820
Meanwhile+2.727
aming+2.590
Neg logit contributions
 I-2.769
 give-2.732
 thank-2.590
 leave-2.513
 Sorry-2.494
Token flew
Token ID13112
Feature activation-1.74e+00

Pos logit contributions
side+4.498
 sudden+4.227
amed+4.067
aming+3.903
Meanwhile+3.902
Neg logit contributions
 give-3.280
 I-3.223
 leave-3.003
 thank-2.925
 Sorry-2.771
Token out
Token ID503
Feature activation-1.33e+00

Pos logit contributions
 sudden+4.152
side+4.079
Meanwhile+3.694
amed+3.684
J+3.573
Neg logit contributions
 I-2.876
 give-2.785
 leave-2.616
 thank-2.530
 please-2.437
Token of
Token ID286
Feature activation-2.01e+00

Pos logit contributions
side+2.828
 sudden+2.680
amed+2.602
Meanwhile+2.470
aming+2.324
Neg logit contributions
 I-2.502
 give-2.469
 leave-2.286
 thank-2.264
 share-2.123
Token their
Token ID511
Feature activation-6.02e-01

Pos logit contributions
side+5.177
 sudden+5.002
amed+4.903
Meanwhile+4.710
Days+4.545
Neg logit contributions
 I-2.963
 give-2.949
 leave-2.648
 thank-2.639
 share-2.480
Token hands
Token ID2832
Feature activation-1.83e+00

Pos logit contributions
side+1.320
 sudden+1.230
amed+1.213
Meanwhile+1.182
J+1.150
Neg logit contributions
 I-1.036
 give-1.032
 leave-0.987
 Sorry-0.959
 thank-0.944
Token and
Token ID290
Feature activation-3.48e-01

Pos logit contributions
 sudden+4.156
side+3.996
amed+3.862
Meanwhile+3.788
J+3.622
Neg logit contributions
 give-3.135
 I-3.117
 leave-2.860
 thank-2.783
 share-2.635
Token hit
Token ID2277
Feature activation-2.36e+00

Pos logit contributions
side+0.709
 sudden+0.655
amed+0.626
Meanwhile+0.608
 distance+0.576
Neg logit contributions
 give-0.667
 I-0.665
 Sorry-0.621
 thank-0.615
 leave-0.613
Token their
Token ID511
Feature activation-1.06e+00

Pos logit contributions
side+6.641
 sudden+6.258
amed+6.130
Meanwhile+6.098
aming+5.708
Neg logit contributions
 I-3.375
 give-3.142
 leave-2.912
 thank-2.762
 Sorry-2.717
Token still
Token ID991
Feature activation+9.07e-01

Pos logit contributions
side+1.688
 sudden+1.574
amed+1.513
 distance+1.459
Meanwhile+1.453
Neg logit contributions
 I-1.194
 give-1.191
 leave-1.080
 thank-1.072
 Sorry-1.063
Token said
Token ID531
Feature activation+8.20e-01

Pos logit contributions
 I+1.634
 give+1.611
 Sorry+1.539
 thank+1.491
 leave+1.487
Neg logit contributions
side-1.867
 sudden-1.776
amed-1.692
Meanwhile-1.629
 distance-1.574
Token no
Token ID645
Feature activation+2.90e-01

Pos logit contributions
 I+1.202
 give+1.185
 Sorry+1.115
 leave+1.074
 thank+1.071
Neg logit contributions
side-1.965
 sudden-1.879
amed-1.797
Meanwhile-1.745
 distance-1.693
Token.
Token ID13
Feature activation-3.97e-01

Pos logit contributions
 I+0.512
 give+0.505
 Sorry+0.467
 thank+0.464
 leave+0.463
Neg logit contributions
side-0.625
 sudden-0.592
amed-0.577
Meanwhile-0.548
 distance-0.532
Token
Token ID220
Feature activation-7.58e-01

Pos logit contributions
side+0.739
 sudden+0.704
amed+0.683
Meanwhile+0.633
 distance+0.620
Neg logit contributions
 I-0.821
 give-0.776
 Sorry-0.770
 thank-0.731
 ask-0.718
Token\n
Token ID198
Feature activation-2.50e+00

Pos logit contributions
side+1.357
 sudden+1.323
amed+1.210
 distance+1.170
Meanwhile+1.140
Neg logit contributions
 I-1.640
 give-1.551
 Sorry-1.493
 thank-1.458
 ask-1.438
Token\n
Token ID198
Feature activation-3.12e+00

Pos logit contributions
side+5.479
amed+5.207
 sudden+5.163
 distance+4.893
aming+4.753
Neg logit contributions
 I-4.707
 give-4.248
 thank-4.167
 Sorry-4.105
 ask-4.072
TokenJohn
Token ID7554
Feature activation-1.95e+00

Pos logit contributions
side+3.744
amed+3.314
 distance+3.299
 sudden+3.182
 valleys+3.009
Neg logit contributions
 I-8.859
 Sorry-8.461
 give-8.212
 please-8.152
 share-8.006
Token started
Token ID2067
Feature activation-8.80e-01

Pos logit contributions
side+4.511
 sudden+4.101
amed+3.955
 distance+3.882
J+3.821
Neg logit contributions
 give-3.482
 leave-3.248
 I-3.245
 ask-3.094
 thank-3.014
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
side+2.138
amed+2.044
 sudden+2.004
Meanwhile+1.917
aming+1.886
Neg logit contributions
 give-1.347
 I-1.260
 leave-1.208
 thank-1.146
 ask-1.145
Token cry
Token ID3960
Feature activation-1.03e-01

Pos logit contributions
 I+1.156
 give+1.089
 Sorry+1.027
 thank+0.968
 leave+0.947
Neg logit contributions
side-2.977
 sudden-2.881
amed-2.745
Meanwhile-2.694
 distance-2.650
Tokenmy
Token ID1820
Feature activation-1.21e+00

Pos logit contributions
side+3.621
 sudden+3.367
Meanwhile+3.101
amed+3.074
 distance+2.936
Neg logit contributions
 give-3.495
 I-3.432
 leave-3.245
 ask-3.166
 share-3.150
Token found
Token ID1043
Feature activation-1.91e+00

Pos logit contributions
side+2.482
 sudden+2.223
amed+2.156
Meanwhile+2.127
 distance+2.033
Neg logit contributions
 give-2.410
 I-2.353
 thank-2.221
 leave-2.183
 share-2.134
Token it
Token ID340
Feature activation-1.76e+00

Pos logit contributions
side+4.943
 sudden+4.595
Meanwhile+4.288
amed+4.214
 distance+4.108
Neg logit contributions
 I-3.115
 give-3.092
 thank-2.926
 leave-2.674
 share-2.545
Token fascinating
Token ID13899
Feature activation-7.69e-01

Pos logit contributions
side+4.249
 sudden+4.209
amed+3.907
Meanwhile+3.840
J+3.695
Neg logit contributions
 I-2.978
 give-2.840
 leave-2.554
 thank-2.489
 share-2.414
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
side+1.786
 sudden+1.720
amed+1.675
Meanwhile+1.642
J+1.535
Neg logit contributions
 I-1.263
 give-1.236
 leave-1.124
 thank-1.116
 Sorry-1.072
Token Tim
Token ID5045
Feature activation-1.98e+00

Pos logit contributions
side+0.262
 sudden+0.250
amed+0.241
Meanwhile+0.229
aming+0.223
Neg logit contributions
 I-0.193
 give-0.183
 Sorry-0.174
 thank-0.169
 leave-0.165
Tokenmy
Token ID1820
Feature activation-1.30e+00

Pos logit contributions
side+4.341
 sudden+3.967
amed+3.602
Meanwhile+3.596
 distance+3.460
Neg logit contributions
 give-3.882
 I-3.773
 leave-3.637
 ask-3.597
 share-3.522
Token wanted
Token ID2227
Feature activation-5.35e-01

Pos logit contributions
side+2.628
 sudden+2.301
amed+2.217
 distance+2.159
Meanwhile+2.141
Neg logit contributions
 give-2.682
 I-2.621
 thank-2.462
 leave-2.422
 share-2.399
Token to
Token ID284
Feature activation+2.32e+00

Pos logit contributions
side+1.338
 sudden+1.268
amed+1.250
Meanwhile+1.196
aming+1.157
Neg logit contributions
 I-0.805
 give-0.804
 thank-0.721
 leave-0.709
 ask-0.684
Token ride
Token ID6594
Feature activation-1.09e+00

Pos logit contributions
 I+2.860
 Sorry+2.826
 give+2.541
 sorry+2.396
 thank+2.338
Neg logit contributions
side-5.841
 sudden-5.717
 distance-5.451
amed-5.377
 nearby-5.279
Token the
Token ID262
Feature activation-2.17e+00

Pos logit contributions
side+2.482
 sudden+2.415
Meanwhile+2.274
amed+2.259
aming+2.101
Neg logit contributions
 I-1.899
 give-1.858
 thank-1.719
 leave-1.687
 share-1.675
Token man
Token ID582
Feature activation-6.22e-01

Pos logit contributions
side+0.529
 sudden+0.483
amed+0.474
Meanwhile+0.450
aming+0.428
Neg logit contributions
 give-0.499
 I-0.493
 thank-0.464
 leave-0.458
 Sorry-0.450
Token's
Token ID338
Feature activation-4.07e-01

Pos logit contributions
side+1.411
 sudden+1.362
amed+1.287
Meanwhile+1.270
 distance+1.209
Neg logit contributions
 I-1.059
 give-1.049
 leave-0.958
 thank-0.950
 Sorry-0.931
Token work
Token ID670
Feature activation-7.52e-01

Pos logit contributions
side+1.119
 sudden+1.055
amed+1.033
Meanwhile+1.017
 distance+0.974
Neg logit contributions
 give-0.493
 I-0.473
 thank-0.444
 leave-0.430
 Sorry-0.413
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
side+1.682
 sudden+1.675
Meanwhile+1.551
amed+1.549
 distance+1.469
Neg logit contributions
 I-1.258
 give-1.242
 thank-1.139
 leave-1.123
 Sorry-1.106
Token\n
Token ID198
Feature activation-2.66e+00

Pos logit contributions
 I+0.443
 give+0.427
 Sorry+0.410
 thank+0.399
 leave+0.392
Neg logit contributions
side-0.475
 sudden-0.453
amed-0.430
Meanwhile-0.411
 distance-0.399
Token\n
Token ID198
Feature activation-2.51e+00

Pos logit contributions
side+5.102
 sudden+4.799
amed+4.474
eled+4.278
 distance+4.258
Neg logit contributions
 I-5.828
 give-5.667
 thank-5.630
 share-5.317
 ask-5.272
TokenThe
Token ID464
Feature activation-5.02e-01

Pos logit contributions
side+3.449
 sudden+3.289
amed+3.061
eled+3.010
 nowhere+2.867
Neg logit contributions
 I-6.571
 give-6.342
 share-6.174
 thank-6.021
 Sorry-5.992
Token man
Token ID582
Feature activation-9.34e-01

Pos logit contributions
side+0.893
 sudden+0.799
amed+0.789
Meanwhile+0.742
 distance+0.683
Neg logit contributions
 give-1.125
 I-1.095
 thank-1.073
 leave-1.036
 Sorry-1.023
Token was
Token ID373
Feature activation-1.22e-01

Pos logit contributions
side+1.879
 sudden+1.712
amed+1.588
Meanwhile+1.572
 distance+1.570
Neg logit contributions
 give-1.886
 I-1.858
 thank-1.743
 leave-1.728
 share-1.691
Token very
Token ID845
Feature activation+1.48e+00

Pos logit contributions
side+0.248
 sudden+0.236
amed+0.215
Meanwhile+0.214
 distance+0.204
Neg logit contributions
 I-0.231
 give-0.231
 thank-0.215
 leave-0.213
 Sorry-0.210
Token pleased
Token ID10607
Feature activation+7.19e-01

Pos logit contributions
 I+3.136
 give+3.058
 Sorry+2.961
 leave+2.906
 thank+2.832
Neg logit contributions
side-2.581
 sudden-2.477
amed-2.358
Meanwhile-2.221
 distance-2.152
Token,
Token ID11
Feature activation+4.19e-01

Pos logit contributions
side+0.450
 sudden+0.443
amed+0.427
Meanwhile+0.415
 distance+0.404
Neg logit contributions
 I-0.159
 give-0.149
 Sorry-0.130
 leave-0.129
 thank-0.128
Token they
Token ID484
Feature activation-8.71e-01

Pos logit contributions
 I+0.654
 give+0.637
 Sorry+0.601
 leave+0.580
 thank+0.580
Neg logit contributions
side-0.976
 sudden-0.928
amed-0.901
Meanwhile-0.870
 distance-0.842
Token spread
Token ID4104
Feature activation-2.10e+00

Pos logit contributions
side+1.559
 sudden+1.457
amed+1.383
Meanwhile+1.336
aming+1.233
Neg logit contributions
 I-1.872
 give-1.871
 leave-1.734
 thank-1.729
 share-1.708
Token the
Token ID262
Feature activation-1.55e+00

Pos logit contributions
side+5.001
 sudden+4.809
Meanwhile+4.651
amed+4.421
Days+4.346
Neg logit contributions
 give-3.448
 I-3.255
 thank-3.178
 leave-3.171
 share-3.006
Token blanket
Token ID18447
Feature activation-1.25e+00

Pos logit contributions
side+3.359
 sudden+3.086
Meanwhile+3.060
amed+3.052
J+3.002
Neg logit contributions
 I-2.822
 give-2.745
 thank-2.558
 leave-2.558
 Sorry-2.492
Token on
Token ID319
Feature activation-1.95e+00

Pos logit contributions
 sudden+2.590
side+2.541
Meanwhile+2.348
amed+2.314
J+2.305
Neg logit contributions
 give-2.294
 I-2.255
 leave-2.154
 thank-2.083
 share-2.018
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
amed+5.034
side+4.841
 sudden+4.811
Meanwhile+4.683
aming+4.480
Neg logit contributions
 give-2.836
 I-2.811
 thank-2.505
 leave-2.483
 share-2.430
Token grass
Token ID8701
Feature activation-1.96e+00

Pos logit contributions
side+3.114
 sudden+2.947
amed+2.890
J+2.818
Meanwhile+2.800
Neg logit contributions
 I-4.408
 give-4.168
 leave-4.040
 thank-3.988
 share-3.854
Token and
Token ID290
Feature activation+2.16e-02

Pos logit contributions
 sudden+4.876
side+4.398
amed+4.346
Meanwhile+4.300
J+4.177
Neg logit contributions
 I-3.101
 give-2.966
 leave-2.742
 thank-2.604
 share-2.548
Token began
Token ID2540
Feature activation+1.70e-02

Pos logit contributions
 I+0.041
 give+0.040
 Sorry+0.037
 leave+0.037
 thank+0.037
Neg logit contributions
side-0.043
 sudden-0.041
amed-0.039
Meanwhile-0.038
J-0.036
Token to
Token ID284
Feature activation+1.32e+00

Pos logit contributions
 I+0.026
 give+0.026
 Sorry+0.024
 leave+0.024
 thank+0.024
Neg logit contributions
side-0.040
 sudden-0.038
amed-0.037
Meanwhile-0.035
 distance-0.034