EC2 Default User
commited on
Commit
•
faa7399
1
Parent(s):
0c4a7b8
Update spaCy pipeline
Browse files- LICENSES_SOURCES +0 -551
- README.md +0 -0
- accuracy.json +108 -264
- attribute_ruler/patterns +0 -0
- config.cfg +39 -17
- lemmatizer/cfg +730 -0
- lemmatizer/{lookups/lookups.bin → model} +2 -2
- lemmatizer/trees +0 -0
- lt_core_news_md-any-py3-none-any.whl +2 -2
- meta.json +116 -283
- morphologizer/model +2 -2
- ner/model +2 -2
- parser/model +1 -1
- parser/moves +1 -1
- senter/model +2 -2
- tagger/cfg +1 -0
- tagger/model +2 -2
- tok2vec/model +2 -2
- tokenizer +2 -2
- vocab/strings.json +2 -2
LICENSES_SOURCES
CHANGED
@@ -450,557 +450,6 @@ Creative Commons may be contacted at creativecommons.org.
|
|
450 |
|
451 |
|
452 |
|
453 |
-
# Lemmatization Lists
|
454 |
-
|
455 |
-
* Author: Michal Měchura
|
456 |
-
* URL: https://github.com/michmech/lemmatization-lists/
|
457 |
-
* License: ODbL
|
458 |
-
|
459 |
-
```
|
460 |
-
## ODC Open Database License (ODbL)
|
461 |
-
|
462 |
-
### Preamble
|
463 |
-
|
464 |
-
The Open Database License (ODbL) is a license agreement intended to
|
465 |
-
allow users to freely share, modify, and use this Database while
|
466 |
-
maintaining this same freedom for others. Many databases are covered by
|
467 |
-
copyright, and therefore this document licenses these rights. Some
|
468 |
-
jurisdictions, mainly in the European Union, have specific rights that
|
469 |
-
cover databases, and so the ODbL addresses these rights, too. Finally,
|
470 |
-
the ODbL is also an agreement in contract for users of this Database to
|
471 |
-
act in certain ways in return for accessing this Database.
|
472 |
-
|
473 |
-
Databases can contain a wide variety of types of content (images,
|
474 |
-
audiovisual material, and sounds all in the same database, for example),
|
475 |
-
and so the ODbL only governs the rights over the Database, and not the
|
476 |
-
contents of the Database individually. Licensors should use the ODbL
|
477 |
-
together with another license for the contents, if the contents have a
|
478 |
-
single set of rights that uniformly covers all of the contents. If the
|
479 |
-
contents have multiple sets of different rights, Licensors should
|
480 |
-
describe what rights govern what contents together in the individual
|
481 |
-
record or in some other way that clarifies what rights apply.
|
482 |
-
|
483 |
-
Sometimes the contents of a database, or the database itself, can be
|
484 |
-
covered by other rights not addressed here (such as private contracts,
|
485 |
-
trade mark over the name, or privacy rights / data protection rights
|
486 |
-
over information in the contents), and so you are advised that you may
|
487 |
-
have to consult other documents or clear other rights before doing
|
488 |
-
activities not covered by this License.
|
489 |
-
|
490 |
-
------
|
491 |
-
|
492 |
-
The Licensor (as defined below)
|
493 |
-
|
494 |
-
and
|
495 |
-
|
496 |
-
You (as defined below)
|
497 |
-
|
498 |
-
agree as follows:
|
499 |
-
|
500 |
-
### 1.0 Definitions of Capitalised Words
|
501 |
-
|
502 |
-
"Collective Database" – Means this Database in unmodified form as part
|
503 |
-
of a collection of independent databases in themselves that together are
|
504 |
-
assembled into a collective whole. A work that constitutes a Collective
|
505 |
-
Database will not be considered a Derivative Database.
|
506 |
-
|
507 |
-
"Convey" – As a verb, means Using the Database, a Derivative Database,
|
508 |
-
or the Database as part of a Collective Database in any way that enables
|
509 |
-
a Person to make or receive copies of the Database or a Derivative
|
510 |
-
Database. Conveying does not include interaction with a user through a
|
511 |
-
computer network, or creating and Using a Produced Work, where no
|
512 |
-
transfer of a copy of the Database or a Derivative Database occurs.
|
513 |
-
"Contents" – The contents of this Database, which includes the
|
514 |
-
information, independent works, or other material collected into the
|
515 |
-
Database. For example, the contents of the Database could be factual
|
516 |
-
data or works such as images, audiovisual material, text, or sounds.
|
517 |
-
|
518 |
-
"Database" – A collection of material (the Contents) arranged in a
|
519 |
-
systematic or methodical way and individually accessible by electronic
|
520 |
-
or other means offered under the terms of this License.
|
521 |
-
|
522 |
-
"Database Directive" – Means Directive 96/9/EC of the European
|
523 |
-
Parliament and of the Council of 11 March 1996 on the legal protection
|
524 |
-
of databases, as amended or succeeded.
|
525 |
-
|
526 |
-
"Database Right" – Means rights resulting from the Chapter III ("sui
|
527 |
-
generis") rights in the Database Directive (as amended and as transposed
|
528 |
-
by member states), which includes the Extraction and Re-utilisation of
|
529 |
-
the whole or a Substantial part of the Contents, as well as any similar
|
530 |
-
rights available in the relevant jurisdiction under Section 10.4.
|
531 |
-
|
532 |
-
"Derivative Database" – Means a database based upon the Database, and
|
533 |
-
includes any translation, adaptation, arrangement, modification, or any
|
534 |
-
other alteration of the Database or of a Substantial part of the
|
535 |
-
Contents. This includes, but is not limited to, Extracting or
|
536 |
-
Re-utilising the whole or a Substantial part of the Contents in a new
|
537 |
-
Database.
|
538 |
-
|
539 |
-
"Extraction" – Means the permanent or temporary transfer of all or a
|
540 |
-
Substantial part of the Contents to another medium by any means or in
|
541 |
-
any form.
|
542 |
-
|
543 |
-
"License" – Means this license agreement and is both a license of rights
|
544 |
-
such as copyright and Database Rights and an agreement in contract.
|
545 |
-
|
546 |
-
"Licensor" – Means the Person that offers the Database under the terms
|
547 |
-
of this License.
|
548 |
-
|
549 |
-
"Person" – Means a natural or legal person or a body of persons
|
550 |
-
corporate or incorporate.
|
551 |
-
|
552 |
-
"Produced Work" – a work (such as an image, audiovisual material, text,
|
553 |
-
or sounds) resulting from using the whole or a Substantial part of the
|
554 |
-
Contents (via a search or other query) from this Database, a Derivative
|
555 |
-
Database, or this Database as part of a Collective Database.
|
556 |
-
|
557 |
-
"Publicly" – means to Persons other than You or under Your control by
|
558 |
-
either more than 50% ownership or by the power to direct their
|
559 |
-
activities (such as contracting with an independent consultant).
|
560 |
-
|
561 |
-
"Re-utilisation" – means any form of making available to the public all
|
562 |
-
or a Substantial part of the Contents by the distribution of copies, by
|
563 |
-
renting, by online or other forms of transmission.
|
564 |
-
|
565 |
-
"Substantial" – Means substantial in terms of quantity or quality or a
|
566 |
-
combination of both. The repeated and systematic Extraction or
|
567 |
-
Re-utilisation of insubstantial parts of the Contents may amount to the
|
568 |
-
Extraction or Re-utilisation of a Substantial part of the Contents.
|
569 |
-
|
570 |
-
"Use" – As a verb, means doing any act that is restricted by copyright
|
571 |
-
or Database Rights whether in the original medium or any other; and
|
572 |
-
includes without limitation distributing, copying, publicly performing,
|
573 |
-
publicly displaying, and preparing derivative works of the Database, as
|
574 |
-
well as modifying the Database as may be technically necessary to use it
|
575 |
-
in a different mode or format.
|
576 |
-
|
577 |
-
"You" – Means a Person exercising rights under this License who has not
|
578 |
-
previously violated the terms of this License with respect to the
|
579 |
-
Database, or who has received express permission from the Licensor to
|
580 |
-
exercise rights under this License despite a previous violation.
|
581 |
-
|
582 |
-
Words in the singular include the plural and vice versa.
|
583 |
-
|
584 |
-
### 2.0 What this License covers
|
585 |
-
|
586 |
-
2.1. Legal effect of this document. This License is:
|
587 |
-
|
588 |
-
a. A license of applicable copyright and neighbouring rights;
|
589 |
-
|
590 |
-
b. A license of the Database Right; and
|
591 |
-
|
592 |
-
c. An agreement in contract between You and the Licensor.
|
593 |
-
|
594 |
-
2.2 Legal rights covered. This License covers the legal rights in the
|
595 |
-
Database, including:
|
596 |
-
|
597 |
-
a. Copyright. Any copyright or neighbouring rights in the Database.
|
598 |
-
The copyright licensed includes any individual elements of the
|
599 |
-
Database, but does not cover the copyright over the Contents
|
600 |
-
independent of this Database. See Section 2.4 for details. Copyright
|
601 |
-
law varies between jurisdictions, but is likely to cover: the Database
|
602 |
-
model or schema, which is the structure, arrangement, and organisation
|
603 |
-
of the Database, and can also include the Database tables and table
|
604 |
-
indexes; the data entry and output sheets; and the Field names of
|
605 |
-
Contents stored in the Database;
|
606 |
-
|
607 |
-
b. Database Rights. Database Rights only extend to the Extraction and
|
608 |
-
Re-utilisation of the whole or a Substantial part of the Contents.
|
609 |
-
Database Rights can apply even when there is no copyright over the
|
610 |
-
Database. Database Rights can also apply when the Contents are removed
|
611 |
-
from the Database and are selected and arranged in a way that would
|
612 |
-
not infringe any applicable copyright; and
|
613 |
-
|
614 |
-
c. Contract. This is an agreement between You and the Licensor for
|
615 |
-
access to the Database. In return you agree to certain conditions of
|
616 |
-
use on this access as outlined in this License.
|
617 |
-
|
618 |
-
2.3 Rights not covered.
|
619 |
-
|
620 |
-
a. This License does not apply to computer programs used in the making
|
621 |
-
or operation of the Database;
|
622 |
-
|
623 |
-
b. This License does not cover any patents over the Contents or the
|
624 |
-
Database; and
|
625 |
-
|
626 |
-
c. This License does not cover any trademarks associated with the
|
627 |
-
Database.
|
628 |
-
|
629 |
-
2.4 Relationship to Contents in the Database. The individual items of
|
630 |
-
the Contents contained in this Database may be covered by other rights,
|
631 |
-
including copyright, patent, data protection, privacy, or personality
|
632 |
-
rights, and this License does not cover any rights (other than Database
|
633 |
-
Rights or in contract) in individual Contents contained in the Database.
|
634 |
-
For example, if used on a Database of images (the Contents), this
|
635 |
-
License would not apply to copyright over individual images, which could
|
636 |
-
have their own separate licenses, or one single license covering all of
|
637 |
-
the rights over the images.
|
638 |
-
|
639 |
-
### 3.0 Rights granted
|
640 |
-
|
641 |
-
3.1 Subject to the terms and conditions of this License, the Licensor
|
642 |
-
grants to You a worldwide, royalty-free, non-exclusive, terminable (but
|
643 |
-
only under Section 9) license to Use the Database for the duration of
|
644 |
-
any applicable copyright and Database Rights. These rights explicitly
|
645 |
-
include commercial use, and do not exclude any field of endeavour. To
|
646 |
-
the extent possible in the relevant jurisdiction, these rights may be
|
647 |
-
exercised in all media and formats whether now known or created in the
|
648 |
-
future.
|
649 |
-
|
650 |
-
The rights granted cover, for example:
|
651 |
-
|
652 |
-
a. Extraction and Re-utilisation of the whole or a Substantial part of
|
653 |
-
the Contents;
|
654 |
-
|
655 |
-
b. Creation of Derivative Databases;
|
656 |
-
|
657 |
-
c. Creation of Collective Databases;
|
658 |
-
|
659 |
-
d. Creation of temporary or permanent reproductions by any means and
|
660 |
-
in any form, in whole or in part, including of any Derivative
|
661 |
-
Databases or as a part of Collective Databases; and
|
662 |
-
|
663 |
-
e. Distribution, communication, display, lending, making available, or
|
664 |
-
performance to the public by any means and in any form, in whole or in
|
665 |
-
part, including of any Derivative Database or as a part of Collective
|
666 |
-
Databases.
|
667 |
-
|
668 |
-
3.2 Compulsory license schemes. For the avoidance of doubt:
|
669 |
-
|
670 |
-
a. Non-waivable compulsory license schemes. In those jurisdictions in
|
671 |
-
which the right to collect royalties through any statutory or
|
672 |
-
compulsory licensing scheme cannot be waived, the Licensor reserves
|
673 |
-
the exclusive right to collect such royalties for any exercise by You
|
674 |
-
of the rights granted under this License;
|
675 |
-
|
676 |
-
b. Waivable compulsory license schemes. In those jurisdictions in
|
677 |
-
which the right to collect royalties through any statutory or
|
678 |
-
compulsory licensing scheme can be waived, the Licensor waives the
|
679 |
-
exclusive right to collect such royalties for any exercise by You of
|
680 |
-
the rights granted under this License; and,
|
681 |
-
|
682 |
-
c. Voluntary license schemes. The Licensor waives the right to collect
|
683 |
-
royalties, whether individually or, in the event that the Licensor is
|
684 |
-
a member of a collecting society that administers voluntary licensing
|
685 |
-
schemes, via that society, from any exercise by You of the rights
|
686 |
-
granted under this License.
|
687 |
-
|
688 |
-
3.3 The right to release the Database under different terms, or to stop
|
689 |
-
distributing or making available the Database, is reserved. Note that
|
690 |
-
this Database may be multiple-licensed, and so You may have the choice
|
691 |
-
of using alternative licenses for this Database. Subject to Section
|
692 |
-
10.4, all other rights not expressly granted by Licensor are reserved.
|
693 |
-
|
694 |
-
### 4.0 Conditions of Use
|
695 |
-
|
696 |
-
4.1 The rights granted in Section 3 above are expressly made subject to
|
697 |
-
Your complying with the following conditions of use. These are important
|
698 |
-
conditions of this License, and if You fail to follow them, You will be
|
699 |
-
in material breach of its terms.
|
700 |
-
|
701 |
-
4.2 Notices. If You Publicly Convey this Database, any Derivative
|
702 |
-
Database, or the Database as part of a Collective Database, then You
|
703 |
-
must:
|
704 |
-
|
705 |
-
a. Do so only under the terms of this License or another license
|
706 |
-
permitted under Section 4.4;
|
707 |
-
|
708 |
-
b. Include a copy of this License (or, as applicable, a license
|
709 |
-
permitted under Section 4.4) or its Uniform Resource Identifier (URI)
|
710 |
-
with the Database or Derivative Database, including both in the
|
711 |
-
Database or Derivative Database and in any relevant documentation; and
|
712 |
-
|
713 |
-
c. Keep intact any copyright or Database Right notices and notices
|
714 |
-
that refer to this License.
|
715 |
-
|
716 |
-
d. If it is not possible to put the required notices in a particular
|
717 |
-
file due to its structure, then You must include the notices in a
|
718 |
-
location (such as a relevant directory) where users would be likely to
|
719 |
-
look for it.
|
720 |
-
|
721 |
-
4.3 Notice for using output (Contents). Creating and Using a Produced
|
722 |
-
Work does not require the notice in Section 4.2. However, if you
|
723 |
-
Publicly Use a Produced Work, You must include a notice associated with
|
724 |
-
the Produced Work reasonably calculated to make any Person that uses,
|
725 |
-
views, accesses, interacts with, or is otherwise exposed to the Produced
|
726 |
-
Work aware that Content was obtained from the Database, Derivative
|
727 |
-
Database, or the Database as part of a Collective Database, and that it
|
728 |
-
is available under this License.
|
729 |
-
|
730 |
-
a. Example notice. The following text will satisfy notice under
|
731 |
-
Section 4.3:
|
732 |
-
|
733 |
-
Contains information from DATABASE NAME, which is made available
|
734 |
-
here under the Open Database License (ODbL).
|
735 |
-
|
736 |
-
DATABASE NAME should be replaced with the name of the Database and a
|
737 |
-
hyperlink to the URI of the Database. "Open Database License" should
|
738 |
-
contain a hyperlink to the URI of the text of this License. If
|
739 |
-
hyperlinks are not possible, You should include the plain text of the
|
740 |
-
required URI's with the above notice.
|
741 |
-
|
742 |
-
4.4 Share alike.
|
743 |
-
|
744 |
-
a. Any Derivative Database that You Publicly Use must be only under
|
745 |
-
the terms of:
|
746 |
-
|
747 |
-
i. This License;
|
748 |
-
|
749 |
-
ii. A later version of this License similar in spirit to this
|
750 |
-
License; or
|
751 |
-
|
752 |
-
iii. A compatible license.
|
753 |
-
|
754 |
-
If You license the Derivative Database under one of the licenses
|
755 |
-
mentioned in (iii), You must comply with the terms of that license.
|
756 |
-
|
757 |
-
b. For the avoidance of doubt, Extraction or Re-utilisation of the
|
758 |
-
whole or a Substantial part of the Contents into a new database is a
|
759 |
-
Derivative Database and must comply with Section 4.4.
|
760 |
-
|
761 |
-
c. Derivative Databases and Produced Works. A Derivative Database is
|
762 |
-
Publicly Used and so must comply with Section 4.4. if a Produced Work
|
763 |
-
created from the Derivative Database is Publicly Used.
|
764 |
-
|
765 |
-
d. Share Alike and additional Contents. For the avoidance of doubt,
|
766 |
-
You must not add Contents to Derivative Databases under Section 4.4 a
|
767 |
-
that are incompatible with the rights granted under this License.
|
768 |
-
|
769 |
-
e. Compatible licenses. Licensors may authorise a proxy to determine
|
770 |
-
compatible licenses under Section 4.4 a iii. If they do so, the
|
771 |
-
authorised proxy's public statement of acceptance of a compatible
|
772 |
-
license grants You permission to use the compatible license.
|
773 |
-
|
774 |
-
|
775 |
-
4.5 Limits of Share Alike. The requirements of Section 4.4 do not apply
|
776 |
-
in the following:
|
777 |
-
|
778 |
-
a. For the avoidance of doubt, You are not required to license
|
779 |
-
Collective Databases under this License if You incorporate this
|
780 |
-
Database or a Derivative Database in the collection, but this License
|
781 |
-
still applies to this Database or a Derivative Database as a part of
|
782 |
-
the Collective Database;
|
783 |
-
|
784 |
-
b. Using this Database, a Derivative Database, or this Database as
|
785 |
-
part of a Collective Database to create a Produced Work does not
|
786 |
-
create a Derivative Database for purposes of Section 4.4; and
|
787 |
-
|
788 |
-
c. Use of a Derivative Database internally within an organisation is
|
789 |
-
not to the public and therefore does not fall under the requirements
|
790 |
-
of Section 4.4.
|
791 |
-
|
792 |
-
4.6 Access to Derivative Databases. If You Publicly Use a Derivative
|
793 |
-
Database or a Produced Work from a Derivative Database, You must also
|
794 |
-
offer to recipients of the Derivative Database or Produced Work a copy
|
795 |
-
in a machine readable form of:
|
796 |
-
|
797 |
-
a. The entire Derivative Database; or
|
798 |
-
|
799 |
-
b. A file containing all of the alterations made to the Database or
|
800 |
-
the method of making the alterations to the Database (such as an
|
801 |
-
algorithm), including any additional Contents, that make up all the
|
802 |
-
differences between the Database and the Derivative Database.
|
803 |
-
|
804 |
-
The Derivative Database (under a.) or alteration file (under b.) must be
|
805 |
-
available at no more than a reasonable production cost for physical
|
806 |
-
distributions and free of charge if distributed over the internet.
|
807 |
-
|
808 |
-
4.7 Technological measures and additional terms
|
809 |
-
|
810 |
-
a. This License does not allow You to impose (except subject to
|
811 |
-
Section 4.7 b.) any terms or any technological measures on the
|
812 |
-
Database, a Derivative Database, or the whole or a Substantial part of
|
813 |
-
the Contents that alter or restrict the terms of this License, or any
|
814 |
-
rights granted under it, or have the effect or intent of restricting
|
815 |
-
the ability of any person to exercise those rights.
|
816 |
-
|
817 |
-
b. Parallel distribution. You may impose terms or technological
|
818 |
-
measures on the Database, a Derivative Database, or the whole or a
|
819 |
-
Substantial part of the Contents (a "Restricted Database") in
|
820 |
-
contravention of Section 4.74 a. only if You also make a copy of the
|
821 |
-
Database or a Derivative Database available to the recipient of the
|
822 |
-
Restricted Database:
|
823 |
-
|
824 |
-
i. That is available without additional fee;
|
825 |
-
|
826 |
-
ii. That is available in a medium that does not alter or restrict
|
827 |
-
the terms of this License, or any rights granted under it, or have
|
828 |
-
the effect or intent of restricting the ability of any person to
|
829 |
-
exercise those rights (an "Unrestricted Database"); and
|
830 |
-
|
831 |
-
iii. The Unrestricted Database is at least as accessible to the
|
832 |
-
recipient as a practical matter as the Restricted Database.
|
833 |
-
|
834 |
-
c. For the avoidance of doubt, You may place this Database or a
|
835 |
-
Derivative Database in an authenticated environment, behind a
|
836 |
-
password, or within a similar access control scheme provided that You
|
837 |
-
do not alter or restrict the terms of this License or any rights
|
838 |
-
granted under it or have the effect or intent of restricting the
|
839 |
-
ability of any person to exercise those rights.
|
840 |
-
|
841 |
-
4.8 Licensing of others. You may not sublicense the Database. Each time
|
842 |
-
You communicate the Database, the whole or Substantial part of the
|
843 |
-
Contents, or any Derivative Database to anyone else in any way, the
|
844 |
-
Licensor offers to the recipient a license to the Database on the same
|
845 |
-
terms and conditions as this License. You are not responsible for
|
846 |
-
enforcing compliance by third parties with this License, but You may
|
847 |
-
enforce any rights that You have over a Derivative Database. You are
|
848 |
-
solely responsible for any modifications of a Derivative Database made
|
849 |
-
by You or another Person at Your direction. You may not impose any
|
850 |
-
further restrictions on the exercise of the rights granted or affirmed
|
851 |
-
under this License.
|
852 |
-
|
853 |
-
### 5.0 Moral rights
|
854 |
-
|
855 |
-
5.1 Moral rights. This section covers moral rights, including any rights
|
856 |
-
to be identified as the author of the Database or to object to treatment
|
857 |
-
that would otherwise prejudice the author's honour and reputation, or
|
858 |
-
any other derogatory treatment:
|
859 |
-
|
860 |
-
a. For jurisdictions allowing waiver of moral rights, Licensor waives
|
861 |
-
all moral rights that Licensor may have in the Database to the fullest
|
862 |
-
extent possible by the law of the relevant jurisdiction under Section
|
863 |
-
10.4;
|
864 |
-
|
865 |
-
b. If waiver of moral rights under Section 5.1 a in the relevant
|
866 |
-
jurisdiction is not possible, Licensor agrees not to assert any moral
|
867 |
-
rights over the Database and waives all claims in moral rights to the
|
868 |
-
fullest extent possible by the law of the relevant jurisdiction under
|
869 |
-
Section 10.4; and
|
870 |
-
|
871 |
-
c. For jurisdictions not allowing waiver or an agreement not to assert
|
872 |
-
moral rights under Section 5.1 a and b, the author may retain their
|
873 |
-
moral rights over certain aspects of the Database.
|
874 |
-
|
875 |
-
Please note that some jurisdictions do not allow for the waiver of moral
|
876 |
-
rights, and so moral rights may still subsist over the Database in some
|
877 |
-
jurisdictions.
|
878 |
-
|
879 |
-
### 6.0 Fair dealing, Database exceptions, and other rights not affected
|
880 |
-
|
881 |
-
6.1 This License does not affect any rights that You or anyone else may
|
882 |
-
independently have under any applicable law to make any use of this
|
883 |
-
Database, including without limitation:
|
884 |
-
|
885 |
-
a. Exceptions to the Database Right including: Extraction of Contents
|
886 |
-
from non-electronic Databases for private purposes, Extraction for
|
887 |
-
purposes of illustration for teaching or scientific research, and
|
888 |
-
Extraction or Re-utilisation for public security or an administrative
|
889 |
-
or judicial procedure.
|
890 |
-
|
891 |
-
b. Fair dealing, fair use, or any other legally recognised limitation
|
892 |
-
or exception to infringement of copyright or other applicable laws.
|
893 |
-
|
894 |
-
6.2 This License does not affect any rights of lawful users to Extract
|
895 |
-
and Re-utilise insubstantial parts of the Contents, evaluated
|
896 |
-
quantitatively or qualitatively, for any purposes whatsoever, including
|
897 |
-
creating a Derivative Database (subject to other rights over the
|
898 |
-
Contents, see Section 2.4). The repeated and systematic Extraction or
|
899 |
-
Re-utilisation of insubstantial parts of the Contents may however amount
|
900 |
-
to the Extraction or Re-utilisation of a Substantial part of the
|
901 |
-
Contents.
|
902 |
-
|
903 |
-
### 7.0 Warranties and Disclaimer
|
904 |
-
|
905 |
-
7.1 The Database is licensed by the Licensor "as is" and without any
|
906 |
-
warranty of any kind, either express, implied, or arising by statute,
|
907 |
-
custom, course of dealing, or trade usage. Licensor specifically
|
908 |
-
disclaims any and all implied warranties or conditions of title,
|
909 |
-
non-infringement, accuracy or completeness, the presence or absence of
|
910 |
-
errors, fitness for a particular purpose, merchantability, or otherwise.
|
911 |
-
Some jurisdictions do not allow the exclusion of implied warranties, so
|
912 |
-
this exclusion may not apply to You.
|
913 |
-
|
914 |
-
### 8.0 Limitation of liability
|
915 |
-
|
916 |
-
8.1 Subject to any liability that may not be excluded or limited by law,
|
917 |
-
the Licensor is not liable for, and expressly excludes, all liability
|
918 |
-
for loss or damage however and whenever caused to anyone by any use
|
919 |
-
under this License, whether by You or by anyone else, and whether caused
|
920 |
-
by any fault on the part of the Licensor or not. This exclusion of
|
921 |
-
liability includes, but is not limited to, any special, incidental,
|
922 |
-
consequential, punitive, or exemplary damages such as loss of revenue,
|
923 |
-
data, anticipated profits, and lost business. This exclusion applies
|
924 |
-
even if the Licensor has been advised of the possibility of such
|
925 |
-
damages.
|
926 |
-
|
927 |
-
8.2 If liability may not be excluded by law, it is limited to actual and
|
928 |
-
direct financial loss to the extent it is caused by proved negligence on
|
929 |
-
the part of the Licensor.
|
930 |
-
|
931 |
-
### 9.0 Termination of Your rights under this License
|
932 |
-
|
933 |
-
9.1 Any breach by You of the terms and conditions of this License
|
934 |
-
automatically terminates this License with immediate effect and without
|
935 |
-
notice to You. For the avoidance of doubt, Persons who have received the
|
936 |
-
Database, the whole or a Substantial part of the Contents, Derivative
|
937 |
-
Databases, or the Database as part of a Collective Database from You
|
938 |
-
under this License will not have their licenses terminated provided
|
939 |
-
their use is in full compliance with this License or a license granted
|
940 |
-
under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will
|
941 |
-
survive any termination of this License.
|
942 |
-
|
943 |
-
9.2 If You are not in breach of the terms of this License, the Licensor
|
944 |
-
will not terminate Your rights under it.
|
945 |
-
|
946 |
-
9.3 Unless terminated under Section 9.1, this License is granted to You
|
947 |
-
for the duration of applicable rights in the Database.
|
948 |
-
|
949 |
-
9.4 Reinstatement of rights. If you cease any breach of the terms and
|
950 |
-
conditions of this License, then your full rights under this License
|
951 |
-
will be reinstated:
|
952 |
-
|
953 |
-
a. Provisionally and subject to permanent termination until the 60th
|
954 |
-
day after cessation of breach;
|
955 |
-
|
956 |
-
b. Permanently on the 60th day after cessation of breach unless
|
957 |
-
otherwise reasonably notified by the Licensor; or
|
958 |
-
|
959 |
-
c. Permanently if reasonably notified by the Licensor of the
|
960 |
-
violation, this is the first time You have received notice of
|
961 |
-
violation of this License from the Licensor, and You cure the
|
962 |
-
violation prior to 30 days after your receipt of the notice.
|
963 |
-
|
964 |
-
Persons subject to permanent termination of rights are not eligible to
|
965 |
-
be a recipient and receive a license under Section 4.8.
|
966 |
-
|
967 |
-
9.5 Notwithstanding the above, Licensor reserves the right to release
|
968 |
-
the Database under different license terms or to stop distributing or
|
969 |
-
making available the Database. Releasing the Database under different
|
970 |
-
license terms or stopping the distribution of the Database will not
|
971 |
-
withdraw this License (or any other license that has been, or is
|
972 |
-
required to be, granted under the terms of this License), and this
|
973 |
-
License will continue in full force and effect unless terminated as
|
974 |
-
stated above.
|
975 |
-
|
976 |
-
### 10.0 General
|
977 |
-
|
978 |
-
10.1 If any provision of this License is held to be invalid or
|
979 |
-
unenforceable, that must not affect the validity or enforceability of
|
980 |
-
the remainder of the terms and conditions of this License and each
|
981 |
-
remaining provision of this License shall be valid and enforced to the
|
982 |
-
fullest extent permitted by law.
|
983 |
-
|
984 |
-
10.2 This License is the entire agreement between the parties with
|
985 |
-
respect to the rights granted here over the Database. It replaces any
|
986 |
-
earlier understandings, agreements or representations with respect to
|
987 |
-
the Database.
|
988 |
-
|
989 |
-
10.3 If You are in breach of the terms of this License, You will not be
|
990 |
-
entitled to rely on the terms of this License or to complain of any
|
991 |
-
breach by the Licensor.
|
992 |
-
|
993 |
-
10.4 Choice of law. This License takes effect in and will be governed by
|
994 |
-
the laws of the relevant jurisdiction in which the License terms are
|
995 |
-
sought to be enforced. If the standard suite of rights granted under
|
996 |
-
applicable copyright law and Database Rights in the relevant
|
997 |
-
jurisdiction includes additional rights not granted under this License,
|
998 |
-
these additional rights are granted in this License in order to meet the
|
999 |
-
terms of this License.```
|
1000 |
-
|
1001 |
-
|
1002 |
-
|
1003 |
-
|
1004 |
# Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)
|
1005 |
|
1006 |
* Author: Explosion
|
|
|
450 |
|
451 |
|
452 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
453 |
# Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)
|
454 |
|
455 |
* Author: Explosion
|
README.md
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
accuracy.json
CHANGED
@@ -3,311 +3,155 @@
|
|
3 |
"token_p": 0.998357254,
|
4 |
"token_r": 0.9988754325,
|
5 |
"token_f": 0.9986162761,
|
6 |
-
"
|
7 |
-
"
|
8 |
-
"
|
9 |
-
"
|
10 |
-
"
|
11 |
-
"dep_las_per_type": {
|
12 |
-
"root": {
|
13 |
-
"p": 0.7385620915,
|
14 |
-
"r": 0.7325769854,
|
15 |
-
"f": 0.7355573637
|
16 |
-
},
|
17 |
-
"obl": {
|
18 |
-
"p": 0.5110132159,
|
19 |
-
"r": 0.5296803653,
|
20 |
-
"f": 0.5201793722
|
21 |
-
},
|
22 |
-
"nmod": {
|
23 |
-
"p": 0.7538644471,
|
24 |
-
"r": 0.7624774504,
|
25 |
-
"f": 0.7581464873
|
26 |
-
},
|
27 |
-
"amod": {
|
28 |
-
"p": 0.7503805175,
|
29 |
-
"r": 0.7458396369,
|
30 |
-
"f": 0.7481031866
|
31 |
-
},
|
32 |
-
"cc": {
|
33 |
-
"p": 0.7351129363,
|
34 |
-
"r": 0.7665952891,
|
35 |
-
"f": 0.750524109
|
36 |
-
},
|
37 |
-
"conj": {
|
38 |
-
"p": 0.4879032258,
|
39 |
-
"r": 0.5475113122,
|
40 |
-
"f": 0.5159914712
|
41 |
-
},
|
42 |
-
"obl:arg": {
|
43 |
-
"p": 0.5672131148,
|
44 |
-
"r": 0.5831460674,
|
45 |
-
"f": 0.5750692521
|
46 |
-
},
|
47 |
-
"acl": {
|
48 |
-
"p": 0.4695945946,
|
49 |
-
"r": 0.4527687296,
|
50 |
-
"f": 0.4610281924
|
51 |
-
},
|
52 |
-
"advmod": {
|
53 |
-
"p": 0.7456359102,
|
54 |
-
"r": 0.736453202,
|
55 |
-
"f": 0.741016109
|
56 |
-
},
|
57 |
-
"det": {
|
58 |
-
"p": 0.7043010753,
|
59 |
-
"r": 0.8238993711,
|
60 |
-
"f": 0.7594202899
|
61 |
-
},
|
62 |
-
"xcomp": {
|
63 |
-
"p": 0.7944664032,
|
64 |
-
"r": 0.858974359,
|
65 |
-
"f": 0.8254620123
|
66 |
-
},
|
67 |
-
"advcl": {
|
68 |
-
"p": 0.4106280193,
|
69 |
-
"r": 0.3373015873,
|
70 |
-
"f": 0.3703703704
|
71 |
-
},
|
72 |
-
"parataxis": {
|
73 |
-
"p": 0.4444444444,
|
74 |
-
"r": 0.3636363636,
|
75 |
-
"f": 0.4
|
76 |
-
},
|
77 |
-
"advmod:emph": {
|
78 |
-
"p": 0.6756756757,
|
79 |
-
"r": 0.5841121495,
|
80 |
-
"f": 0.626566416
|
81 |
-
},
|
82 |
-
"nsubj": {
|
83 |
-
"p": 0.7166392092,
|
84 |
-
"r": 0.7038834951,
|
85 |
-
"f": 0.7102040816
|
86 |
-
},
|
87 |
-
"acl:relcl": {
|
88 |
-
"p": 0.6865671642,
|
89 |
-
"r": 0.6388888889,
|
90 |
-
"f": 0.6618705036
|
91 |
-
},
|
92 |
-
"case": {
|
93 |
-
"p": 0.8396946565,
|
94 |
-
"r": 0.8291457286,
|
95 |
-
"f": 0.8343868521
|
96 |
-
},
|
97 |
-
"csubj": {
|
98 |
-
"p": 0.5454545455,
|
99 |
-
"r": 0.375,
|
100 |
-
"f": 0.4444444444
|
101 |
-
},
|
102 |
-
"mark": {
|
103 |
-
"p": 0.7863247863,
|
104 |
-
"r": 0.7796610169,
|
105 |
-
"f": 0.7829787234
|
106 |
-
},
|
107 |
-
"cop": {
|
108 |
-
"p": 0.7697841727,
|
109 |
-
"r": 0.8294573643,
|
110 |
-
"f": 0.7985074627
|
111 |
-
},
|
112 |
-
"obj": {
|
113 |
-
"p": 0.8015665796,
|
114 |
-
"r": 0.7561576355,
|
115 |
-
"f": 0.7782002535
|
116 |
-
},
|
117 |
-
"dep": {
|
118 |
-
"p": 0.0,
|
119 |
-
"r": 0.0,
|
120 |
-
"f": 0.0
|
121 |
-
},
|
122 |
-
"ccomp": {
|
123 |
-
"p": 0.6395348837,
|
124 |
-
"r": 0.625,
|
125 |
-
"f": 0.632183908
|
126 |
-
},
|
127 |
-
"appos": {
|
128 |
-
"p": 0.7333333333,
|
129 |
-
"r": 0.4230769231,
|
130 |
-
"f": 0.5365853659
|
131 |
-
},
|
132 |
-
"nummod": {
|
133 |
-
"p": 0.7099236641,
|
134 |
-
"r": 0.6739130435,
|
135 |
-
"f": 0.6914498141
|
136 |
-
},
|
137 |
-
"nummod:gov": {
|
138 |
-
"p": 0.0,
|
139 |
-
"r": 0.0,
|
140 |
-
"f": 0.0
|
141 |
-
},
|
142 |
-
"flat": {
|
143 |
-
"p": 0.3541666667,
|
144 |
-
"r": 0.1603773585,
|
145 |
-
"f": 0.2207792208
|
146 |
-
},
|
147 |
-
"nsubj:pass": {
|
148 |
-
"p": 0.5,
|
149 |
-
"r": 0.4470588235,
|
150 |
-
"f": 0.4720496894
|
151 |
-
},
|
152 |
-
"flat:foreign": {
|
153 |
-
"p": 0.0,
|
154 |
-
"r": 0.0,
|
155 |
-
"f": 0.0
|
156 |
-
},
|
157 |
-
"csubj:pass": {
|
158 |
-
"p": 0.0,
|
159 |
-
"r": 0.0,
|
160 |
-
"f": 0.0
|
161 |
-
},
|
162 |
-
"iobj": {
|
163 |
-
"p": 0.0,
|
164 |
-
"r": 0.0,
|
165 |
-
"f": 0.0
|
166 |
-
}
|
167 |
-
},
|
168 |
-
"ents_p": 0.750907441,
|
169 |
-
"ents_r": 0.827913957,
|
170 |
-
"ents_f": 0.7875327147,
|
171 |
-
"ents_per_type": {
|
172 |
-
"PERSON": {
|
173 |
-
"p": 0.0,
|
174 |
-
"r": 0.0,
|
175 |
-
"f": 0.0
|
176 |
-
},
|
177 |
-
"GPE": {
|
178 |
-
"p": 0.0,
|
179 |
-
"r": 0.0,
|
180 |
-
"f": 0.0
|
181 |
-
},
|
182 |
-
"PRODUCT": {
|
183 |
-
"p": 0.0,
|
184 |
-
"r": 0.0,
|
185 |
-
"f": 0.0
|
186 |
-
},
|
187 |
-
"ORG": {
|
188 |
-
"p": 0.0,
|
189 |
-
"r": 0.0,
|
190 |
-
"f": 0.0
|
191 |
-
},
|
192 |
-
"LOC": {
|
193 |
-
"p": 0.0,
|
194 |
-
"r": 0.0,
|
195 |
-
"f": 0.0
|
196 |
-
},
|
197 |
-
"TIME": {
|
198 |
-
"p": 0.0,
|
199 |
-
"r": 0.0,
|
200 |
-
"f": 0.0
|
201 |
-
}
|
202 |
-
},
|
203 |
-
"speed": 6365.8683040846,
|
204 |
-
"pos_acc": 0.9496907038,
|
205 |
-
"morph_acc": 0.8704416663,
|
206 |
-
"morph_micro_p": 0.9129765114,
|
207 |
-
"morph_micro_r": 0.9070390207,
|
208 |
-
"morph_micro_f": 0.909998081,
|
209 |
"morph_per_feat": {
|
210 |
"Case": {
|
211 |
-
"p": 0.
|
212 |
"r": 0.924287119,
|
213 |
-
"f": 0.
|
214 |
},
|
215 |
"Gender": {
|
216 |
-
"p": 0.
|
217 |
-
"r": 0.
|
218 |
-
"f": 0.
|
219 |
},
|
220 |
"Number": {
|
221 |
-
"p": 0.
|
222 |
-
"r": 0.
|
223 |
-
"f": 0.
|
224 |
},
|
225 |
"Definite": {
|
226 |
-
"p": 0.
|
227 |
-
"r": 0.
|
228 |
-
"f": 0.
|
229 |
},
|
230 |
"Degree": {
|
231 |
-
"p": 0.
|
232 |
-
"r": 0.
|
233 |
-
"f": 0.
|
234 |
},
|
235 |
"Polarity": {
|
236 |
-
"p": 0.
|
237 |
-
"r": 0.
|
238 |
-
"f": 0.
|
239 |
},
|
240 |
"Tense": {
|
241 |
-
"p": 0.
|
242 |
-
"r": 0.
|
243 |
-
"f": 0.
|
244 |
},
|
245 |
"VerbForm": {
|
246 |
-
"p": 0.
|
247 |
-
"r": 0.
|
248 |
-
"f": 0.
|
249 |
},
|
250 |
"Voice": {
|
251 |
-
"p": 0.
|
252 |
-
"r": 0.
|
253 |
-
"f": 0.
|
254 |
},
|
255 |
"PronType": {
|
256 |
-
"p": 0.
|
257 |
-
"r": 0.
|
258 |
-
"f": 0.
|
259 |
},
|
260 |
"Aspect": {
|
261 |
-
"p": 0.
|
262 |
-
"r": 0.
|
263 |
-
"f": 0.
|
264 |
},
|
265 |
"Hyph": {
|
266 |
-
"p": 0.
|
267 |
-
"r": 0.
|
268 |
-
"f": 0.
|
269 |
},
|
270 |
"Reflex": {
|
271 |
-
"p": 0.
|
272 |
-
"r": 0.
|
273 |
-
"f": 0.
|
274 |
},
|
275 |
"Mood": {
|
276 |
-
"p": 0.
|
277 |
-
"r": 0.
|
278 |
-
"f": 0.
|
279 |
},
|
280 |
"Person": {
|
281 |
-
"p": 0.
|
282 |
-
"r": 0.
|
283 |
-
"f": 0.
|
284 |
},
|
285 |
"AdpType": {
|
286 |
-
"p":
|
287 |
-
"r": 0.
|
288 |
-
"f": 0.
|
289 |
},
|
290 |
"NumForm": {
|
291 |
-
"p": 0.
|
292 |
"r": 0.89,
|
293 |
-
"f": 0.
|
294 |
-
},
|
295 |
-
"NumType": {
|
296 |
-
"p": 0.8,
|
297 |
-
"r": 0.6428571429,
|
298 |
-
"f": 0.7128712871
|
299 |
},
|
300 |
"Abbr": {
|
301 |
-
"p": 0.
|
302 |
"r": 0.943231441,
|
303 |
-
"f": 0.
|
304 |
},
|
305 |
"Foreign": {
|
306 |
-
"p": 0.
|
307 |
-
"r": 0.
|
308 |
-
"f": 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
309 |
}
|
310 |
},
|
311 |
-
"
|
312 |
-
"lemma_acc": 0.7106344332
|
313 |
}
|
|
|
3 |
"token_p": 0.998357254,
|
4 |
"token_r": 0.9988754325,
|
5 |
"token_f": 0.9986162761,
|
6 |
+
"pos_acc": 0.9468766223,
|
7 |
+
"morph_acc": 0.8705658418,
|
8 |
+
"morph_micro_p": 0.9154675098,
|
9 |
+
"morph_micro_r": 0.9045524101,
|
10 |
+
"morph_micro_f": 0.9099772297,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
11 |
"morph_per_feat": {
|
12 |
"Case": {
|
13 |
+
"p": 0.9271740917,
|
14 |
"r": 0.924287119,
|
15 |
+
"f": 0.9257283545
|
16 |
},
|
17 |
"Gender": {
|
18 |
+
"p": 0.9329511899,
|
19 |
+
"r": 0.9327891629,
|
20 |
+
"f": 0.9328701693
|
21 |
},
|
22 |
"Number": {
|
23 |
+
"p": 0.9187705818,
|
24 |
+
"r": 0.9158980772,
|
25 |
+
"f": 0.9173320808
|
26 |
},
|
27 |
"Definite": {
|
28 |
+
"p": 0.9205955335,
|
29 |
+
"r": 0.9013605442,
|
30 |
+
"f": 0.9108765038
|
31 |
},
|
32 |
"Degree": {
|
33 |
+
"p": 0.8616144975,
|
34 |
+
"r": 0.8723936614,
|
35 |
+
"f": 0.866970576
|
36 |
},
|
37 |
"Polarity": {
|
38 |
+
"p": 0.9304725693,
|
39 |
+
"r": 0.907309322,
|
40 |
+
"f": 0.9187449718
|
41 |
},
|
42 |
"Tense": {
|
43 |
+
"p": 0.8707224335,
|
44 |
+
"r": 0.8400586941,
|
45 |
+
"f": 0.855115758
|
46 |
},
|
47 |
"VerbForm": {
|
48 |
+
"p": 0.9033134166,
|
49 |
+
"r": 0.8808262712,
|
50 |
+
"f": 0.8919281309
|
51 |
},
|
52 |
"Voice": {
|
53 |
+
"p": 0.8136645963,
|
54 |
+
"r": 0.741509434,
|
55 |
+
"f": 0.7759131293
|
56 |
},
|
57 |
"PronType": {
|
58 |
+
"p": 0.9304635762,
|
59 |
+
"r": 0.9273927393,
|
60 |
+
"f": 0.9289256198
|
61 |
},
|
62 |
"Aspect": {
|
63 |
+
"p": 0.8342696629,
|
64 |
+
"r": 0.8181818182,
|
65 |
+
"f": 0.826147427
|
66 |
},
|
67 |
"Hyph": {
|
68 |
+
"p": 0.9037433155,
|
69 |
+
"r": 0.8989361702,
|
70 |
+
"f": 0.9013333333
|
71 |
},
|
72 |
"Reflex": {
|
73 |
+
"p": 0.7579908676,
|
74 |
+
"r": 0.6125461255,
|
75 |
+
"f": 0.6775510204
|
76 |
},
|
77 |
"Mood": {
|
78 |
+
"p": 0.9086479903,
|
79 |
+
"r": 0.8923444976,
|
80 |
+
"f": 0.9004224502
|
81 |
},
|
82 |
"Person": {
|
83 |
+
"p": 0.9114688129,
|
84 |
+
"r": 0.9041916168,
|
85 |
+
"f": 0.9078156313
|
86 |
},
|
87 |
"AdpType": {
|
88 |
+
"p": 0.9899749373,
|
89 |
+
"r": 0.9825870647,
|
90 |
+
"f": 0.986267166
|
91 |
},
|
92 |
"NumForm": {
|
93 |
+
"p": 0.9417989418,
|
94 |
"r": 0.89,
|
95 |
+
"f": 0.9151670951
|
|
|
|
|
|
|
|
|
|
|
96 |
},
|
97 |
"Abbr": {
|
98 |
+
"p": 0.9642857143,
|
99 |
"r": 0.943231441,
|
100 |
+
"f": 0.9536423841
|
101 |
},
|
102 |
"Foreign": {
|
103 |
+
"p": 0.6551724138,
|
104 |
+
"r": 0.59375,
|
105 |
+
"f": 0.6229508197
|
106 |
+
},
|
107 |
+
"NumType": {
|
108 |
+
"p": 0.7608695652,
|
109 |
+
"r": 0.625,
|
110 |
+
"f": 0.6862745098
|
111 |
+
}
|
112 |
+
},
|
113 |
+
"tag_acc": 0.8632116283,
|
114 |
+
"sents_p": 0.8447712418,
|
115 |
+
"sents_r": 0.8379254457,
|
116 |
+
"sents_f": 0.8413344182,
|
117 |
+
"dep_uas": 0.7353933769,
|
118 |
+
"dep_las": 0.6609365113,
|
119 |
+
"dep_las_per_type": {},
|
120 |
+
"lemma_acc": 0.8484193228,
|
121 |
+
"ents_p": 0.7557354926,
|
122 |
+
"ents_r": 0.8404202101,
|
123 |
+
"ents_f": 0.7958313595,
|
124 |
+
"ents_per_type": {
|
125 |
+
"ORG": {
|
126 |
+
"p": 0.6943866944,
|
127 |
+
"r": 0.7625570776,
|
128 |
+
"f": 0.7268770403
|
129 |
+
},
|
130 |
+
"TIME": {
|
131 |
+
"p": 0.7280334728,
|
132 |
+
"r": 0.7909090909,
|
133 |
+
"f": 0.7581699346
|
134 |
+
},
|
135 |
+
"LOC": {
|
136 |
+
"p": 0.7134502924,
|
137 |
+
"r": 0.7554179567,
|
138 |
+
"f": 0.7338345865
|
139 |
+
},
|
140 |
+
"PRODUCT": {
|
141 |
+
"p": 0.3829787234,
|
142 |
+
"r": 0.5714285714,
|
143 |
+
"f": 0.4585987261
|
144 |
+
},
|
145 |
+
"GPE": {
|
146 |
+
"p": 0.7651663405,
|
147 |
+
"r": 0.9654320988,
|
148 |
+
"f": 0.8537117904
|
149 |
+
},
|
150 |
+
"PERSON": {
|
151 |
+
"p": 0.9010791367,
|
152 |
+
"r": 0.9109090909,
|
153 |
+
"f": 0.9059674503
|
154 |
}
|
155 |
},
|
156 |
+
"speed": 9529.4689235955
|
|
|
157 |
}
|
attribute_ruler/patterns
CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
|
|
config.cfg
CHANGED
@@ -10,7 +10,7 @@ seed = 0
|
|
10 |
|
11 |
[nlp]
|
12 |
lang = "lt"
|
13 |
-
pipeline = ["tok2vec","morphologizer","tagger","parser","
|
14 |
disabled = ["senter"]
|
15 |
before_creation = null
|
16 |
after_creation = null
|
@@ -26,11 +26,22 @@ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
|
|
26 |
validate = false
|
27 |
|
28 |
[components.lemmatizer]
|
29 |
-
factory = "
|
30 |
-
|
31 |
-
|
32 |
overwrite = false
|
33 |
scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
34 |
|
35 |
[components.morphologizer]
|
36 |
factory = "morphologizer"
|
@@ -39,8 +50,9 @@ overwrite = true
|
|
39 |
scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
|
40 |
|
41 |
[components.morphologizer.model]
|
42 |
-
@architectures = "spacy.Tagger.
|
43 |
nO = null
|
|
|
44 |
|
45 |
[components.morphologizer.model.tok2vec]
|
46 |
@architectures = "spacy.Tok2VecListener.v1"
|
@@ -70,7 +82,7 @@ nO = null
|
|
70 |
@architectures = "spacy.MultiHashEmbed.v2"
|
71 |
width = 96
|
72 |
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
73 |
-
rows = [5000,
|
74 |
include_static_vectors = true
|
75 |
|
76 |
[components.ner.model.tok2vec.encode]
|
@@ -108,8 +120,9 @@ overwrite = false
|
|
108 |
scorer = {"@scorers":"spacy.senter_scorer.v1"}
|
109 |
|
110 |
[components.senter.model]
|
111 |
-
@architectures = "spacy.Tagger.
|
112 |
nO = null
|
|
|
113 |
|
114 |
[components.senter.model.tok2vec]
|
115 |
@architectures = "spacy.Tok2Vec.v2"
|
@@ -130,12 +143,14 @@ maxout_pieces = 2
|
|
130 |
|
131 |
[components.tagger]
|
132 |
factory = "tagger"
|
|
|
133 |
overwrite = false
|
134 |
scorer = {"@scorers":"spacy.tagger_scorer.v1"}
|
135 |
|
136 |
[components.tagger.model]
|
137 |
-
@architectures = "spacy.Tagger.
|
138 |
nO = null
|
|
|
139 |
|
140 |
[components.tagger.model.tok2vec]
|
141 |
@architectures = "spacy.Tok2VecListener.v1"
|
@@ -152,7 +167,7 @@ factory = "tok2vec"
|
|
152 |
@architectures = "spacy.MultiHashEmbed.v2"
|
153 |
width = ${components.tok2vec.model.encode:width}
|
154 |
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
155 |
-
rows = [5000,
|
156 |
include_static_vectors = true
|
157 |
|
158 |
[components.tok2vec.model.encode]
|
@@ -189,7 +204,7 @@ dropout = 0.1
|
|
189 |
accumulate_gradient = 1
|
190 |
patience = 5000
|
191 |
max_epochs = 0
|
192 |
-
max_steps =
|
193 |
eval_frequency = 1000
|
194 |
frozen_components = []
|
195 |
before_to_disk = null
|
@@ -224,18 +239,18 @@ eps = 0.00000001
|
|
224 |
learn_rate = 0.001
|
225 |
|
226 |
[training.score_weights]
|
227 |
-
pos_acc = 0.
|
228 |
-
morph_acc = 0.
|
229 |
morph_per_feat = null
|
230 |
-
tag_acc = 0.
|
231 |
dep_uas = 0.0
|
232 |
-
dep_las = 0.
|
233 |
dep_las_per_type = null
|
234 |
sents_p = null
|
235 |
sents_r = null
|
236 |
-
sents_f = 0.
|
237 |
-
lemma_acc = 0.
|
238 |
-
ents_f = 0.
|
239 |
ents_p = 0.0
|
240 |
ents_r = 0.0
|
241 |
ents_per_type = null
|
@@ -252,6 +267,13 @@ after_init = null
|
|
252 |
|
253 |
[initialize.components]
|
254 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
255 |
[initialize.components.morphologizer]
|
256 |
|
257 |
[initialize.components.morphologizer.labels]
|
|
|
10 |
|
11 |
[nlp]
|
12 |
lang = "lt"
|
13 |
+
pipeline = ["tok2vec","morphologizer","tagger","parser","lemmatizer","senter","attribute_ruler","ner"]
|
14 |
disabled = ["senter"]
|
15 |
before_creation = null
|
16 |
after_creation = null
|
|
|
26 |
validate = false
|
27 |
|
28 |
[components.lemmatizer]
|
29 |
+
factory = "trainable_lemmatizer"
|
30 |
+
backoff = "orth"
|
31 |
+
min_tree_freq = 3
|
32 |
overwrite = false
|
33 |
scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
|
34 |
+
top_k = 1
|
35 |
+
|
36 |
+
[components.lemmatizer.model]
|
37 |
+
@architectures = "spacy.Tagger.v2"
|
38 |
+
nO = null
|
39 |
+
normalize = false
|
40 |
+
|
41 |
+
[components.lemmatizer.model.tok2vec]
|
42 |
+
@architectures = "spacy.Tok2VecListener.v1"
|
43 |
+
width = ${components.tok2vec.model.encode:width}
|
44 |
+
upstream = "tok2vec"
|
45 |
|
46 |
[components.morphologizer]
|
47 |
factory = "morphologizer"
|
|
|
50 |
scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
|
51 |
|
52 |
[components.morphologizer.model]
|
53 |
+
@architectures = "spacy.Tagger.v2"
|
54 |
nO = null
|
55 |
+
normalize = false
|
56 |
|
57 |
[components.morphologizer.model.tok2vec]
|
58 |
@architectures = "spacy.Tok2VecListener.v1"
|
|
|
82 |
@architectures = "spacy.MultiHashEmbed.v2"
|
83 |
width = 96
|
84 |
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
85 |
+
rows = [5000,1000,2500,2500,50]
|
86 |
include_static_vectors = true
|
87 |
|
88 |
[components.ner.model.tok2vec.encode]
|
|
|
120 |
scorer = {"@scorers":"spacy.senter_scorer.v1"}
|
121 |
|
122 |
[components.senter.model]
|
123 |
+
@architectures = "spacy.Tagger.v2"
|
124 |
nO = null
|
125 |
+
normalize = false
|
126 |
|
127 |
[components.senter.model.tok2vec]
|
128 |
@architectures = "spacy.Tok2Vec.v2"
|
|
|
143 |
|
144 |
[components.tagger]
|
145 |
factory = "tagger"
|
146 |
+
neg_prefix = "!"
|
147 |
overwrite = false
|
148 |
scorer = {"@scorers":"spacy.tagger_scorer.v1"}
|
149 |
|
150 |
[components.tagger.model]
|
151 |
+
@architectures = "spacy.Tagger.v2"
|
152 |
nO = null
|
153 |
+
normalize = false
|
154 |
|
155 |
[components.tagger.model.tok2vec]
|
156 |
@architectures = "spacy.Tok2VecListener.v1"
|
|
|
167 |
@architectures = "spacy.MultiHashEmbed.v2"
|
168 |
width = ${components.tok2vec.model.encode:width}
|
169 |
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
170 |
+
rows = [5000,1000,2500,2500,50]
|
171 |
include_static_vectors = true
|
172 |
|
173 |
[components.tok2vec.model.encode]
|
|
|
204 |
accumulate_gradient = 1
|
205 |
patience = 5000
|
206 |
max_epochs = 0
|
207 |
+
max_steps = 100000
|
208 |
eval_frequency = 1000
|
209 |
frozen_components = []
|
210 |
before_to_disk = null
|
|
|
239 |
learn_rate = 0.001
|
240 |
|
241 |
[training.score_weights]
|
242 |
+
pos_acc = 0.1
|
243 |
+
morph_acc = 0.09
|
244 |
morph_per_feat = null
|
245 |
+
tag_acc = 0.1
|
246 |
dep_uas = 0.0
|
247 |
+
dep_las = 0.29
|
248 |
dep_las_per_type = null
|
249 |
sents_p = null
|
250 |
sents_r = null
|
251 |
+
sents_f = 0.04
|
252 |
+
lemma_acc = 0.1
|
253 |
+
ents_f = 0.29
|
254 |
ents_p = 0.0
|
255 |
ents_r = 0.0
|
256 |
ents_per_type = null
|
|
|
267 |
|
268 |
[initialize.components]
|
269 |
|
270 |
+
[initialize.components.lemmatizer]
|
271 |
+
|
272 |
+
[initialize.components.lemmatizer.labels]
|
273 |
+
@readers = "spacy.read_labels.v1"
|
274 |
+
path = "corpus/labels/trainable_lemmatizer.json"
|
275 |
+
require = false
|
276 |
+
|
277 |
[initialize.components.morphologizer]
|
278 |
|
279 |
[initialize.components.morphologizer.labels]
|
lemmatizer/cfg
ADDED
@@ -0,0 +1,730 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"labels":[
|
3 |
+
0,
|
4 |
+
2,
|
5 |
+
4,
|
6 |
+
6,
|
7 |
+
9,
|
8 |
+
11,
|
9 |
+
13,
|
10 |
+
15,
|
11 |
+
18,
|
12 |
+
20,
|
13 |
+
24,
|
14 |
+
26,
|
15 |
+
28,
|
16 |
+
31,
|
17 |
+
33,
|
18 |
+
35,
|
19 |
+
37,
|
20 |
+
39,
|
21 |
+
41,
|
22 |
+
43,
|
23 |
+
45,
|
24 |
+
47,
|
25 |
+
50,
|
26 |
+
54,
|
27 |
+
58,
|
28 |
+
60,
|
29 |
+
62,
|
30 |
+
64,
|
31 |
+
66,
|
32 |
+
68,
|
33 |
+
70,
|
34 |
+
71,
|
35 |
+
73,
|
36 |
+
75,
|
37 |
+
77,
|
38 |
+
80,
|
39 |
+
82,
|
40 |
+
84,
|
41 |
+
86,
|
42 |
+
88,
|
43 |
+
90,
|
44 |
+
92,
|
45 |
+
94,
|
46 |
+
97,
|
47 |
+
99,
|
48 |
+
101,
|
49 |
+
103,
|
50 |
+
104,
|
51 |
+
106,
|
52 |
+
108,
|
53 |
+
110,
|
54 |
+
112,
|
55 |
+
114,
|
56 |
+
116,
|
57 |
+
118,
|
58 |
+
120,
|
59 |
+
122,
|
60 |
+
124,
|
61 |
+
126,
|
62 |
+
128,
|
63 |
+
130,
|
64 |
+
133,
|
65 |
+
136,
|
66 |
+
138,
|
67 |
+
141,
|
68 |
+
143,
|
69 |
+
146,
|
70 |
+
149,
|
71 |
+
152,
|
72 |
+
154,
|
73 |
+
157,
|
74 |
+
159,
|
75 |
+
161,
|
76 |
+
163,
|
77 |
+
165,
|
78 |
+
167,
|
79 |
+
169,
|
80 |
+
173,
|
81 |
+
175,
|
82 |
+
177,
|
83 |
+
179,
|
84 |
+
180,
|
85 |
+
181,
|
86 |
+
182,
|
87 |
+
184,
|
88 |
+
187,
|
89 |
+
189,
|
90 |
+
191,
|
91 |
+
192,
|
92 |
+
194,
|
93 |
+
196,
|
94 |
+
199,
|
95 |
+
200,
|
96 |
+
202,
|
97 |
+
203,
|
98 |
+
205,
|
99 |
+
207,
|
100 |
+
208,
|
101 |
+
210,
|
102 |
+
213,
|
103 |
+
217,
|
104 |
+
221,
|
105 |
+
224,
|
106 |
+
226,
|
107 |
+
227,
|
108 |
+
230,
|
109 |
+
233,
|
110 |
+
235,
|
111 |
+
237,
|
112 |
+
239,
|
113 |
+
241,
|
114 |
+
244,
|
115 |
+
246,
|
116 |
+
248,
|
117 |
+
250,
|
118 |
+
253,
|
119 |
+
256,
|
120 |
+
258,
|
121 |
+
260,
|
122 |
+
262,
|
123 |
+
263,
|
124 |
+
265,
|
125 |
+
267,
|
126 |
+
270,
|
127 |
+
273,
|
128 |
+
275,
|
129 |
+
277,
|
130 |
+
280,
|
131 |
+
281,
|
132 |
+
284,
|
133 |
+
286,
|
134 |
+
289,
|
135 |
+
292,
|
136 |
+
294,
|
137 |
+
297,
|
138 |
+
300,
|
139 |
+
301,
|
140 |
+
303,
|
141 |
+
306,
|
142 |
+
308,
|
143 |
+
311,
|
144 |
+
314,
|
145 |
+
315,
|
146 |
+
318,
|
147 |
+
321,
|
148 |
+
323,
|
149 |
+
326,
|
150 |
+
329,
|
151 |
+
331,
|
152 |
+
333,
|
153 |
+
335,
|
154 |
+
337,
|
155 |
+
339,
|
156 |
+
340,
|
157 |
+
342,
|
158 |
+
344,
|
159 |
+
346,
|
160 |
+
348,
|
161 |
+
350,
|
162 |
+
352,
|
163 |
+
353,
|
164 |
+
354,
|
165 |
+
356,
|
166 |
+
358,
|
167 |
+
359,
|
168 |
+
361,
|
169 |
+
363,
|
170 |
+
365,
|
171 |
+
367,
|
172 |
+
370,
|
173 |
+
371,
|
174 |
+
373,
|
175 |
+
376,
|
176 |
+
378,
|
177 |
+
380,
|
178 |
+
383,
|
179 |
+
386,
|
180 |
+
389,
|
181 |
+
391,
|
182 |
+
392,
|
183 |
+
394,
|
184 |
+
396,
|
185 |
+
399,
|
186 |
+
400,
|
187 |
+
402,
|
188 |
+
404,
|
189 |
+
407,
|
190 |
+
410,
|
191 |
+
414,
|
192 |
+
416,
|
193 |
+
419,
|
194 |
+
421,
|
195 |
+
422,
|
196 |
+
424,
|
197 |
+
426,
|
198 |
+
427,
|
199 |
+
429,
|
200 |
+
431,
|
201 |
+
432,
|
202 |
+
434,
|
203 |
+
435,
|
204 |
+
436,
|
205 |
+
438,
|
206 |
+
439,
|
207 |
+
441,
|
208 |
+
442,
|
209 |
+
444,
|
210 |
+
445,
|
211 |
+
447,
|
212 |
+
449,
|
213 |
+
451,
|
214 |
+
454,
|
215 |
+
455,
|
216 |
+
458,
|
217 |
+
460,
|
218 |
+
463,
|
219 |
+
465,
|
220 |
+
467,
|
221 |
+
469,
|
222 |
+
472,
|
223 |
+
473,
|
224 |
+
475,
|
225 |
+
478,
|
226 |
+
480,
|
227 |
+
483,
|
228 |
+
485,
|
229 |
+
490,
|
230 |
+
494,
|
231 |
+
496,
|
232 |
+
498,
|
233 |
+
500,
|
234 |
+
502,
|
235 |
+
504,
|
236 |
+
506,
|
237 |
+
509,
|
238 |
+
512,
|
239 |
+
513,
|
240 |
+
515,
|
241 |
+
517,
|
242 |
+
519,
|
243 |
+
522,
|
244 |
+
524,
|
245 |
+
527,
|
246 |
+
530,
|
247 |
+
532,
|
248 |
+
534,
|
249 |
+
538,
|
250 |
+
540,
|
251 |
+
541,
|
252 |
+
543,
|
253 |
+
545,
|
254 |
+
548,
|
255 |
+
551,
|
256 |
+
552,
|
257 |
+
554,
|
258 |
+
557,
|
259 |
+
560,
|
260 |
+
562,
|
261 |
+
564,
|
262 |
+
567,
|
263 |
+
570,
|
264 |
+
573,
|
265 |
+
577,
|
266 |
+
579,
|
267 |
+
581,
|
268 |
+
582,
|
269 |
+
583,
|
270 |
+
584,
|
271 |
+
585,
|
272 |
+
587,
|
273 |
+
588,
|
274 |
+
590,
|
275 |
+
593,
|
276 |
+
594,
|
277 |
+
596,
|
278 |
+
598,
|
279 |
+
602,
|
280 |
+
605,
|
281 |
+
607,
|
282 |
+
609,
|
283 |
+
611,
|
284 |
+
613,
|
285 |
+
615,
|
286 |
+
617,
|
287 |
+
620,
|
288 |
+
623,
|
289 |
+
625,
|
290 |
+
629,
|
291 |
+
631,
|
292 |
+
634,
|
293 |
+
636,
|
294 |
+
638,
|
295 |
+
640,
|
296 |
+
641,
|
297 |
+
643,
|
298 |
+
644,
|
299 |
+
645,
|
300 |
+
648,
|
301 |
+
650,
|
302 |
+
652,
|
303 |
+
654,
|
304 |
+
656,
|
305 |
+
657,
|
306 |
+
659,
|
307 |
+
660,
|
308 |
+
661,
|
309 |
+
662,
|
310 |
+
663,
|
311 |
+
322,
|
312 |
+
664,
|
313 |
+
665,
|
314 |
+
666,
|
315 |
+
667,
|
316 |
+
670,
|
317 |
+
671,
|
318 |
+
674,
|
319 |
+
677,
|
320 |
+
679,
|
321 |
+
681,
|
322 |
+
682,
|
323 |
+
685,
|
324 |
+
687,
|
325 |
+
688,
|
326 |
+
689,
|
327 |
+
690,
|
328 |
+
691,
|
329 |
+
693,
|
330 |
+
695,
|
331 |
+
697,
|
332 |
+
699,
|
333 |
+
701,
|
334 |
+
705,
|
335 |
+
706,
|
336 |
+
707,
|
337 |
+
708,
|
338 |
+
709,
|
339 |
+
711,
|
340 |
+
714,
|
341 |
+
716,
|
342 |
+
718,
|
343 |
+
720,
|
344 |
+
722,
|
345 |
+
724,
|
346 |
+
725,
|
347 |
+
727,
|
348 |
+
730,
|
349 |
+
731,
|
350 |
+
732,
|
351 |
+
734,
|
352 |
+
737,
|
353 |
+
739,
|
354 |
+
741,
|
355 |
+
742,
|
356 |
+
744,
|
357 |
+
745,
|
358 |
+
748,
|
359 |
+
750,
|
360 |
+
752,
|
361 |
+
753,
|
362 |
+
755,
|
363 |
+
757,
|
364 |
+
759,
|
365 |
+
762,
|
366 |
+
766,
|
367 |
+
768,
|
368 |
+
770,
|
369 |
+
773,
|
370 |
+
775,
|
371 |
+
777,
|
372 |
+
780,
|
373 |
+
783,
|
374 |
+
784,
|
375 |
+
785,
|
376 |
+
787,
|
377 |
+
789,
|
378 |
+
791,
|
379 |
+
793,
|
380 |
+
795,
|
381 |
+
798,
|
382 |
+
800,
|
383 |
+
803,
|
384 |
+
806,
|
385 |
+
807,
|
386 |
+
811,
|
387 |
+
813,
|
388 |
+
815,
|
389 |
+
817,
|
390 |
+
819,
|
391 |
+
820,
|
392 |
+
822,
|
393 |
+
824,
|
394 |
+
827,
|
395 |
+
830,
|
396 |
+
833,
|
397 |
+
835,
|
398 |
+
837,
|
399 |
+
838,
|
400 |
+
840,
|
401 |
+
843,
|
402 |
+
844,
|
403 |
+
846,
|
404 |
+
849,
|
405 |
+
851,
|
406 |
+
853,
|
407 |
+
854,
|
408 |
+
857,
|
409 |
+
860,
|
410 |
+
863,
|
411 |
+
865,
|
412 |
+
868,
|
413 |
+
869,
|
414 |
+
871,
|
415 |
+
872,
|
416 |
+
874,
|
417 |
+
876,
|
418 |
+
879,
|
419 |
+
881,
|
420 |
+
883,
|
421 |
+
885,
|
422 |
+
888,
|
423 |
+
891,
|
424 |
+
893,
|
425 |
+
895,
|
426 |
+
897,
|
427 |
+
899,
|
428 |
+
901,
|
429 |
+
904,
|
430 |
+
905,
|
431 |
+
906,
|
432 |
+
909,
|
433 |
+
912,
|
434 |
+
913,
|
435 |
+
914,
|
436 |
+
916,
|
437 |
+
919,
|
438 |
+
920,
|
439 |
+
922,
|
440 |
+
923,
|
441 |
+
924,
|
442 |
+
927,
|
443 |
+
929,
|
444 |
+
931,
|
445 |
+
933,
|
446 |
+
935,
|
447 |
+
936,
|
448 |
+
939,
|
449 |
+
941,
|
450 |
+
943,
|
451 |
+
945,
|
452 |
+
948,
|
453 |
+
950,
|
454 |
+
952,
|
455 |
+
955,
|
456 |
+
957,
|
457 |
+
959,
|
458 |
+
960,
|
459 |
+
962,
|
460 |
+
964,
|
461 |
+
967,
|
462 |
+
968,
|
463 |
+
970,
|
464 |
+
972,
|
465 |
+
974,
|
466 |
+
976,
|
467 |
+
979,
|
468 |
+
982,
|
469 |
+
984,
|
470 |
+
986,
|
471 |
+
988,
|
472 |
+
991,
|
473 |
+
993,
|
474 |
+
996,
|
475 |
+
997,
|
476 |
+
1000,
|
477 |
+
1003,
|
478 |
+
1006,
|
479 |
+
1008,
|
480 |
+
1011,
|
481 |
+
1013,
|
482 |
+
1016,
|
483 |
+
1018,
|
484 |
+
1020,
|
485 |
+
1022,
|
486 |
+
1025,
|
487 |
+
1028,
|
488 |
+
1031,
|
489 |
+
1033,
|
490 |
+
1035,
|
491 |
+
1037,
|
492 |
+
1039,
|
493 |
+
1040,
|
494 |
+
1042,
|
495 |
+
1044,
|
496 |
+
1046,
|
497 |
+
1048,
|
498 |
+
1049,
|
499 |
+
1051,
|
500 |
+
1053,
|
501 |
+
1055,
|
502 |
+
1057,
|
503 |
+
1059,
|
504 |
+
1061,
|
505 |
+
1063,
|
506 |
+
1064,
|
507 |
+
1067,
|
508 |
+
1068,
|
509 |
+
1071,
|
510 |
+
1074,
|
511 |
+
1075,
|
512 |
+
1076,
|
513 |
+
1078,
|
514 |
+
1079,
|
515 |
+
1081,
|
516 |
+
1083,
|
517 |
+
1086,
|
518 |
+
1087,
|
519 |
+
1088,
|
520 |
+
1089,
|
521 |
+
1092,
|
522 |
+
1095,
|
523 |
+
1097,
|
524 |
+
1098,
|
525 |
+
1100,
|
526 |
+
1102,
|
527 |
+
1104,
|
528 |
+
1106,
|
529 |
+
1107,
|
530 |
+
1110,
|
531 |
+
1111,
|
532 |
+
1114,
|
533 |
+
1117,
|
534 |
+
1118,
|
535 |
+
1120,
|
536 |
+
1121,
|
537 |
+
1123,
|
538 |
+
1125,
|
539 |
+
1127,
|
540 |
+
1129,
|
541 |
+
1131,
|
542 |
+
1133,
|
543 |
+
1134,
|
544 |
+
1137,
|
545 |
+
1140,
|
546 |
+
1142,
|
547 |
+
1145,
|
548 |
+
1147,
|
549 |
+
1150,
|
550 |
+
1152,
|
551 |
+
1154,
|
552 |
+
1157,
|
553 |
+
1158,
|
554 |
+
1159,
|
555 |
+
1162,
|
556 |
+
1165,
|
557 |
+
1166,
|
558 |
+
1168,
|
559 |
+
1172,
|
560 |
+
1174,
|
561 |
+
1176,
|
562 |
+
1179,
|
563 |
+
1182,
|
564 |
+
1185,
|
565 |
+
1188,
|
566 |
+
1191,
|
567 |
+
1193,
|
568 |
+
1195,
|
569 |
+
1197,
|
570 |
+
1199,
|
571 |
+
1202,
|
572 |
+
1204,
|
573 |
+
1207,
|
574 |
+
1209,
|
575 |
+
1211,
|
576 |
+
1214,
|
577 |
+
1216,
|
578 |
+
1218,
|
579 |
+
1219,
|
580 |
+
1221,
|
581 |
+
1222,
|
582 |
+
1225,
|
583 |
+
1227,
|
584 |
+
1229,
|
585 |
+
1231,
|
586 |
+
1233,
|
587 |
+
1234,
|
588 |
+
1235,
|
589 |
+
1237,
|
590 |
+
1238,
|
591 |
+
1241,
|
592 |
+
1242,
|
593 |
+
1243,
|
594 |
+
1244,
|
595 |
+
1246,
|
596 |
+
1248,
|
597 |
+
1250,
|
598 |
+
1252,
|
599 |
+
1254,
|
600 |
+
1257,
|
601 |
+
1259,
|
602 |
+
1261,
|
603 |
+
1262,
|
604 |
+
1265,
|
605 |
+
1268,
|
606 |
+
1270,
|
607 |
+
1272,
|
608 |
+
1274,
|
609 |
+
1275,
|
610 |
+
1276,
|
611 |
+
1278,
|
612 |
+
1279,
|
613 |
+
1280,
|
614 |
+
1281,
|
615 |
+
1282,
|
616 |
+
1284,
|
617 |
+
1286,
|
618 |
+
1287,
|
619 |
+
1288,
|
620 |
+
1291,
|
621 |
+
1292,
|
622 |
+
1295,
|
623 |
+
1298,
|
624 |
+
1299,
|
625 |
+
1300,
|
626 |
+
1302,
|
627 |
+
1304,
|
628 |
+
1305,
|
629 |
+
1306,
|
630 |
+
1308,
|
631 |
+
1310,
|
632 |
+
1311,
|
633 |
+
1312,
|
634 |
+
1313,
|
635 |
+
1314,
|
636 |
+
1315,
|
637 |
+
1317,
|
638 |
+
1318,
|
639 |
+
1320,
|
640 |
+
1322,
|
641 |
+
1323,
|
642 |
+
1324,
|
643 |
+
1325,
|
644 |
+
1326,
|
645 |
+
1329,
|
646 |
+
1331,
|
647 |
+
1333,
|
648 |
+
1335,
|
649 |
+
1337,
|
650 |
+
1338,
|
651 |
+
1339,
|
652 |
+
1340,
|
653 |
+
1342,
|
654 |
+
1344,
|
655 |
+
1346,
|
656 |
+
1348,
|
657 |
+
1351,
|
658 |
+
1353,
|
659 |
+
1356,
|
660 |
+
1357,
|
661 |
+
1359,
|
662 |
+
1361,
|
663 |
+
1363,
|
664 |
+
1365,
|
665 |
+
1366,
|
666 |
+
1367,
|
667 |
+
1370,
|
668 |
+
1371,
|
669 |
+
1374,
|
670 |
+
1375,
|
671 |
+
1378,
|
672 |
+
1380,
|
673 |
+
1383,
|
674 |
+
1386,
|
675 |
+
1388,
|
676 |
+
1390,
|
677 |
+
1393,
|
678 |
+
1394,
|
679 |
+
1396,
|
680 |
+
1397,
|
681 |
+
1400,
|
682 |
+
1403,
|
683 |
+
1405,
|
684 |
+
1407,
|
685 |
+
1409,
|
686 |
+
1411,
|
687 |
+
1413,
|
688 |
+
1415,
|
689 |
+
1416,
|
690 |
+
1417,
|
691 |
+
1418,
|
692 |
+
1419,
|
693 |
+
1421,
|
694 |
+
1422,
|
695 |
+
1424,
|
696 |
+
1426,
|
697 |
+
1427,
|
698 |
+
1430,
|
699 |
+
1433,
|
700 |
+
1435,
|
701 |
+
1437,
|
702 |
+
1440,
|
703 |
+
1441,
|
704 |
+
1442,
|
705 |
+
1443,
|
706 |
+
1445,
|
707 |
+
1446,
|
708 |
+
1448,
|
709 |
+
1449,
|
710 |
+
1450,
|
711 |
+
1453,
|
712 |
+
1454,
|
713 |
+
1457,
|
714 |
+
1460,
|
715 |
+
1461,
|
716 |
+
1462,
|
717 |
+
1463,
|
718 |
+
1464,
|
719 |
+
1465,
|
720 |
+
1466,
|
721 |
+
1471,
|
722 |
+
1472,
|
723 |
+
1474,
|
724 |
+
1475,
|
725 |
+
1476,
|
726 |
+
1477,
|
727 |
+
1478,
|
728 |
+
1479
|
729 |
+
]
|
730 |
+
}
|
lemmatizer/{lookups/lookups.bin → model}
RENAMED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e2a0cd62ac4ea974ea936e47be871c878245f5e9e338edadbbff075f20e99d53
|
3 |
+
size 282130
|
lemmatizer/trees
ADDED
Binary file (179 kB). View file
|
|
lt_core_news_md-any-py3-none-any.whl
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a4196691cfe2c26368203bdf1e16cf67c07399528076163572169aa79f01eeb8
|
3 |
+
size 42626304
|
meta.json
CHANGED
@@ -1,14 +1,14 @@
|
|
1 |
{
|
2 |
"lang":"lt",
|
3 |
"name":"core_news_md",
|
4 |
-
"version":"3.
|
5 |
-
"description":"Lithuanian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser,
|
6 |
"author":"Explosion",
|
7 |
"email":"[email protected]",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
-
"spacy_version":">=3.
|
11 |
-
"spacy_git_version":"
|
12 |
"vectors":{
|
13 |
"width":300,
|
14 |
"vectors":20000,
|
@@ -1686,15 +1686,8 @@
|
|
1686 |
"punct",
|
1687 |
"xcomp"
|
1688 |
],
|
1689 |
-
"senter":[
|
1690 |
-
"I",
|
1691 |
-
"S"
|
1692 |
-
],
|
1693 |
"attribute_ruler":[
|
1694 |
|
1695 |
-
],
|
1696 |
-
"lemmatizer":[
|
1697 |
-
|
1698 |
],
|
1699 |
"ner":[
|
1700 |
"GPE",
|
@@ -1710,8 +1703,8 @@
|
|
1710 |
"morphologizer",
|
1711 |
"tagger",
|
1712 |
"parser",
|
1713 |
-
"attribute_ruler",
|
1714 |
"lemmatizer",
|
|
|
1715 |
"ner"
|
1716 |
],
|
1717 |
"components":[
|
@@ -1719,9 +1712,9 @@
|
|
1719 |
"morphologizer",
|
1720 |
"tagger",
|
1721 |
"parser",
|
|
|
1722 |
"senter",
|
1723 |
"attribute_ruler",
|
1724 |
-
"lemmatizer",
|
1725 |
"ner"
|
1726 |
],
|
1727 |
"disabled":[
|
@@ -1732,313 +1725,159 @@
|
|
1732 |
"token_p":0.998357254,
|
1733 |
"token_r":0.9988754325,
|
1734 |
"token_f":0.9986162761,
|
1735 |
-
"
|
1736 |
-
"
|
1737 |
-
"
|
1738 |
-
"
|
1739 |
-
"
|
1740 |
-
"dep_las_per_type":{
|
1741 |
-
"root":{
|
1742 |
-
"p":0.7385620915,
|
1743 |
-
"r":0.7325769854,
|
1744 |
-
"f":0.7355573637
|
1745 |
-
},
|
1746 |
-
"obl":{
|
1747 |
-
"p":0.5110132159,
|
1748 |
-
"r":0.5296803653,
|
1749 |
-
"f":0.5201793722
|
1750 |
-
},
|
1751 |
-
"nmod":{
|
1752 |
-
"p":0.7538644471,
|
1753 |
-
"r":0.7624774504,
|
1754 |
-
"f":0.7581464873
|
1755 |
-
},
|
1756 |
-
"amod":{
|
1757 |
-
"p":0.7503805175,
|
1758 |
-
"r":0.7458396369,
|
1759 |
-
"f":0.7481031866
|
1760 |
-
},
|
1761 |
-
"cc":{
|
1762 |
-
"p":0.7351129363,
|
1763 |
-
"r":0.7665952891,
|
1764 |
-
"f":0.750524109
|
1765 |
-
},
|
1766 |
-
"conj":{
|
1767 |
-
"p":0.4879032258,
|
1768 |
-
"r":0.5475113122,
|
1769 |
-
"f":0.5159914712
|
1770 |
-
},
|
1771 |
-
"obl:arg":{
|
1772 |
-
"p":0.5672131148,
|
1773 |
-
"r":0.5831460674,
|
1774 |
-
"f":0.5750692521
|
1775 |
-
},
|
1776 |
-
"acl":{
|
1777 |
-
"p":0.4695945946,
|
1778 |
-
"r":0.4527687296,
|
1779 |
-
"f":0.4610281924
|
1780 |
-
},
|
1781 |
-
"advmod":{
|
1782 |
-
"p":0.7456359102,
|
1783 |
-
"r":0.736453202,
|
1784 |
-
"f":0.741016109
|
1785 |
-
},
|
1786 |
-
"det":{
|
1787 |
-
"p":0.7043010753,
|
1788 |
-
"r":0.8238993711,
|
1789 |
-
"f":0.7594202899
|
1790 |
-
},
|
1791 |
-
"xcomp":{
|
1792 |
-
"p":0.7944664032,
|
1793 |
-
"r":0.858974359,
|
1794 |
-
"f":0.8254620123
|
1795 |
-
},
|
1796 |
-
"advcl":{
|
1797 |
-
"p":0.4106280193,
|
1798 |
-
"r":0.3373015873,
|
1799 |
-
"f":0.3703703704
|
1800 |
-
},
|
1801 |
-
"parataxis":{
|
1802 |
-
"p":0.4444444444,
|
1803 |
-
"r":0.3636363636,
|
1804 |
-
"f":0.4
|
1805 |
-
},
|
1806 |
-
"advmod:emph":{
|
1807 |
-
"p":0.6756756757,
|
1808 |
-
"r":0.5841121495,
|
1809 |
-
"f":0.626566416
|
1810 |
-
},
|
1811 |
-
"nsubj":{
|
1812 |
-
"p":0.7166392092,
|
1813 |
-
"r":0.7038834951,
|
1814 |
-
"f":0.7102040816
|
1815 |
-
},
|
1816 |
-
"acl:relcl":{
|
1817 |
-
"p":0.6865671642,
|
1818 |
-
"r":0.6388888889,
|
1819 |
-
"f":0.6618705036
|
1820 |
-
},
|
1821 |
-
"case":{
|
1822 |
-
"p":0.8396946565,
|
1823 |
-
"r":0.8291457286,
|
1824 |
-
"f":0.8343868521
|
1825 |
-
},
|
1826 |
-
"csubj":{
|
1827 |
-
"p":0.5454545455,
|
1828 |
-
"r":0.375,
|
1829 |
-
"f":0.4444444444
|
1830 |
-
},
|
1831 |
-
"mark":{
|
1832 |
-
"p":0.7863247863,
|
1833 |
-
"r":0.7796610169,
|
1834 |
-
"f":0.7829787234
|
1835 |
-
},
|
1836 |
-
"cop":{
|
1837 |
-
"p":0.7697841727,
|
1838 |
-
"r":0.8294573643,
|
1839 |
-
"f":0.7985074627
|
1840 |
-
},
|
1841 |
-
"obj":{
|
1842 |
-
"p":0.8015665796,
|
1843 |
-
"r":0.7561576355,
|
1844 |
-
"f":0.7782002535
|
1845 |
-
},
|
1846 |
-
"dep":{
|
1847 |
-
"p":0.0,
|
1848 |
-
"r":0.0,
|
1849 |
-
"f":0.0
|
1850 |
-
},
|
1851 |
-
"ccomp":{
|
1852 |
-
"p":0.6395348837,
|
1853 |
-
"r":0.625,
|
1854 |
-
"f":0.632183908
|
1855 |
-
},
|
1856 |
-
"appos":{
|
1857 |
-
"p":0.7333333333,
|
1858 |
-
"r":0.4230769231,
|
1859 |
-
"f":0.5365853659
|
1860 |
-
},
|
1861 |
-
"nummod":{
|
1862 |
-
"p":0.7099236641,
|
1863 |
-
"r":0.6739130435,
|
1864 |
-
"f":0.6914498141
|
1865 |
-
},
|
1866 |
-
"nummod:gov":{
|
1867 |
-
"p":0.0,
|
1868 |
-
"r":0.0,
|
1869 |
-
"f":0.0
|
1870 |
-
},
|
1871 |
-
"flat":{
|
1872 |
-
"p":0.3541666667,
|
1873 |
-
"r":0.1603773585,
|
1874 |
-
"f":0.2207792208
|
1875 |
-
},
|
1876 |
-
"nsubj:pass":{
|
1877 |
-
"p":0.5,
|
1878 |
-
"r":0.4470588235,
|
1879 |
-
"f":0.4720496894
|
1880 |
-
},
|
1881 |
-
"flat:foreign":{
|
1882 |
-
"p":0.0,
|
1883 |
-
"r":0.0,
|
1884 |
-
"f":0.0
|
1885 |
-
},
|
1886 |
-
"csubj:pass":{
|
1887 |
-
"p":0.0,
|
1888 |
-
"r":0.0,
|
1889 |
-
"f":0.0
|
1890 |
-
},
|
1891 |
-
"iobj":{
|
1892 |
-
"p":0.0,
|
1893 |
-
"r":0.0,
|
1894 |
-
"f":0.0
|
1895 |
-
}
|
1896 |
-
},
|
1897 |
-
"ents_p":0.750907441,
|
1898 |
-
"ents_r":0.827913957,
|
1899 |
-
"ents_f":0.7875327147,
|
1900 |
-
"ents_per_type":{
|
1901 |
-
"PERSON":{
|
1902 |
-
"p":0.0,
|
1903 |
-
"r":0.0,
|
1904 |
-
"f":0.0
|
1905 |
-
},
|
1906 |
-
"GPE":{
|
1907 |
-
"p":0.0,
|
1908 |
-
"r":0.0,
|
1909 |
-
"f":0.0
|
1910 |
-
},
|
1911 |
-
"PRODUCT":{
|
1912 |
-
"p":0.0,
|
1913 |
-
"r":0.0,
|
1914 |
-
"f":0.0
|
1915 |
-
},
|
1916 |
-
"ORG":{
|
1917 |
-
"p":0.0,
|
1918 |
-
"r":0.0,
|
1919 |
-
"f":0.0
|
1920 |
-
},
|
1921 |
-
"LOC":{
|
1922 |
-
"p":0.0,
|
1923 |
-
"r":0.0,
|
1924 |
-
"f":0.0
|
1925 |
-
},
|
1926 |
-
"TIME":{
|
1927 |
-
"p":0.0,
|
1928 |
-
"r":0.0,
|
1929 |
-
"f":0.0
|
1930 |
-
}
|
1931 |
-
},
|
1932 |
-
"speed":6365.8683040846,
|
1933 |
-
"pos_acc":0.9496907038,
|
1934 |
-
"morph_acc":0.8704416663,
|
1935 |
-
"morph_micro_p":0.9129765114,
|
1936 |
-
"morph_micro_r":0.9070390207,
|
1937 |
-
"morph_micro_f":0.909998081,
|
1938 |
"morph_per_feat":{
|
1939 |
"Case":{
|
1940 |
-
"p":0.
|
1941 |
"r":0.924287119,
|
1942 |
-
"f":0.
|
1943 |
},
|
1944 |
"Gender":{
|
1945 |
-
"p":0.
|
1946 |
-
"r":0.
|
1947 |
-
"f":0.
|
1948 |
},
|
1949 |
"Number":{
|
1950 |
-
"p":0.
|
1951 |
-
"r":0.
|
1952 |
-
"f":0.
|
1953 |
},
|
1954 |
"Definite":{
|
1955 |
-
"p":0.
|
1956 |
-
"r":0.
|
1957 |
-
"f":0.
|
1958 |
},
|
1959 |
"Degree":{
|
1960 |
-
"p":0.
|
1961 |
-
"r":0.
|
1962 |
-
"f":0.
|
1963 |
},
|
1964 |
"Polarity":{
|
1965 |
-
"p":0.
|
1966 |
-
"r":0.
|
1967 |
-
"f":0.
|
1968 |
},
|
1969 |
"Tense":{
|
1970 |
-
"p":0.
|
1971 |
-
"r":0.
|
1972 |
-
"f":0.
|
1973 |
},
|
1974 |
"VerbForm":{
|
1975 |
-
"p":0.
|
1976 |
-
"r":0.
|
1977 |
-
"f":0.
|
1978 |
},
|
1979 |
"Voice":{
|
1980 |
-
"p":0.
|
1981 |
-
"r":0.
|
1982 |
-
"f":0.
|
1983 |
},
|
1984 |
"PronType":{
|
1985 |
-
"p":0.
|
1986 |
-
"r":0.
|
1987 |
-
"f":0.
|
1988 |
},
|
1989 |
"Aspect":{
|
1990 |
-
"p":0.
|
1991 |
-
"r":0.
|
1992 |
-
"f":0.
|
1993 |
},
|
1994 |
"Hyph":{
|
1995 |
-
"p":0.
|
1996 |
-
"r":0.
|
1997 |
-
"f":0.
|
1998 |
},
|
1999 |
"Reflex":{
|
2000 |
-
"p":0.
|
2001 |
-
"r":0.
|
2002 |
-
"f":0.
|
2003 |
},
|
2004 |
"Mood":{
|
2005 |
-
"p":0.
|
2006 |
-
"r":0.
|
2007 |
-
"f":0.
|
2008 |
},
|
2009 |
"Person":{
|
2010 |
-
"p":0.
|
2011 |
-
"r":0.
|
2012 |
-
"f":0.
|
2013 |
},
|
2014 |
"AdpType":{
|
2015 |
-
"p":
|
2016 |
-
"r":0.
|
2017 |
-
"f":0.
|
2018 |
},
|
2019 |
"NumForm":{
|
2020 |
-
"p":0.
|
2021 |
"r":0.89,
|
2022 |
-
"f":0.
|
2023 |
-
},
|
2024 |
-
"NumType":{
|
2025 |
-
"p":0.8,
|
2026 |
-
"r":0.6428571429,
|
2027 |
-
"f":0.7128712871
|
2028 |
},
|
2029 |
"Abbr":{
|
2030 |
-
"p":0.
|
2031 |
"r":0.943231441,
|
2032 |
-
"f":0.
|
2033 |
},
|
2034 |
"Foreign":{
|
2035 |
-
"p":0.
|
2036 |
-
"r":0.
|
2037 |
-
"f":0.
|
|
|
|
|
|
|
|
|
|
|
2038 |
}
|
2039 |
},
|
2040 |
-
"tag_acc":0.
|
2041 |
-
"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2042 |
},
|
2043 |
"sources":[
|
2044 |
{
|
@@ -2053,12 +1892,6 @@
|
|
2053 |
"license":"commercial (licensed by Explosion)",
|
2054 |
"author":"TokenMill"
|
2055 |
},
|
2056 |
-
{
|
2057 |
-
"name":"Lemmatization Lists",
|
2058 |
-
"url":"https://github.com/michmech/lemmatization-lists/",
|
2059 |
-
"license":"ODbL",
|
2060 |
-
"author":"Michal M\u011bchura"
|
2061 |
-
},
|
2062 |
{
|
2063 |
"name":"Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)",
|
2064 |
"url":"https://spacy.io",
|
|
|
1 |
{
|
2 |
"lang":"lt",
|
3 |
"name":"core_news_md",
|
4 |
+
"version":"3.3.0",
|
5 |
+
"description":"Lithuanian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner.",
|
6 |
"author":"Explosion",
|
7 |
"email":"[email protected]",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
+
"spacy_version":">=3.3.0.dev0,<3.4.0",
|
11 |
+
"spacy_git_version":"849bef2de",
|
12 |
"vectors":{
|
13 |
"width":300,
|
14 |
"vectors":20000,
|
|
|
1686 |
"punct",
|
1687 |
"xcomp"
|
1688 |
],
|
|
|
|
|
|
|
|
|
1689 |
"attribute_ruler":[
|
1690 |
|
|
|
|
|
|
|
1691 |
],
|
1692 |
"ner":[
|
1693 |
"GPE",
|
|
|
1703 |
"morphologizer",
|
1704 |
"tagger",
|
1705 |
"parser",
|
|
|
1706 |
"lemmatizer",
|
1707 |
+
"attribute_ruler",
|
1708 |
"ner"
|
1709 |
],
|
1710 |
"components":[
|
|
|
1712 |
"morphologizer",
|
1713 |
"tagger",
|
1714 |
"parser",
|
1715 |
+
"lemmatizer",
|
1716 |
"senter",
|
1717 |
"attribute_ruler",
|
|
|
1718 |
"ner"
|
1719 |
],
|
1720 |
"disabled":[
|
|
|
1725 |
"token_p":0.998357254,
|
1726 |
"token_r":0.9988754325,
|
1727 |
"token_f":0.9986162761,
|
1728 |
+
"pos_acc":0.9468766223,
|
1729 |
+
"morph_acc":0.8705658418,
|
1730 |
+
"morph_micro_p":0.9154675098,
|
1731 |
+
"morph_micro_r":0.9045524101,
|
1732 |
+
"morph_micro_f":0.9099772297,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1733 |
"morph_per_feat":{
|
1734 |
"Case":{
|
1735 |
+
"p":0.9271740917,
|
1736 |
"r":0.924287119,
|
1737 |
+
"f":0.9257283545
|
1738 |
},
|
1739 |
"Gender":{
|
1740 |
+
"p":0.9329511899,
|
1741 |
+
"r":0.9327891629,
|
1742 |
+
"f":0.9328701693
|
1743 |
},
|
1744 |
"Number":{
|
1745 |
+
"p":0.9187705818,
|
1746 |
+
"r":0.9158980772,
|
1747 |
+
"f":0.9173320808
|
1748 |
},
|
1749 |
"Definite":{
|
1750 |
+
"p":0.9205955335,
|
1751 |
+
"r":0.9013605442,
|
1752 |
+
"f":0.9108765038
|
1753 |
},
|
1754 |
"Degree":{
|
1755 |
+
"p":0.8616144975,
|
1756 |
+
"r":0.8723936614,
|
1757 |
+
"f":0.866970576
|
1758 |
},
|
1759 |
"Polarity":{
|
1760 |
+
"p":0.9304725693,
|
1761 |
+
"r":0.907309322,
|
1762 |
+
"f":0.9187449718
|
1763 |
},
|
1764 |
"Tense":{
|
1765 |
+
"p":0.8707224335,
|
1766 |
+
"r":0.8400586941,
|
1767 |
+
"f":0.855115758
|
1768 |
},
|
1769 |
"VerbForm":{
|
1770 |
+
"p":0.9033134166,
|
1771 |
+
"r":0.8808262712,
|
1772 |
+
"f":0.8919281309
|
1773 |
},
|
1774 |
"Voice":{
|
1775 |
+
"p":0.8136645963,
|
1776 |
+
"r":0.741509434,
|
1777 |
+
"f":0.7759131293
|
1778 |
},
|
1779 |
"PronType":{
|
1780 |
+
"p":0.9304635762,
|
1781 |
+
"r":0.9273927393,
|
1782 |
+
"f":0.9289256198
|
1783 |
},
|
1784 |
"Aspect":{
|
1785 |
+
"p":0.8342696629,
|
1786 |
+
"r":0.8181818182,
|
1787 |
+
"f":0.826147427
|
1788 |
},
|
1789 |
"Hyph":{
|
1790 |
+
"p":0.9037433155,
|
1791 |
+
"r":0.8989361702,
|
1792 |
+
"f":0.9013333333
|
1793 |
},
|
1794 |
"Reflex":{
|
1795 |
+
"p":0.7579908676,
|
1796 |
+
"r":0.6125461255,
|
1797 |
+
"f":0.6775510204
|
1798 |
},
|
1799 |
"Mood":{
|
1800 |
+
"p":0.9086479903,
|
1801 |
+
"r":0.8923444976,
|
1802 |
+
"f":0.9004224502
|
1803 |
},
|
1804 |
"Person":{
|
1805 |
+
"p":0.9114688129,
|
1806 |
+
"r":0.9041916168,
|
1807 |
+
"f":0.9078156313
|
1808 |
},
|
1809 |
"AdpType":{
|
1810 |
+
"p":0.9899749373,
|
1811 |
+
"r":0.9825870647,
|
1812 |
+
"f":0.986267166
|
1813 |
},
|
1814 |
"NumForm":{
|
1815 |
+
"p":0.9417989418,
|
1816 |
"r":0.89,
|
1817 |
+
"f":0.9151670951
|
|
|
|
|
|
|
|
|
|
|
1818 |
},
|
1819 |
"Abbr":{
|
1820 |
+
"p":0.9642857143,
|
1821 |
"r":0.943231441,
|
1822 |
+
"f":0.9536423841
|
1823 |
},
|
1824 |
"Foreign":{
|
1825 |
+
"p":0.6551724138,
|
1826 |
+
"r":0.59375,
|
1827 |
+
"f":0.6229508197
|
1828 |
+
},
|
1829 |
+
"NumType":{
|
1830 |
+
"p":0.7608695652,
|
1831 |
+
"r":0.625,
|
1832 |
+
"f":0.6862745098
|
1833 |
}
|
1834 |
},
|
1835 |
+
"tag_acc":0.8632116283,
|
1836 |
+
"sents_p":0.8447712418,
|
1837 |
+
"sents_r":0.8379254457,
|
1838 |
+
"sents_f":0.8413344182,
|
1839 |
+
"dep_uas":0.7353933769,
|
1840 |
+
"dep_las":0.6609365113,
|
1841 |
+
"dep_las_per_type":{
|
1842 |
+
|
1843 |
+
},
|
1844 |
+
"lemma_acc":0.8484193228,
|
1845 |
+
"ents_p":0.7557354926,
|
1846 |
+
"ents_r":0.8404202101,
|
1847 |
+
"ents_f":0.7958313595,
|
1848 |
+
"ents_per_type":{
|
1849 |
+
"ORG":{
|
1850 |
+
"p":0.6943866944,
|
1851 |
+
"r":0.7625570776,
|
1852 |
+
"f":0.7268770403
|
1853 |
+
},
|
1854 |
+
"TIME":{
|
1855 |
+
"p":0.7280334728,
|
1856 |
+
"r":0.7909090909,
|
1857 |
+
"f":0.7581699346
|
1858 |
+
},
|
1859 |
+
"LOC":{
|
1860 |
+
"p":0.7134502924,
|
1861 |
+
"r":0.7554179567,
|
1862 |
+
"f":0.7338345865
|
1863 |
+
},
|
1864 |
+
"PRODUCT":{
|
1865 |
+
"p":0.3829787234,
|
1866 |
+
"r":0.5714285714,
|
1867 |
+
"f":0.4585987261
|
1868 |
+
},
|
1869 |
+
"GPE":{
|
1870 |
+
"p":0.7651663405,
|
1871 |
+
"r":0.9654320988,
|
1872 |
+
"f":0.8537117904
|
1873 |
+
},
|
1874 |
+
"PERSON":{
|
1875 |
+
"p":0.9010791367,
|
1876 |
+
"r":0.9109090909,
|
1877 |
+
"f":0.9059674503
|
1878 |
+
}
|
1879 |
+
},
|
1880 |
+
"speed":9529.4689235955
|
1881 |
},
|
1882 |
"sources":[
|
1883 |
{
|
|
|
1892 |
"license":"commercial (licensed by Explosion)",
|
1893 |
"author":"TokenMill"
|
1894 |
},
|
|
|
|
|
|
|
|
|
|
|
|
|
1895 |
{
|
1896 |
"name":"Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)",
|
1897 |
"url":"https://spacy.io",
|
morphologizer/model
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b51e8b1f4f0844a989d343727c11d792b16a0bbb68e357a330ba8d9cdad42842
|
3 |
+
size 349642
|
ner/model
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f336f4edc351764a0df8a13db8878ac2d2a5fc948b12c5bacb4c45210ba6813d
|
3 |
+
size 6498672
|
parser/model
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 305088
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:630181c6a4f6e4fc3b77ae682f7d5ece185682175279b860ca5b43fc25a1a79c
|
3 |
size 305088
|
parser/moves
CHANGED
@@ -1 +1 @@
|
|
1 |
-
��moves�{"0":{"":
|
|
|
1 |
+
��moves�{"0":{"":28122},"1":{"":17180},"2":{"nmod":6082,"punct":4310,"amod":2596,"cc":2050,"nsubj":1695,"case":1462,"obl:arg":1419,"advmod":1419,"mark":942,"acl":918,"nummod":863,"obl":856,"advmod:emph":848,"det":647,"cop":430,"obj":344,"advcl":315,"parataxis":230,"csubj":205,"nsubj:pass":172,"obl:arg||xcomp":65,"obj||xcomp":43,"dep":0},"3":{"punct":4446,"conj":3626,"obl:arg":1705,"obj":1332,"obl":953,"nmod":946,"xcomp":698,"nsubj":518,"advcl":498,"acl":415,"ccomp":257,"acl:relcl":232,"advmod":231,"flat":189,"nsubj:pass":188,"parataxis":166,"appos":125,"nummod":123,"csubj":111,"cc":58,"dep":56,"amod":50,"advmod:emph":42},"4":{"ROOT":2341}}�cfg��neg_key�
|
senter/model
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:509f37b3a36e27bb00e79d9649f72dd8e885cab42104f198eed028ae3c5cc02f
|
3 |
+
size 219953
|
tagger/cfg
CHANGED
@@ -734,5 +734,6 @@
|
|
734 |
"\u012fv.\u012fvard\u017e.mot.vns.K.",
|
735 |
"\u012fv.\u012fvard\u017e.mot.vns.V."
|
736 |
],
|
|
|
737 |
"overwrite":false
|
738 |
}
|
|
|
734 |
"\u012fv.\u012fvard\u017e.mot.vns.K.",
|
735 |
"\u012fv.\u012fvard\u017e.mot.vns.V."
|
736 |
],
|
737 |
+
"neg_prefix":"!",
|
738 |
"overwrite":false
|
739 |
}
|
tagger/model
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9407ac1674f6a9dd5f68a53f01f2ffaa6af9866425e3e1a7b88ffa700a548db2
|
3 |
+
size 284846
|
tok2vec/model
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bfaeecc121a248602b8da3d26d430cf0f45aeb1483c7cecdc8c1c0f0c53c6aca
|
3 |
+
size 6365604
|
tokenizer
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
-
��prefix_search�~^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�2&\.$|…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F][A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])\.$�infix_finditer�=�\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])(?:-|–|—|--|---|——|~)(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])�token_match��url_match�
|
2 |
��A�
|
3 |
-
� ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�?!��A�?!�@_@��A�@_@�C++��A�C++�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�n-tosios��A�n-tosios�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�v.v��A�v.v�v_v��A�v_v�xD��A�xD�xDD��A�xDD� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A
|
|
|
1 |
+
��prefix_search�~^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�2}\.$|…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F][A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])\.$�infix_finditer�>�\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])(?:-|–|—|--|---|——|~)(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])�token_match��url_match�
|
2 |
��A�
|
3 |
+
� ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�?!��A�?!�@_@��A�@_@�C++��A�C++�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�n-tosios��A�n-tosios�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�v.v��A�v.v�v_v��A�v_v�xD��A�xD�xDD��A�xDD� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’�faster_heuristics�
|
vocab/strings.json
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5e329948ca0c4b9198ec7c5b0a082b7d3d2f3a3ebcc0339e7541779d6ca050fd
|
3 |
+
size 10829058
|