Matrix similarity analysis of texts written in Romanian and Spanish
Authors:
- Artur Niewiarowski,
- Anna Plichta
Abstract
This publication presents the results of a study of the similarity between texts written in Romanian and Spanish, using a matrix analysis method based on Levenshtein’s edit distance. The method implements no language-dependent vocabulary rules, so its similarity analysis is language-independent. The study was carried out with Antyplagius, a commercial computer program created by New Data Mining Systems that performs similarity analysis exclusively with this method. The compared texts were Wikipedia excerpts translated by popular online translators based on artificial intelligence.
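As an illustration only, the idea of a matrix of edit-distance similarities can be sketched as follows. This is a minimal sketch and not the algorithm used in Antyplagius, whose internals are not described in this record. The word-level tokenisation, the normalisation by the longer word, and the function names `levenshtein`, `similarity` and `similarity_matrix` are all assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Assumed normalisation: 1.0 for identical words, 0.0 for no overlap."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def similarity_matrix(text_a: str, text_b: str) -> list[list[float]]:
    """Rows follow the words of text_a, columns the words of text_b."""
    words_a = text_a.lower().split()  # assumed tokenisation: whitespace only
    words_b = text_b.lower().split()
    return [[similarity(wa, wb) for wb in words_b] for wa in words_a]


if __name__ == "__main__":
    # Romanian and Spanish phrases chosen only for illustration.
    for row in similarity_matrix("casa mare", "casa grande"):
        print([round(value, 2) for value in row])
```

Because the sketch compares character sequences only, it needs no dictionary or grammar for either language, which is the property the abstract calls linguistic universality.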
- Record ID
- CUT43d6da84b46c4d2f8487378b1c5eb351
- Pages
- 507-512
- Other elements of collation
- tables; charts; Bibliography (on pp.) - 511-512; Bibliography (number of items) - 25; Abstract designation - Abstr.
- Substantive notes
- Publisher according to the copyright notice.
- Place of publication according to the publisher's seat.
- MNiSW/MEiN score (chapter) - 5
- Book
- Enrico Vicario, Romeo Bandinelli, Virginia Fani [et al.] (eds.): ECMS 2023 : proceedings of the 37th ECMS International Conference on Modelling and Simulation, June 20th – June 23rd, 2023, Florence, Italy, European Conference for Modelling and Simulation, Vol. 37, Iss. 1, 2023, Caserta, ECMS, ISBN 978-3-937436-80-7 (Print)
- Keywords in English
- text-mining, anti-plagiarism, text similarity analysis, Levenshtein’s edit distance, matrix analysis of texts, Romanian language, Spanish language, Romance language group
- URL
- https://www.scs-europe.net/dlib/dl-index.htm
- Language
- eng (en) English
- Score (nominal)
- 5
- Score source
- publisherList
- Additional fields
- Indexed in: CORE
- Uniform Resource Identifier
- https://cris.pk.edu.pl/info/article/CUT43d6da84b46c4d2f8487378b1c5eb351/
- URN
- urn:pkr-prod:CUT43d6da84b46c4d2f8487378b1c5eb351