Temat: summarization - Prolib Integro

Skocz do pozycji: 1.

Tytuł:: Survey of Scientific Document Summarization Techniques
Autorzy:: Kurian, Sheena K.
Mathew, Sheena
Tematy:: document summarization
abstractive summarization
extractive summarization; Pokaż więcej
Data publikacji:: 2020
Powiązania:: https://bibliotekanauki.pl/articles/1839285.pdf Link otwiera się w nowym oknie
Źródło:: Computer Science; 2020, 21 (2); 125-161
1508-2806
2300-7036
Pojawia się w:: Computer Science
Opis:: The number of scientic or research papers published every year is growing at an exponential rate, which has led to an intensive research in scientic document summarization. The different methods commonly used in automatic text summarization are discussed in this paper with their pros and cons. Commonly used evaluation techniques and datasets in this field are also discussed. Rouge and Pyramid scores of the different methods are tabulated for easy comparison of the results.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 2.

Tytuł:: Automatic extractive and generic document summarization based on NMF
Autorzy:: Aghdam, Mehdi Hosseinzadeh
Tematy:: text summarization
latent topics
non-negative matrix factorization
updating rules; Pokaż więcej
Data publikacji:: 2023
Powiązania:: https://bibliotekanauki.pl/articles/2201319.pdf Link otwiera się w nowym oknie
Źródło:: Journal of Artificial Intelligence and Soft Computing Research; 2023, 13, 1; 37--49
2083-2567
2449-6499
Pojawia się w:: Journal of Artificial Intelligence and Soft Computing Research
Opis:: Nowadays, textual information grows exponentially on the Internet. Text summarization (TS) plays a crucial role in the massive amount of textual content. Manual TS is time-consuming and impractical in some applications with a huge amount of textual information. Automatic text summarization (ATS) is an essential technology to overcome mentioned challenges. Non-negative matrix factorization (NMF) is a useful tool for extracting semantic contents from textual data. Existing NMF approaches only focus on how factorized matrices should be modeled, and neglect the relationships among sentences. These relationships provide better factorization for TS. This paper suggests a novel non-negative matrix factorization for text summarization (NMFTS). The proposed ATS model puts regularizes on pairwise sentences vectors. A new cost function based on the Frobenius norm is designed, and an algorithm is developed to minimize this function by proposing iterative updating rules. The proposed NMFTS extracts semantic content by reducing the size of documents and mapping the same sentences closely together in the latent topic space. Compared with the basic NMF, the convergence time of the proposed method does not grow. The convergence proof of the NMFTS and empirical results on the benchmark data sets show that the suggested updating rules converge fast and achieve superior results compared to other methods.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 3.

Tytuł:: PKE: a novel Polish keywords extraction method
PKE: nowatorska metoda ekstrakcji słów kluczowych dla języka polskiego
Autorzy:: Kozłowski, M.
Tematy:: information retrieval
keyword extraction
summarization
pozyskiwanie informacji
ekstrakcja słów kluczowych
automatyczne streszczanie; Pokaż więcej
Data publikacji:: 2014
Powiązania:: https://bibliotekanauki.pl/articles/152445.pdf Link otwiera się w nowym oknie
Źródło:: Pomiary Automatyka Kontrola; 2014, R. 60, nr 5, 5; 305-308
0032-4140
Pojawia się w:: Pomiary Automatyka Kontrola
Opis:: In the paper a novel summarization approach, called the Polish Keywords Extractor (PKE), is presented. It is the single document oriented method that is capable of extracting keywords from Polish documents. PKE is a knowledge-poor method (not using any external knowledge resources as Wikipedia) inspired by RAKE and KEA. In comparison with the previous methods PKE uses Polish lemmatizer, Part-of-Speech filters, and various evaluation approaches (statistical measures, classifiers). This algorithm was tested on a set of abstracts of Polish academic papers. The experiments have shown that PKE achieves better quality measures (precision, recall, F-measure) than RAKE and KEA.
Automatyczne streszczanie tekstów dotyczy redukcji całych dokumentów lub korpusów dokumentów do postaci reprezentatywnego zbioru słów, lub akapitu. Jedną z popularniejszych metod streszczania jest ekstrakcja słów kluczowych, której celem jest identyfikacja pojedynczych słów lub fraz etykietujących zadany dokument. Metody ekstrakcji słów kluczowych mogą być podzielone na zorientowane na pojedyncze dokumentu lub na korpusy. Dodatkowo metody ekstrakcji mogą być klasyfikowane według stosowanych podejść: lingwistyczne podejście, statystyczne lub oparte na uczeniu maszynowym. W tym artykule jest zaprezentowane nowe podejście do ekstrakcji słów kluczowych, nazwane PKE, które jest zorientowane na pojedyncze polsko języczne dokumenty. PKE jest metodą nie wykorzystującą zewnętrznych zasobów wiedzy jak np. Wikipedia. Metoda została zainspirowana metodami KEA [7] i RAKE [8]. RAKE jest algorytmem bez nadzoru, niezależnym od dziedziny i języka, który pozyskuje słowa kluczowe z pojedynczych dokumentów. KEA natomiast jest metodą z nadzorem, która wykorzystuje modele bayesowskie w celu obliczenia prawdopodobieństwa bycia słowem kluczowym. W porównaniu do powyższych metod, PKE używa Polskiego lematyzatora, filtrów części mowy, oraz różnorodnych metod ewaluacji (statystycznych miar, klasyfikatorów). Proponowany algorytm został przetestowany na zbiorze polskich abstraktów artykułów. Automatycznie proponowane słowa kluczowe zostały zweryfikowane względem słów wybranych przez autorów prac. Eksperymenty (tabela 1 i 2) pokazały, że PKE osiąga lepsze miary jakości (precyzja, kompletność, F1) niż RAKE i KEA.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 4.

Tytuł:: Automatic Music Summarization.A "Thumbnail" Approach
Autorzy:: Głaczyński, J.
Łukasik, E.
Tematy:: automatic music summarization
music thumbnail
key phrases
automatic music transcription
music information retrieval; Pokaż więcej
Data publikacji:: 2011
Powiązania:: https://bibliotekanauki.pl/articles/177412.pdf Link otwiera się w nowym oknie
Źródło:: Archives of Acoustics; 2011, 36, 2; 297-309
0137-5075
Pojawia się w:: Archives of Acoustics
Opis:: In the paper, various approaches to automatic music audio summarization are discussed. The project described in detail, is the realization of a method for extracting a music thumbnail – a fragment of continuous music of a given duration time that is most similar to the entire music piece. The results of subjective assessment of the thumbnail choice are presented, where four parameters have been taken into account: clarity (representation of the essence of the piece of music), conciseness (the motifs are not repeated in the summary), coherence of music structure, and overall quality of summary usefulness.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 5.

Tytuł:: Automatyczne tworzenie podsumowań tekstów metodami algebraicznymi
Automatic text summarization using algebraic approach
Autorzy:: Gramacki, J.
Gramacki, A.
Tematy:: automatyczne podsumowywanie
ukryta semantyka dokumentów
przekształcenie SVD
generic text summarization
sentence extraction
latent semantic analysis
singular value decomposition; Pokaż więcej
Data publikacji:: 2011
Powiązania:: https://bibliotekanauki.pl/articles/156932.pdf Link otwiera się w nowym oknie
Źródło:: Pomiary Automatyka Kontrola; 2011, R. 57, nr 7, 7; 751-755
0032-4140
Pojawia się w:: Pomiary Automatyka Kontrola
Opis:: Duża liczba zwracanych (na przykład przez różnego rodzaju wyszukiwarki internetowe) dokumentów oznacza, że często zmuszeni jesteśmy do czasochłonnego ich przeglądania, celem weryfikacji trafności zwracanych wyników. Gdy dokumenty są długie, czas ich przeglądania znacznie się wydłuża. Można by go wydatnie skrócić, gdyby istniała możliwość automatycznego generowania sensownych podsumowań (streszczeń). W artykule omawiamy wybrane algebraiczne metody służące automatycznemu wydobywaniu z tekstu jego najistotniejszych słów kluczowych oraz najistotniejszych zdań.
Text summarization is a real practical problem due to explosion of the volume of textual information available nowadays. In order to solve this problem, text summarization systems which extract brief information from a given text are created. The end user, by looking only at the summary, may decide whether the document is or is not of interest to him/her. Built summaries can have 2 fundamental forms. Firstly, extractive summarization may collect important sentences from the input text to constitute the summary. Secondly, abstractive summarization tries to capture main concepts of the text and then some new sentences, summarizing the input text, are generated. Nowadays, however, it seems that the latter approach still needs extensive works to be really useful. A summary can be extracted from a single document or multiple documents. In the paper the authors build summaries of one document only. The extension into multi-document summaries is the straightforward task in the case when a set of semantically uniform texts is summarized. Summaries may also be categorized as generic and query-based summaries. In the first case, there are generated summaries con-taining main topics of a document. In the second case, summaries contain the sentences that are related to the given queries. In the paper there are built generic summaries. Summarization systems use different approaches to determine important sentences. Here there is used semantic oriented approach based on a method known as Latent Semantic Analysis (LSA). LSA is an algebraic method that extracts meaning of words and similarity of sentences using the information about usage of the words in the context. It uses Singular Value Decomposition (SVD) for finding semantically similar words and sentences. Using the results of SVD the authors try to select best sentences (which constitute the best summary of the text). The paper is organized as follows. In Section 2 there is formulated the problem. In Section 3 there is shown how a docu-ment may be represented in a useful algebraic format. The so called Term-Sentence matrix (TSM) is used. The authors also point at some preliminary tasks necessary to be performed for successful further analysis. In Subsection 3.2 there is shortly presented an idea of LSA as based on SVD decomposition. In the last section 4 two examples of text summarizations build for both Polish and English texts are given. The two methods used differ slightly from each other. The authors' extracting key words and key sentences seems to be proper content-related summaries of the input texts.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 6.

Tytuł:: Customer product review summarization over time for competitive intelligence
Autorzy:: Amarouche, Kamal
Benbrahim, Houda
Kassou, Ismail
Tematy:: feature extraction
fuzzy logic
competitive intelligence
opinion mining
opinion summarization
sentiment analysis
SentiWordNet
ekstrakcja cech
logika rozmyta
wywiad konkurencyjny
eksploracja opinii
podsumowanie opinii
analiza nastrojów; Pokaż więcej
Data publikacji:: 2018
Powiązania:: https://bibliotekanauki.pl/articles/950925.pdf Link otwiera się w nowym oknie
Źródło:: Journal of Automation Mobile Robotics and Intelligent Systems; 2018, 12, 4; 70-82
1897-8649
2080-2145
Pojawia się w:: Journal of Automation Mobile Robotics and Intelligent Systems
Opis:: Nowadays, Customer’s product reviews can be widely found on the Web, be it in personal blogs, forums, or ecommerce websites. They contain important products’ information and therefore became a new data source for competitive intelligence. On that account, these reviews need to be analyzed and summarized in order to help the leader of an entity (company, brand, etc.) to make appropriate decisions in an efective way. However, most previous review summarization studies focus on summarizing sentiment distribution toward different product features without taking into account that the real advantages and disadvantages of a product clarify over time. For this reason, in this work we aim to propose a new system for product opinion summarization which depends on the time when reviews are expressed and that covers the sentiments change about product features. The proposed system firstly, generates a summary based on product features in order to give more accurate and efficient information about different features. secondly, classify the product based on its features in its appropriate class (good, medium or bad product) using a fuzzy logic system. The experimental results demonstrate the effectiveness of the proposed system to generate the real image of a product and its features in reviews.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 7.

Tytuł:: Narzędzia do automatycznego streszczania tekstów w języku polskim. Stan badań naukowych i prac wdrożeniowych
Tools for automatic summarization of texts in Polish. State of the research and implementation workse
Autorzy:: Glenc, Piotr
Tematy:: text summarization
Natural Language Processing
text documents
Polish language processing
automation of knowledge acquisition
streszczanie tekstów
przetwarzanie języka naturalnego
dokumenty tekstowe
przetwarzanie języka polskiego
automatyzacja pozyskiwania wiedzy; Pokaż więcej
Data publikacji:: 2021
Powiązania:: https://bibliotekanauki.pl/articles/1191628.pdf Link otwiera się w nowym oknie
Źródło:: e-mentor. Czasopismo naukowe Szkoły Głównej Handlowej w Warszawie; 2021, 89, 2; 67-77
1731-6758
1731-7428
Pojawia się w:: e-mentor. Czasopismo naukowe Szkoły Głównej Handlowej w Warszawie
Opis:: The goal of the publication is to present the state of research and works carried out in Poland on the issue of automatic text summarization. The author describes principal theoretical and methodological issues related to automatic summary generation followed by the outline of the selected works on the automatic abstracting of Polish texts. The author also provides three examples of IT tools that generate summaries of texts in Polish (Summarize, Resoomer, and NICOLAS) and their characteristics derived from the conducted experiment, which included quality assessment of generated summaries using ROUGE-N metrics. The results of both actions showed a deficiency of tools allowing to automatically create summaries of Polish texts, especially in the abstractive approach. Most of the proposed solutions are based on the extractive method, which uses parts of the original text to create its abstract. There is also a shortage of tools generating one common summary of many text documents and specialized tools generating summaries of documents related to specific subject areas. Moreover, it is necessary to intensify works on creating the corpora of Polish-language text summaries, which the computer scientists could apply to evaluate their newly developed tools.
Celem publikacji jest przedstawienie stanu badań i prac prowadzonych w Polsce nad zagadnieniem automatycznego streszczania tekstów. Przedstawiono podstawowe zagadnienia teoretyczne i metodologiczne związane z automatycznym generowaniem streszczeń dokumentów tekstowych. Na tle tych rozważań dokonano opisu wybranych prac dotyczących automatycznego generowania streszczeń tekstów polskojęzycznych. Zaprezentowano również przykłady narzędzi informatycznych generujących streszczenia tekstów w języku polskim (Summarize, Resoomer, NICOLAS) oraz dokonano oceny jakości generowanych streszczeń z wykorzystaniem miar ROUGE-N. Wyniki badań literaturowych i przeprowadzonego eksperymentu wskazały na niedobór narzędzi pozwalających na generowanie streszczeń tekstów polskojęzycznych, zwłaszcza w podejściu abstraktowym. Większość zaproponowanych rozwiązań pozwala na generowanie streszczeń w podejściu ekstrakcyjnym, polegającym na wykorzystywaniu w tworzonym streszczeniu fragmentów oryginalnego tekstu. Widoczny jest również niedobór narzędzi pozwalających na wygenerowanie jednego streszczenia wielu tekstów oraz narzędzi wyspecjalizowanych, pozwalających na generowanie streszczeń tekstów dotyczących konkretnych obszarów tematycznych. Ponadto konieczne jest zintensyfikowanie prac w obszarze tworzenia korpusów streszczeń polskojęzycznych tekstów, które będą mogły być wykorzystane do ewaluacji nowo tworzonych narzędzi.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 8.

Tytuł:: Analysis of methods and means of text mining
Autorzy:: Rybchak, Z.
Basystiuk, O.
Tematy:: text mining
text analytics
data analysis
high-quality information
text categorization
text clustering
document summarization
sentiment analysis
sieć językowa
analiza tekstu
analiza danych
wysoka jakość informacji
klasyfikacja tekstowa
kategoryzacja tekstowa
grupowanie tekstu
streszczenie dokumentów tekstowych
technika sentiment analysis; Pokaż więcej
Data publikacji:: 2017
Powiązania:: https://bibliotekanauki.pl/articles/411072.pdf Link otwiera się w nowym oknie
Źródło:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes; 2017, 6, 2; 73-78
2084-5715
Pojawia się w:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes
Opis:: In Big Data era when data volume doubled every year analyzing of all this data become really complicated task, so in this case text mining systems, techniques and tools become main instrument of analyzing tones and tones of information, selecting that information that suit the best for your needs and just help save your time for more interesting thing. The main aims of this article are explain basic principles of this field and overview some interesting technologies that nowadays are widely used in text mining.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Informacja

Wyszukujesz frazę "summarization" wg kryterium: Temat