Temat: class-imbalanced - Prolib Integro

Skocz do pozycji: 1.

Tytuł:: Real-time equipment condition assessment for a class-imbalanced dataset based on heterogeneous ensemble learning
Ocena stanu sprzętu w czasie rzeczywistym dla zbiorów danych o niezrównoważonym rozkładzie w klasach. Metoda oparta na uczeniu zespołowym
Autorzy:: Chen, Xiaohui
Zhang, Zhiyao
Zhang, Ze
Tematy:: condition assessment
heterogeneous ensemble learning
genetic algorithm
class-imbalanced
ocena stanu
uczenie zespołowe
algorytm genetyczny
niezrównoważony rozkład w klasach; Pokaż więcej
Wydawca:: Polska Akademia Nauk. Polskie Naukowo-Techniczne Towarzystwo Eksploatacyjne PAN
Powiązania:: https://bibliotekanauki.pl/articles/300613.pdf Link otwiera się w nowym oknie
Opis:: This study proposes an ensemble learning model for the purpose of performing a real-time equipment condition assessment. This model makes it possible to plan desired preventive maintenance activities before an unexpected failure takes place. This study focuses on the class-imbalanced problem in equipment condition assessment research. In reality, equipment will experience multiple conditions(states), most of the time remaining in the normal condition and relatively rarely being in the critical condition, which means that, from the perspective of data modelling, the distribution of samples is highly imbalanced among different classes(conditions). The majority of samples belong to the normal condition, while the minority belong to the critical condition, which poses a great challenge to the classification performance. To address this problem, a genetic algorithm-based ensemble learning model is presented. Furthermore, a self-updating learning strategy is presented for online monitoring, contributing to adaptability and reliability enhancement along with time. Many previous studies have attempted feature extraction and to set thresholds for equipment health indicators. This study has an advantage of omitting these steps, as it can directly assess the equipment condition through the proposed ensemble learning model. Numerical experiments, including two types of comparison studies, have been conducted. The results show the greater effectiveness of our proposed model over that of previous research in terms of the stability and accuracy of its classification performance.
W pracy przedstawiono model uczenia maszynowego opartego na zespołach niejednorodnych klasyfikatorów (ensemble learning), który pozwala przeprowadzać ocenę stanu sprzętu w czasie rzeczywistym. Model ten umożliwia zaplanowanie niezbędnych czynności konserwacji profilaktycznej przed wystąpieniem niespodziewanego uszkodzenia. Tematem pracy jest zagadnienie niezrównoważonego rozkładu w klasach poruszane w badaniach dotyczących oceny stanu sprzętu. W warunkach rzeczywistych, sprzęt chrakteryzuje wiele różnych stanów, przy czym przez większość czasu pozostaje on w stanie normalnym, a relatywnie rzadko znajduje się w stanie krytycznym, co oznacza, że z punktu widzenia modelowania danych, rozkład prób w poszczególnych klasach (stanach) jest wysoce niezrównoważony. Większość prób należy do stanu normalnego, a mniejszość do stanu krytycznego, co stanowi duże wyzwanie jeśli chodzi o wydajność klasyfikacji. W celu rozwiązania tego problemu, przedstawiono model uczenia zespołowego oparty na algorytmie genetycznym. Ponadto zaprezentowano samoaktualizującą się strategię uczenia wykorzystywaną do monitorowania online, która wraz z upływem czasu zwiększa adaptacyjność i niezawodność modelu . W wielu poprzednich badaniach podejmowano próby ekstrakcji cech oraz ustalania progów dla wskaźników stanu sprzętu. Zaletą przedstawionej metody jest to, że pozwala ona pominąć te etapy i bezpośrednio oceniać stan sprzętu za pomocą proponowanego modelu uczenia zespołowego. Przeprowadzono eksperymenty numeryczne, w tym dwa rodzaje badań porównawczych. Wyniki pokazują większą skuteczność proponowanego modelu w stosunku do poprzednich badań pod względem stabilności i trafności klasyfikacji.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 2.

Tytuł:: The use of data mining models in solving the problem of imbalanced classes based on the example of an online marketing campaign
Wykorzystanie modeli data mining w rozwiązywaniu problemu niezrównoważonych klas na przykładzie kampanii marketingowych w Internecie
Autorzy:: Łapczyński, Mariusz
Surma, Jerzy
Tematy:: C&RT
Random Forest
imbalanced class problem
online social network
banner ad campaign; Pokaż więcej
Wydawca:: Wydawnictwo Uniwersytetu Ekonomicznego we Wrocławiu
Powiązania:: https://bibliotekanauki.pl/articles/424980.pdf Link otwiera się w nowym oknie
Opis:: While building predictive models in analytical CRM, researchers often encounter the problem of imbalanced classes (skewed distributions of dependent variables), which consists in the fact that the number of observations belonging to one category of the dependent variable is much lower than the number of observations belonging to the second category of that variable. This is related to such areas as churn analysis, customer acquisition models and cross and up-selling models. The purpose of the paper is to present a predictive model that was built to predict the response of Internet users to banner advertising. The dataset used in the study came from an online social network which offers advertisers banner campaigns targeting its users. The advertising campaign of a cosmetics company was carried out in the autumn of 2010 and was mainly targeted at young women. A user of this service was described by 115 independent variables – 3 out of which were demographic variables (sex, age, education), and the remaining 112 referred to the user’s online activity. While building the model there appeared the problem of imbalanced classes due to the low number of users who clicked on the banner ad. The number of cases amounted to 81,000, while the number of positive reactions to the banner was 207, which constitutes approximately 0.25% of the dependent variable. During the study, two popular data mining tools were utilized – the decision trees C&RT and Random Forest. The second goal of this paper is to compare the performance of the predictive models based on both these analytical tools.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 3.

Tytuł:: Analysis of the class balancing techniques in machine learning
Analiza metod równoważenia klas w uczeniu maszynowym
Autorzy:: Baran, Kamil
Opis:: W obliczu rewolucji technologicznej, dostępność i kompletność danych wydaje się oczywista. Większość systemów uczenia maszynowego są zaprojektowane z założeniem pracy z dużymi zbiorami zrównoważonych danych. Pomimo tego, jest wiele domen które są dotknięte naturalną rzadkością występujących zdarzeń lub niedostępnością danych. W takich domenach, jedna klasa, która zazwyczaj jest klasą zainteresowania, jest reprezentowana przez znacząco mniejszą liczbę instancji co wpływa negatywnie na proces uczenia i klasyfikacji.Celem tej pracy jest: - przedstawienie problemu niezrównoważenia klas i jego wpływu na proces uczenia i klasyfikacji, - omówienie popularnych technik równoważenia klas, - przedstawienie alternatywnych metod pomiaru jakości klasyfikacji, - przeprowadzenie eksperymentów wykorzystując do tego różne techniki równoważenia i zbiory danych. - - Eksperymenty maja na celu : a) zbadać wydajność technik równoważenia klas względem dwóch metryk: zysków klasyfikacji oraz wydajności czasowej, b) zbadać wpływ współczynnika zrównoważenia zbioru na wydajność technik równoważenia.
In the presence of technical revolution, availability and completeness of the data seem to be relatively straightforward. The majority of the machine learning systems are designed to work with a huge load of well-balanced data. In spite of the above-mentioned fact, many domains are still suffering from the natural rarity of events or unavailability of the data. In these domains, one class, which is usually the class of interest, is represented by a significantly small number of instances which affects the whole learning and classification process.The purpose of this thesis is : - to provide insights on the class imbalance problem and the effect it has on the learning and classification process - to provide an overview of the popular class balancing strategies - to present some of the alternative classification performance metrics - to conduct the experiments using various balancing strategies and datasets. Experiments have two major goals: (a) examine the class balancing techniques performance concerning two metrics: classification results improvements and time efficiency. (b) study the impact imbalance ratio has on the performance of class balancing strategies
Dostawca treści:: Repozytorium Uniwersytetu Jagiellońskiego

Inne

na półce

Skocz do pozycji: 4.

Tytuł:: Using information on class interrelations to improve classification of multiclass imbalanced data: A new resampling algorithm
Autorzy:: Janicka, Małgorzata
Lango, Mateusz
Stefanowski, Jerzy
Tematy:: imbalanced data
multi-class learning
re-sampling
data difficulty factor
similarity degree
dane niezrównoważone
ponowne próbkowanie
stopień podobieństwa; Pokaż więcej
Wydawca:: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Powiązania:: https://bibliotekanauki.pl/articles/330287.pdf Link otwiera się w nowym oknie
Opis:: The relations between multiple imbalanced classes can be handled with a specialized approach which evaluates types of examples’ difficulty based on an analysis of the class distribution in the examples’ neighborhood, additionally exploiting information about the similarity of neighboring classes. In this paper, we demonstrate that such an approach can be implemented as a data preprocessing technique and that it can improve the performance of various classifiers on multiclass imbalanced datasets. It has led us to the introduction of a new resampling algorithm, called Similarity Oversampling and Undersampling Preprocessing (SOUP), which resamples examples according to their difficulty. Its experimental evaluation on real and artificial datasets has shown that it is competitive with the most popular decomposition ensembles and better than specialized preprocessing techniques for multi-imbalanced problems.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 5.

Tytuł:: Automated approach to classification of mine-like objects using multiple-aspect sonar images
Autorzy:: Wang, X.
Liu, X.
Japkowicz, N.
Matwin, S.
Tematy:: object
sea bed
multiple side-scan sonar
Dempster-Shafer theory
DS concept
classifier
imbalance
imbalanced problem
multi-instance class; Pokaż więcej
Wydawca:: Społeczna Akademia Nauk w Łodzi. Polskie Towarzystwo Sieci Neuronowych
Powiązania:: https://bibliotekanauki.pl/articles/91790.pdf Link otwiera się w nowym oknie
Opis:: In this paper, the detection of mines or other objects on the seabed from multiple side-scan sonar views is considered. Two frameworks are provided for this kind of classification. The first framework is based upon the Dempster–Shafer (DS) concept of fusion from a single-view kernel-based classifier and the second framework is based upon the concepts of multi-instance classifiers. Moreover, we consider the class imbalance problem which is always presents in sonar image recognition. Our experimental results show that both of the presented frameworks can be used in mine-like object classification and the presented methods for multi-instance class imbalanced problem are also effective in such classification.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Informacja

Wyszukujesz frazę "class-imbalanced" wg kryterium: Temat