Temat: K-Means - Prolib Integro

Skocz do pozycji: 1.

Tytuł:: Inicjalizacja segmentacji k-means uwzględniająca rozkład gęstości pikseli
Autorzy:: Świta, R.
Suszyński, Z.
Tematy:: FA
KKZ
k-means
kmeans++
segmentacja
k-means ++
segmentation
High Density; Pokaż więcej
Wydawca:: Politechnika Koszalińska. Wydawnictwo Uczelniane
Powiązania:: https://bibliotekanauki.pl/articles/118366.pdf Link otwiera się w nowym oknie
Opis:: Artykuł przedstawia modyfikację inicjalizacji KKZ algorytmu k-means, uwzględniającą, oprócz wzajemnych odległości środków segmentów, również rozkład gęstości pikseli. Funkcja gęstości piksela jest sumą odwrotności odległości piksela od pozostałych i jest poddawana oszacowaniu na podstawie odległości piksela od wartości średniej i wariancji wartości pikseli. W eksperymentach segmentacji podlegały cztery różne sekwencje obrazów termicznych uzyskanych metodą termografii aktywnej. Pomimo dodatkowych obliczeń podczas inicjalizacji, metoda wykazała szybszą zbieżność algorytmu z czasami bardzo podobnymi do inicjalizacji KKZ, ale mniejszym błędem końcowym segmentacji.
This article presents a modification for the KKZ initialization of the k-means segmentation algorithm, which, in addition to the mutual distance of segments, takes into account the density of pixels. Pixel density is expressed asa sum of the inverse of the pixel’s distance to the other pixels and is subjected to estimation based on the distance from the mean and variance of the pixel values. In the experiments, four different sequences of thermal images were used, obtained using active thermography. Despite the additional calculations during initialization, method showed a faster convergence of the algorithm, with processing times very similar to the KKZ initialization, but with a lower final segmentation error.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 2.

Tytuł:: DSMK-means “density-based split-and-Merge K-means clustering algorithm
Autorzy:: Aldahdooh, R. T.
Ashour, W.
Tematy:: clustering
K-means
Density-based Split
Merge K-means clustering Algorithm
DSMK-means
clustering algorithm; Pokaż więcej
Wydawca:: Społeczna Akademia Nauk w Łodzi. Polskie Towarzystwo Sieci Neuronowych
Powiązania:: https://bibliotekanauki.pl/articles/91719.pdf Link otwiera się w nowym oknie
Opis:: Clustering is widely used to explore and understand large collections of data. K-means clustering method is one of the most popular approaches due to its ease of use and simplicity to implement. This paper introduces Density-based Split- and -Merge K-means clustering Algorithm (DSMK-means), which is developed to address stability problems of standard K-means clustering algorithm, and to improve the performance of clustering when dealing with datasets that contain clusters with different complex shapes and noise or outliers. Based on a set of many experiments, this paper concluded that developed algorithms “DSMK-means” are more capable of finding high accuracy results compared with other algorithms especially as they can process datasets containing clusters with different shapes, densities, or those with outliers and noise.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 3.

Tytuł:: Segmentacja sekwencji obrazów metodą korelacyjną
Segmentation of the image sequence using the correlation method
Autorzy:: Świta, R.
Suszyński, Z.
Tematy:: segmentacja
obrazy termiczne
korelacja
K-means
FCM
segmentation
thermal images
correlation
k-means; Pokaż więcej
Wydawca:: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Powiązania:: https://bibliotekanauki.pl/articles/152568.pdf Link otwiera się w nowym oknie
Opis:: Artykuł przedstawia nową metodę segmentacji sekwencji obrazów termicznych wyodrębniającą obszary o różnych właściwościach cieplnych. Metoda oparta jest na korelacji położenia i kształtu segmentów w poszczególnych kadrach sekwencji. Segmentacja pozwala zmniejszyć liczbę analizowanych obszarów do kilku tysięcy razy, co stwarza realne możliwości praktycznego wykorzystania tomografii termicznej. Opisana metoda jest porównana z algorytmami klasteryzacji K-Means i FCM. Zaletą algorytmu korelacyjnego jest automatyczne wyznaczanie liczby segmentów wyjściowych.
This paper presents a new method for segmentation of thermal image sequences. Its aim is to divide the sequence into segments with different thermal properties. The described algorithm is based on measurements of the position and shape correlation of the segments in successive frames of the sequence. It is composed of several stages. The first stage consists of segmenting consecutive frames of the sequence (Fig. 2). The second step is analysis of the similarity of each segment in each frame with respect to all other segments of all frames and synthesis of the intermediate segments (Fig. 4). The intermediate segments form the segmented output image using the depth buffer technique to resolve multiple pixel-to-segment assignments (Fig. 6). This method is a basis for the thermal analysis of solids, which results in discovering depth profiles of thermal properties for each area. The segmentation reduces the number of the analyzed areas down to a few thousand times, which creates real opportunities for practical application of thermal tomography. The new algorithm has been compared with the K means algorithm [2], and FCM [6], which minimizes the sum of pixel value deviations from the centers of the segments they are assigned to, for all frames of the sequence (Tab. 1). The advantage of the correlation method is automatic determination of the number of output segments in the image and maintaining the constant segmentation error when increasing the number of the processed frames.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 4.

Tytuł:: Clustering of data represented by pairwise comparisons
Autorzy:: Dvoenko, Sergey
Tematy:: clustering
k-means
distance
similarity; Pokaż więcej
Wydawca:: Polska Akademia Nauk. Instytut Badań Systemowych PAN
Powiązania:: https://bibliotekanauki.pl/articles/2183479.pdf Link otwiera się w nowym oknie
Opis:: In this paper, experimental data, given in the form of pairwise comparisons, such as distances or similarities, are considered. Clustering algorithms for processing such data are developed based on the well-known k-means procedure. Relations to factor analysis are shown. The problems of improving clustering quality and of finding the proper number of clusters in the case of pairwise comparisons are considered. Illustrative examples are provided.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 5.

Tytuł:: K-means is probabilistically poor
Autorzy:: Kłopotek, Mieczysław
Tematy:: k-means
clustering
probabilistic k-richness; Pokaż więcej
Wydawca:: Uniwersytet Przyrodniczo-Humanistyczny w Siedlcach
Powiązania:: https://bibliotekanauki.pl/articles/2201613.pdf Link otwiera się w nowym oknie
Opis:: Kleinberg introduced the concept of k-richness as a requirement for an algorithm to be a clustering algorithm. The most popular algorithm k means dos not fit this definition because of its probabilistic nature. Hence Ackerman et al. proposed the notion of probabilistic k-richness claiming without proof that k-means has this property. It is proven in this paper, by example, that the version of k-means with random initialization does not have the property probabilistic k-richness, just rebuking Ackeman's claim.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 6.

Tytuł:: Przegląd technik grupowania danych i obszary zastosowań
Autorzy:: Sala, Karolina
Tematy:: cluster analysis
hierarchical clustering
k-means; Pokaż więcej
Wydawca:: Instytut Studiów Międzynarodowych i Edukacji Humanum
Powiązania:: https://bibliotekanauki.pl/articles/2157869.pdf Link otwiera się w nowym oknie
Opis:: The paper presents an overview of various clustering techniques used in data mining. Clustering is an unsupervised learning problem that is used to identify groups in a set of unlabeled data. Data is grouped by probability so that objects of the same group / cluster have similar properties / characteristics [1]. This article aims at exploring and comparing different clustering algorithms. Grouping is used in many areas, including machine learning, pattern recognition, image analysis, information retrieval.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 7.

Tytuł:: Analiza algorytmów budowania coresetów dla problemu k-means.
Analysis of coreset building algorithms for the k-means problem.
Autorzy:: Helm, Piotr
Opis:: W pracy przedstawimy stan wiedzy na temat budowania coresetów w kontekście problemu k-means. W szczególności omówimy techniki konstrukcji coresetów takie jak geometryczna dekompozycja oraz losowe próbkowanie. Celem pracy jest przedstawienie wyników teoretycznych oraz implementacja technik budowania coresetów.
In the paper we will present the state of knowledge on building coresets in the context of the k-means problem. In particular, we will discuss coreset construction techniques such as geometric decomposition and random sampling. The aim of the work is to present theoretical results and to implement introduced techniques of building coresets.
Dostawca treści:: Repozytorium Uniwersytetu Jagiellońskiego

Inne

na półce

Skocz do pozycji: 8.

Tytuł:: Geodesic distances for clustering linked text data
Autorzy:: Tekir, S.
Mansmann, F.
Keimer, D.
Tematy:: clustering
geodesic distance
text data
k-means algorithm
cosine distance
k-harmonic means
microprecision values; Pokaż więcej
Wydawca:: Społeczna Akademia Nauk w Łodzi. Polskie Towarzystwo Sieci Neuronowych
Powiązania:: https://bibliotekanauki.pl/articles/91737.pdf Link otwiera się w nowym oknie
Opis:: The quality of a clustering not only depends on the chosen algorithm and its parameters, but also on the definition of the similarity of two respective objects in a dataset. Applications such as clustering of web documents is traditionally built either on textual similarity measures or on link information. Due to the incompatibility of these two information spaces, combining these two information sources in one distance measure is a challenging issue. In this paper, we thus propose a geodesic distance function that combines traditional similarity measures with link information. In particular, we test the effectiveness of geodesic distances as similarity measures under the space assumption of spherical geometry in a 0-sphere. Our proposed distance measure is thus a combination of the cosine distance of the term-document matrix and some curvature values in the geodesic distance formula. To estimate these curvature values, we calculate clustering coefficient values for every document from the link graph of the data set and increase their distinctiveness by means of a heuristic as these clustering coefficient values are rough estimates of the curvatures. To evaluate our work, we perform clustering tests with the k-means algorithm on a subset of the EnglishWikipedia hyperlinked data set with both traditional cosine distance and our proposed geodesic distance. Additionally, taking inspiration from the unified view of the performance functions of k-means and k-harmonic means, min and harmonic average of the cosine and geodesic distances are taken in order to construct alternate distance forms. The effectiveness of our approach is measured by computing microprecision values of the clusters based on the provided categorical information of each article.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 9.

Tytuł:: K-means and its generalizations
Autorzy:: Chaikouskaya, Alena
Opis:: Klastrowanie danych jest bardzo ważnym zagadnieniem. Ilość danych do przetwarzania jest coraz większa. K-means jest znanym algorytmem klastrowania danych, który szuka klastrów minimalizując odległość wewnętrzną między nimi. Algorytmy Lloyda i Hartigana są znanymi heurystycznymi metodami rozwiązującymi problem k-means. Jednak jak wszystkie algorytmy heurystyczne nie działają dobrze w każdej sytuacji.Celem pracy jest przedstawienie problemu k-means, wyjaśnienie i implementacja rozwiązania heurystycznego biorącego pod uwagę kształt danych oraz porównanie analizowanej metody ze znanymi podejściami. Rozpatrywana metoda nie tylko rozwiązuje problem k-means, ponadto znajduje metrykę, która minimalizuje pole klastrów.
Clustering is a very important topic nowadays. The amount of data which need to be processed gets bigger and bigger every day. K-means is a well-known method of data clustering which defines clusters by minimizing inner-cluster distance. Lloyd and Hartigan algorithms are known heuristic methods of solving k-means problem. But as all heuristic methods they do not work well in every situation.The objective of the work is to make a general overview of k-means, explain and implement a heuristic solution which takes into consideration the form of data and compare the studied method to known approaches. The studied method is meant not just to solve k-means problem, but to find a metric which minimizes the area of clusters.
Dostawca treści:: Repozytorium Uniwersytetu Jagiellońskiego

Inne

na półce

Skocz do pozycji: 10.

Tytuł:: The number of clusters in hybrid predictive models: does it really matter?
Autorzy:: Łapczyński, Mariusz
Jefmański, Bartłomiej
Tematy:: hybrid predictive model
k-means algorithm
decision trees; Pokaż więcej
Wydawca:: Główny Urząd Statystyczny
Powiązania:: https://bibliotekanauki.pl/articles/1046637.pdf Link otwiera się w nowym oknie
Opis:: For quite a long time, research studies have attempted to combine various analytical tools to build predictive models. It is possible to combine tools of the same type (ensemble models, committees) or tools of different types (hybrid models). Hybrid models are used in such areas as customer relationship management (CRM), web usage mining, medical sciences, petroleum geology and anomaly detection in computer networks. Our hybrid model was created as a sequential combination of a cluster analysis and decision trees. In the first step of the procedure, objects were grouped into clusters using the k-means algorithm. The second step involved building a decision tree model with a new independent variable that indicated which cluster the objects belonged to. The analysis was based on 14 data sets collected from publicly accessible repositories. The performance of the models was assessed with the use of measures derived from the confusion matrix, including the accuracy, precision, recall, F-measure, and the lift in the first and second decile. We tried to find a relationship between the number of clusters and the quality of hybrid predictive models. According to our knowledge, similar studies have not been conducted yet. Our research demonstrates that in some cases building hybrid models can improve the performance of predictive models. It turned out that the models with the highest performance measures require building a relatively large number of clusters (from 9 to 15).
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Informacja

Wyszukujesz frazę "K-Means" wg kryterium: Temat