relation: http://eprints.imtlucca.it/1527/ title: Clustering before training large datasets - Case study: K-SVD creator: Rusu, Cristian subject: QA75 Electronic computers. Computer science description: Training and using overcomplete dictionaries has been the subject of many developments in the area of signal processing and sparse representations. The main idea is to train a dictionary that is able to achieve good sparse representations of the items contained in a given dataset. The most popular approach is the K-SVD algorithm and in this paper we study its application to large datasets. The main interest is to speedup the training procedure while keeping the representation errors close to some specific values. This goal is reached by using a clustering procedure, called here T-mindot, which reduces the size of the dataset but keeps the most representative data items and a measure of their importance. Experimental simulations compare the running times and representation errors of the training method with and without the clustering procedure and they clearly show how effective T-mindot is. publisher: IEEE date: 2012 type: Book Section type: PeerReviewed format: application/pdf language: en identifier: http://eprints.imtlucca.it/1527/1/CLUSTERING%20BEFORE%20TRAINING%20LARGE%20DATASETS%20-%20CASE%20STUDY%20K-SVD.pdf identifier: Rusu, Cristian Clustering before training large datasets - Case study: K-SVD. In: Proceedings of the 20th European Signal Processing Conference (EUSIPCO), 2012. IEEE, pp. 2188-2192. ISBN 978-1-4673-1068-0 (2012) relation: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6333907&isnumber=6333779