Abstract
This chapter describes the Grid Monitoring System (GMS) - a system which adopts a distributed data mining approach to detection of misconfigured grid machines. The GMS non-intrusively collects data from sources available throughout the grid system. It converts raw data to semantically meaningful data and stores these data on the machine from which, it was obtained limiting incurred overhead and allowing scalability. When analysis is requested, a distributed outliers detection algorithm is employed to identify misconfigured machines. The algorithm itself is implemented as a recursive workflow of grid jobs and is especially suited to grid systems in which the machines might be unavailable most of the time or often fail altogether.
Original language | English |
---|---|
Title of host publication | Data Mining Techniques in Grid Computing Environments |
Publisher | John Wiley & Sons, Ltd |
Pages | 71-89 |
Number of pages | 19 |
ISBN (Print) | 9780470512586 |
DOIs | |
State | Published - 22 Jun 2009 |
Externally published | Yes |
Keywords
- Data analysis
- Distributed HilOut algorithm
- GMS ontology - a job
- GMS using Condor DAGman
- Grid Monitoring System (GMS)
- Grid system misconfigured machine mining
- Knowledge discovery process - acquiring
- Large grid system misconfigured machine detection
- Pre-processing and storing data
- System data acquisition - intrusive and non-intrusive
- a pool and matchmaking
ASJC Scopus subject areas
- General Computer Science