

A project's developmental history can be captured by information systems. Many software development organizations maintain very large data bases for configuration management and for problem reporting, which record events during development. Such data bases are potential sources of new information relating software quality factors to the attributes of software products and the attributes of their development processes. For large legacy systems or product lines, the amount of available data can be overwhelming. The combination of numerous attributes of software products and processes, very large data bases designed for other purposes, and weak theoretical support [Kitchenham and Pfleeger (1996)] mandates an empirical approach to software quality prediction, rather than a strictly deductive approach [Khoshgoftaar et al. (2000)].

Fayyad (1996) defines knowledge discovery in data bases as "the nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data". Given a set of large data bases or a data warehouse, the major steps of the knowledge discovery process are [Fayyad et al. (1996)]: (1) selection and sampling of data; (2) preprocessing and cleaning of data; (3) data reduction and transformation; (4) data mining; and (5) evaluation of knowledge. Fayyad restricts the term data mining to the step of extracting patterns or models from clean, transformed data, for example, fitting a model or finding a pattern. Classification-tree modeling is an acknowledged tool for data mining [Glymour et al. (1996), Hand (1998)]. Knowledge discovery in general, and the data mining step in particular, is focused on finding patterns and models that can be interpreted as useful knowledge [Fayyad et al. (1996)].

Industrial software systems often have thousands of modules, and a large number of variables can be extracted from source code measurements, configuration management data, and problem reporting data. The result is a large amount of multidimensional data to be analyzed by the data mining step. Classification trees can be used as a data mining technique to identify significant relationships between faults and software product and process attributes [Khoshgoftaar et al. (1996a), Porter and Selby (1990), Troster and Tian (1995)].

This paper introduces the Classification And Regression Trees (CART) algorithm [Breiman et al. (1984)] to software engineering practitioners. A "classification tree" is an algorithm, depicted as a tree graph, that classifies an input object. Alternative classification techniques used in software quality modeling include discriminant analysis [Khoshgoftaar et al. (1996b)], the discriminative power technique [Schneidewind (1995)], logistic regression [Basili et al. (1996)], pattern recognition [Briand et al. (1992)], artificial neural networks [Khoshgoftaar and Lanning (1995)], and fuzzy classification [Ebert (1996)]. A classification tree differs from these in the way it models complex relationships between class membership and combinations of variables. CART automatically builds a parsimonious tree by first growing a maximal tree and then pruning it to an appropriate level of detail. CART is attractive because it emphasizes pruning to achieve robust models. Although Kitchenham briefly
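As a concrete illustration of the grow-then-prune procedure described above, the following is a minimal sketch using scikit-learn, whose DecisionTreeClassifier implements an optimized CART variant with minimal cost-complexity pruning (the pruning scheme of Breiman et al.). The module metrics, fault-proneness labels, and data split below are hypothetical illustration data, not drawn from this paper.

```python
# Sketch: CART-style classification of software modules as fault-prone,
# first growing a maximal tree, then selecting a cost-complexity pruning
# level. All data here are synthetic stand-ins for real module metrics.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)

# Hypothetical attributes for 1000 modules: lines of code,
# cyclomatic complexity, and number of prior changes.
X = rng.uniform(size=(1000, 3)) * np.array([5000.0, 50.0, 20.0])
# Hypothetical labels: 1 = fault-prone, 0 = not fault-prone.
y = (0.001 * X[:, 0] + 0.05 * X[:, 1] + 0.1 * X[:, 2]
     + rng.normal(scale=1.0, size=1000) > 5).astype(int)

X_train, X_hold, y_train, y_hold = train_test_split(X, y, random_state=0)

# Step 1: grow the maximal tree (no depth limit), as CART does.
full_tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

# Step 2: enumerate the cost-complexity pruning levels and keep the
# pruned subtree that generalizes best on held-out data.
path = full_tree.cost_complexity_pruning_path(X_train, y_train)
best_alpha, best_score = 0.0, 0.0
for alpha in path.ccp_alphas:
    pruned = DecisionTreeClassifier(random_state=0, ccp_alpha=alpha)
    pruned.fit(X_train, y_train)
    score = pruned.score(X_hold, y_hold)
    if score >= best_score:
        best_alpha, best_score = alpha, score

print(f"selected alpha={best_alpha:.5f}, holdout accuracy={best_score:.3f}")
```

In practice, the pruning level would be chosen with a separate validation set or cross-validation rather than the single held-out split used in this sketch; the point is only to show the two-step structure that makes CART's trees parsimonious and robust.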
