As a broad subfield of artificial intelligence, machine learning is concerned with the design and development of algorithms and techniques that allow computers to "learn". In Mathematics, Computing, Linguistics and related subjects an algorithm is a sequence of finite instructions often used for Calculation At a general level, there are two types of learning: inductive, and deductive. Induction or inductive reasoning, sometimes called inductive logic, is the process of Reasoning in which the premises of an argument are believed Deductive reasoning is Reasoning which uses deductive Arguments to move from given statements ( Premises to Conclusions which must be true if the Inductive machine learning methods extract rules and patterns out of massive data sets.
The major focus of machine learning research is to extract information from data automatically, by computational and statistical methods. Hence, machine learning is closely related not only to data mining and statistics, but also theoretical computer science. Data mining is the process of Sorting through large amounts of data and picking out relevant information Statistics is a mathematical science pertaining to the collection analysis interpretation or explanation and presentation of Data. Theoretical computer science is the collection of topics of Computer science that focuses on the more abstract logical and mathematical aspects of Computing, such
Applications
Machine learning has a wide spectrum of applications including natural language processing, syntactic pattern recognition, search engines, medical diagnosis, bioinformatics, brain-machine interfaces and cheminformatics, detecting credit card fraud, stock market analysis, classifying DNA sequences, speech and handwriting recognition, object recognition in computer vision, game playing and robot locomotion. Natural language processing ( NLP) is a subfield of Artificial intelligence and Computational linguistics. Syntactic pattern recognition or structural pattern recognition is a form of Pattern recognition, where items are presented pattern structures which can take into Diagnosis is the identification by Process of elimination, of the nature of anything Bioinformatics is the application of information technology to the field of molecular biology A brain-computer interface (BCI sometimes called a direct neural interface or a brain-machine interface, is a direct communication pathway between a human or animal Cheminformatics (also known as chemoinformatics and chemical informatics) is the use of computer and informational techniques applied to a range of problems Credit card fraud is a wide-ranging term for Theft and Fraud committed using a Credit card or any similar payment mechanism as a fraudulent source A stock market, or (equity market is a private or public market for the trading of company Stock and derivatives of company A DNA sequence or genetic sequence is a succession of letters representing the Primary structure of a real or hypothetical DNA Molecule Speech recognition (also known as automatic speech recognition or computer speech recognition) converts spoken words to machine-readable input (for example to keypresses Handwriting recognition is the ability of a computer to receive and interpret intelligible Handwritten input such as pendrives digital cameras and other devices Object recognition in Computer vision is a task of finding given object in an image or video sequence Computer vision is the science and technology of machines that see A strategy game is a Game (eg computer, video or Board game) in which the players' decision-making skills have a high significance Robot locomotion is the study of how to design Robot appendages and control mechanisms to allow robots to move fluidly and efficiently
Human interaction
Some machine learning systems attempt to eliminate the need for human intuition in the analysis of the data, while others adopt a collaborative approach between human and machine. Human intuition cannot be entirely eliminated since the designer of the system must specify how the data is to be represented and what mechanisms will be used to search for a characterization of the data. Machine learning can be viewed as an attempt to automate parts of the scientific method. Scientific method refers to bodies of Techniques for investigating phenomena
Some statistical machine learning researchers create methods within the framework of Bayesian statistics. Bayesian inference is Statistical inference in which evidence or observations are used to update or to newly infer the Probability that a hypothesis may be true
Algorithm types
Machine learning algorithms are organized into a taxonomy, based on the desired outcome of the algorithm. In Mathematics, Computing, Linguistics and related subjects an algorithm is a sequence of finite instructions often used for Calculation Taxonomy is the practice and science of classification The word comes from the Greek, taxis (meaning 'order' 'arrangement' and, nomos Common algorithm types include:
- Supervised learning — in which the algorithm generates a function that maps inputs to desired outputs. Supervised learning is a Machine learning technique for learning a function from training data One standard formulation of the supervised learning task is the classification problem: the learner is required to learn (to approximate) the behavior of a function which maps a vector
into one of several classes by looking at several input-output examples of the function. Statistical classification is a procedure in which individual items are placed into groups based on quantitative information on one or more characteristics inherent in the items (referred
- Unsupervised learning — An agent which models a set of inputs: labeled examples are not available. In Machine learning, unsupervised learning is a class of problems in which one seeks to determine how the data are organised
- Semi-supervised learning — which combines both labeled and unlabeled examples to generate an appropriate function or classifier. In Computer science, semi-supervised learning is a class of Machine learning techniques that make use of both labeled and unlabeled data for training - typically a
- Reinforcement learning — in which the algorithm learns a policy of how to act given an observation of the world. Inspired by related psychological theory in Computer science, reinforcement learning is a sub-area of Machine learning concerned with how an agent Every action has some impact in the environment, and the environment provides feedback that guides the learning algorithm.
- Transduction — similar to supervised learning, but does not explicitly construct a function: instead, tries to predict new outputs based on training inputs, training outputs, and test inputs which are available while training. In Logic, Statistical inference, and Supervised learning, transduction or transductive inference is Reasoning fromobserved specific
- Learning to learn — in which the algorithm learns its own inductive bias based on previous experience. Multi-task learning is an approach to Machine learning, that learns a problem together with other related problems at the same time using a shared representation The inductive bias of a learning Algorithm is the set of assumptions that the learner uses to predict outputs given inputs that it has not encountered (Mitchell 1980
The computational analysis of machine learning algorithms and their performance is a branch of theoretical computer science known as computational learning theory. Theoretical computer science is the collection of topics of Computer science that focuses on the more abstract logical and mathematical aspects of Computing, such In Theoretical computer science, computational learning theory is a mathematical field related to the analysis of Machine learning algorithms
Machine learning topics
- This list represents the topics covered on a typical machine learning course.
|
|
- Approximate inference techniques
- Optimization
- Most of methods listed above either use optimization or are instances of optimization algorithms
- Meta-learning (ensemble methods)
- Inductive transfer and learning to learn
|
See also
- Autonomous robot
- Computational intelligence
- Fuzzy logic
- Inductive logic programming
- Intelligent system
- Journal of Machine Learning Research
- Important publications in machine learning (computer science)
- List of numerical analysis software
- MLMTA Machine Learning: Models, Technologies & Applications
- Multi-label classification
- Neural Information Processing Systems (NIPS) (conference)
- Neural network software
- Pattern recognition
- Predictive analytics
- WEKA Open-source machine learning framework for pattern classification, regression, and clustering. Bayesian probability interprets the concept of Probability as 'a measure of a state of knowledge'. Conditional probability is the Probability of some event A, given the occurrence of some other event B. In statistics regression analysis is a collective name for techniques for the modeling and analysis of numerical data consisting of values of a Dependent variable (response Statistical classification is a procedure in which individual items are placed into groups based on quantitative information on one or more characteristics inherent in the items (referred An artificial neural network (ANN, often just called a "neural network" (NN is a Mathematical model or Computational model based on Biological neural In Operations research, specifically in Decision analysis, a decision tree (or tree diagram is a decision support tool that uses a graph or Gene Expression Programming (GEP is an evolutionary Algorithm that evolves populations of Computer programs in order to solve a user defined problem A genetic algorithm (GA is a Search technique used in Computing to find exact or Approximate solutions to optimization and Search In Artificial intelligence, genetic programming (GP is an Evolutionary algorithm based methodology inspired by Biological evolution to find Inductive logic programming ( ILP) is a subfield of Machine learning which uses Logic programming as a uniform representation for examples background knowledge Kriging is a group of geostatistical techniques to interpolate the value of a Random field (e Linear discriminant analysis (LDA and the related Fisher's linear discriminant are methods used in Statistics and Machine learning to find the Linear In Pattern recognition, the k -nearest neighbor algorithm ( k -NN is a method for classifying objects based on closest training examples in the Minimum message length (MML is a formal Information theory restatement of Occam's Razor: even when models are not equal in goodness of fit accuracy to the observed The perceptron is a type of Artificial neural network invented in 1957 at the Cornell Aeronautical Laboratory by Frank Rosenblatt. A quadratic classifier is used in Machine learning to separate measurements of two or more classes of objects or events by a Quadric surface A radial basis function network is an Artificial neural network that uses Radial basis functions as activation functions Support vector machines ( SVMs) are a set of related Supervised learning methods used for classification and regression. In Mathematics and Computer science, dynamic programming is a method of solving problems exhibiting the properties of Overlapping subproblems and An expectation-maximization ( EM) algorithm is used in Statistics for finding Maximum likelihood estimates of Parameters in probabilistic In Mathematics, a probability density function (pdf is a function that represents a Probability distribution in terms of Integrals Formally a probability In Statistics, a generative model is a model for randomly generating observed data typically given some hidden parameters In Probability theory, Statistics, and Machine learning, a graphical model (GM is a graph that represents independencies among Random variables A Bayesian network (or a belief network) is a Probabilistic graphical model that represents a set of Variables and their probabilistic independencies A Markov network, or Markov random field, is a model of the (full Joint probability distribution of a set \mathcal{X} of Random variables Generative topographic map (GTM is a Machine learning method that is a probabilistic counterpart of the Self-organizing map (SOM is provably convergent and does Monte Carlo methods are a class of Computational Algorithms that rely on repeated Random sampling to compute their results Variational Bayesian methods also called ensemble learning, are a family of techniques for approximating intractable integrals arising in Bayesian statistics and machine learning Variable-order Markov (VOM models are an important class of models that extend the well known Markov chain models Variable-order Bayesian network (VOBN models provide an important extension of both the Bayesian network models and the Variable-order Markov models. Belief propagation, also known as the sum-product algorithm, is an iterative Algorithm for computing marginals of functions on a Graphical model In Mathematics, the term optimization, or mathematical programming, refers to the study of problems in which one seeks to minimize or maximize a real function Boosting is a Machine learning Meta-algorithm for performing Supervised learning. Bootstrap aggregating ( bagging) is a Meta-algorithm to improve Machine learning of Classification and Regression models in terms of In Machine learning, a random forest is a classifier that consists of many decision trees and outputs the class that is the mode of the classes In Machine learning, Weighted Majority Algorithm (WMA is a meta-learning algorithm used to construct a compound algorithm from a pool of prediction algorithms which could Inductive transfer, or transfer learning, is a research problem in Machine learning that focuses on storing knowledge gained while solving one problem and applying Inspired by related psychological theory in Computer science, reinforcement learning is a sub-area of Machine learning concerned with how an agent Temporal difference learning is a prediction method It has been mostly used for solving the Reinforcement learning problem Monte Carlo methods are a class of Computational Algorithms that rely on repeated Random sampling to compute their results Autonomous robots are Robots which can perform desired tasks in unstructured environments without continuous human guidance Computational intelligence (CI is an offshoot of Artificial intelligence. Fuzzy logic is a form of Multi-valued logic derived from Fuzzy set theory to deal with Reasoning that is approximate rather than precise Inductive logic programming ( ILP) is a subfield of Machine learning which uses Logic programming as a uniform representation for examples background knowledge The Journal of Machine Learning Research (usually abbreviated JMLR) is a Scientific journal focusing on Machine learning, a Computability Computability An introduction Listed here are a number of computer programs used for performing numerical calculations acslX is a software application for modeling and evaluating the performance Machine Learning Models Technologies & Applications ( MLMTA) is an international Machine learning and Applied statistics conference held every June in Multi-label classification is a concept in mathematics and Machine learning. Neural Information Processing Systems ( NIPS) is a Machine learning and Computational neuroscience conference held every December in Vancouver Neural network software is used to simulate, Research, develop and apply Artificial neural networks Biological neural networks and Pattern recognition is a sub-topic of Machine learning. It is "the act of taking in raw data and taking an action based on the category of the data" Predictive analytics encompasses a variety of techniques from Statistics and Data mining that analyze current and historical data to make predictions about future Weka (Waikato Environment for Knowledge Analysis is a popular suite of Machine learning software written in Java, developed at the University of Waikato
References
Further reading
- Ethem Alpaydın (2004) Introduction to Machine Learning (Adaptive Computation and Machine Learning), MIT Press, ISBN 0262012111
- Christopher M. Bishop (2007) Pattern Recognition and Machine Learning, Springer ISBN 0-387-31073-8.
- Ryszard S. Michalski, Jaime G. Carbonell, Tom M. Mitchell (1983), Machine Learning: An Artificial Intelligence Approach, Tioga Publishing Company, ISBN 0-935382-05-4.
- Ryszard S. Michalski, Jaime G. Carbonell, Tom M. Mitchell (1986), Machine Learning: An Artificial Intelligence Approach, Volume II, Morgan Kaufmann, ISBN 0-934613-00-1.
- Yves Kodratoff, Ryszard S. Michalski (1990), Machine Learning: An Artificial Intelligence Approach, Volume III, Morgan Kaufmann, ISBN 1-55860-119-8.
- Ryszard S. Michalski, George Tecuci (1994), Machine Learning: A Multistrategy Approach, Volume IV, Morgan Kaufmann, ISBN 1-55860-251-8.
- Bhagat, P. M. (2005). Pattern Recognition in Industry, Elsevier. ISBN 0-08-044538-1.
- Bishop, C. M. (1995). Neural Networks for Pattern Recognition, Oxford University Press. ISBN 0-19-853864-2.
- Richard O. Duda, Peter E. Hart, David G. Stork (2001) Pattern classification (2nd edition), Wiley, New York, ISBN 0-471-05669-3.
- Huang T. -M. , Kecman V. , Kopriva I. (2006), Kernel Based Algorithms for Mining Huge Data Sets, Supervised, Semi-supervised, and Unsupervised Learning, Springer-Verlag, Berlin, Heidelberg, 260 pp. 96 illus. , Hardcover, ISBN 3-540-31681-7.
- KECMAN Vojislav (2001), LEARNING AND SOFT COMPUTING, Support Vector Machines, Neural Networks and Fuzzy Logic Models, The MIT Press, Cambridge, MA, 608 pp. , 268 illus. , ISBN 0-262-11255-8.
- MacKay, D. J. C. (2003). Information Theory, Inference, and Learning Algorithms, Cambridge University Press. ISBN 0-521-64298-1.
- Mitchell, T. (1997). Machine Learning, McGraw Hill. ISBN 0-07-042807-7.
- Ian H. Witten and Eibe Frank "Data Mining: Practical machine learning tools and techniques" Morgan Kaufmann ISBN 0-12-088407-0.
- Sholom Weiss and Casimir Kulikowski (1991). Computer Systems That Learn, Morgan Kaufmann. ISBN 1-55860-065-5.
- Mierswa, Ingo and Wurst, Michael and Klinkenberg, Ralf and Scholz, Martin and Euler, Timm: YALE: Rapid Prototyping for Complex Data Mining Tasks, in Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-06), 2006.
- Trevor Hastie, Robert Tibshirani and Jerome Friedman (2001). The Elements of Statistical Learning, Springer. ISBN 0387952845.
- Vladimir Vapnik (1998). Statistical Learning Theory. Wiley-Interscience, ISBN 0471030031.
External links
Dictionary
machine learning
-noun
- (artificial intelligence) A field concerned with the design and development of algorithms and techniques that allow computers to learn.
© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
network: | |