A Handbook on Healthcare Applications

Sowmyayani, S. (2022) A Handbook on Healthcare Applications. B P International. ISBN 978-93-5547-948-8

Full text not available from this repository.

Abstract

With the rise in big data, machine learning has become particularly important for solving problems. Machine learning uses two types of techniques: supervised learning and unsupervised learning. Clustering is the most common unsupervised learning technique. Classification and Regression are supervised learning techniques. Clustering algorithms fall into two broad groups: Hard clustering and soft clustering. K-Means, K-Mediods, Hierarchical clustering, Self-organizing Map are some of the hard clustering methods. Fuzzy C- Means, Gaussian Mixture Model are soft clustering methods. In classification problem, the classes may be binary or multiclass. A multiclass classification problem is generally more challenging because it requires a more complex model. Most common classification algorithms include Logistic Regression, k Nearest Neighbor (kNN), Support Vector Machine (SVM), Neural Network, Naïve Bayes, Discriminant Analysis, Decision Tree, Bagged and Boosted Decision Trees. Regression algorithms include Gaussian Process Regression Model, SVM Regression, Generalized Linear Model and Regression Tree.

Depends on the application, some problems require pre-processing and optimization. Real-world datasets can be messy, incomplete and in a variety of formats. Hence Pre-processing is necessary before solving the problem. Machine learning is an effective method for finding patterns in big datasets. But bigger data brings added complexity. As datasets get bigger, it is essential to reduce the number of features. The three most commonly used dimensionality reduction techniques are: Principal Component Analysis (PCA), Factor Analysis and Nonnegative matrix factorization. The performance of the method apparently increases when machine learning algorithms is used. Selecting a machine learning algorithm is a process of trial and error. The specific characteristics of the algorithms include Speed of training, Memory usage, Predictive accuracy on new data, Transparency or interpretability.

Item Type: Book
Subjects: Institute Archives > Medical Science
Depositing User: Managing Editor
Date Deposited: 07 Oct 2023 09:15
Last Modified: 07 Oct 2023 09:15
URI: http://eprint.subtopublish.com/id/eprint/3018

Actions (login required)

View Item
View Item