
Breast Cancer Prediction using Machine Learning Techniques

Rekha Jain, Nipun Rishi, Nitesh Sachdev

Abstract


Breast cancer is among the most common cancers in most urban areas of India and the second most common in rural areas. In India, one woman is diagnosed with breast cancer every 4 minutes, and one woman dies of it every 13 minutes. More than half of the breast cancer patients in India are diagnosed at stage 3 or 4, when the chances of survival are very low, so there is a need for an automated diagnostic system for early detection of the disease. Predicting whether a patient has breast cancer amounts to classifying a tumor as benign or malignant. Because this is a two-class problem, machine learning classification techniques are used, in which a model learns from past data and makes predictions on new data. In this paper, the dataset is taken from the UCI repository, and a comparative analysis of models built using logistic regression, support vector machine, and random forest is carried out on that dataset. The main objective is to identify the best-performing algorithm by comparing the efficiency and effectiveness of each in terms of precision, accuracy, and sensitivity. Experimental results show that random forest gives the best results for breast cancer classification, with an accuracy of 98.60%. The work was implemented in Python and executed in the Scientific Python Development Environment (Spyder).
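The three evaluation measures named in the abstract can be illustrated with a minimal sketch. The function name `binary_metrics` and the toy labels below are assumptions made for illustration only; they are not the authors' code or data, and malignant is treated as the positive class by assumption.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, and sensitivity (recall) for binary labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    total = tp + tn + fp + fn
    return {
        # Share of all cases classified correctly.
        "accuracy": (tp + tn) / total if total else 0.0,
        # Share of predicted-malignant cases that really are malignant.
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        # Share of truly malignant cases that the model catches.
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical labels (1 = malignant, 0 = benign); NOT results from the paper.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]

metrics = binary_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

In a comparison like the one described, the same function would be applied to the test-set predictions of each classifier (logistic regression, support vector machine, random forest). Sensitivity deserves particular attention in this setting, since a missed malignant tumor is usually costlier than a false alarm.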




