000 02787nam a22001577a 4500
082 _a629.8
100 _aAKHTAR,TEHSEEN
_929556
245 _aDETECTION OF THYROID DISEASES USING MACHINE LEARNING TECHNIQUES /
_cTEHSEEN AKHTAR
264 _aIslamabad :
_bSMME- NUST;
_c2021
300 _a137p.
_bSoft Copy
_c30cm
500 _aBackground: The unusual growth of the glandular tissue on the boundary of the Thyroid gland is an indication of Thyroid disease. Thyroid disease is characterised by an unusually high or low number of hormones produced by the thyroid gland, the two most prevalent kinds are hypothyroidism (underactive thyroid gland) and hyperthyroidism (overactive thyroid gland). The main aim of this project was to introduce the concept of an efficient multi-stage ensemble i.e., the voting ensemble of the homogeneous ensemble which could be used with a variety of feature-selection algorithms for improving the diagnosis of thyroid diseases. The dataset utilised in this study was built from real-time thyroid data obtained from the teaching hospital in DG Khan at District Head Quarter (DHQ), Pakistan. Following the appropriate preprocessing processes, three kinds of attribute-selection strategies were used: The first approach used was Select from Model (SFM), the second technique was the Select K-Best (SKB), and the final methodology was the Recursive Feature Elimination (RFE). Select From Model (SFM) is a form of attribute-selection strategy that uses a model to select attributes. As potential feature estimators, the Decision Tree (DT), Logistic Regression (LR), Gradient Boosting (GB) and Random Forest denoted as the (RF) classifiers were employed in conjunction with each other. The homogeneous ensemble activated the bagging, boosting-based learners, who were then classified by the Voting ensemble, which employed both soft and hard voting to categorise the data. Other performance assessment criteria such as hamming loss, accuracy, mean square error, sensitivity and others have been implemented. The results of the experiments reveal that when the suggested approach for better thyroid sickness detection is applied in its most practicable form, it is most successful. On the dataset 1, all of the algorithms tested obtained 100 % accuracy with subset of the total no of feature in each case, however on the dataset 2, more than 98 percent accuracy was reached in every case. On the basis of accuracy and computing cost, the results given here exceeded equivalent benchmark models in their respective fields of study.
650 _aPhD Robotics and Intelligent Machine Engineering
_9123222
700 _aSupervisor : Dr. Omer Gilani
_9119662
856 _uhttp://10.250.8.41:8080/xmlui/handle/123456789/28854
942 _2ddc
_cTHE
999 _c610719
_d610719