ABSTRACT
The COVID-19 pandemic has become a serious problem that has attacked various aspects of life such as social, economic, religious, and others. The government has held a COVID-19 vaccination program as an effort to deal with the COVID-19 problem since January 13, 2021. Many problems occurred due to difficulties in dividing the vaccination recipient areas. This is due to the large number of regions with different conditions for each region. One of the efforts to assist the process of processing large vaccination data is data mining techniques and using the clustering method with the K-medoids algorithm. In this study, data on COVID-19 vaccination was grouped in the East Jakarta area using the K-medoids algorithm clustering method. The calculation is carried out using the Euclidean Distance equation and the value of S > 0. The grouped area categories are at the kelurahan level which will then be divided into several clusters. The clustering process was carried out with RapidMiner on 267 kelurahan data on four main attributes, namely the number of targets, the number of vaccine doses 1, the number of vaccine doses 2, and the number of vaccine doses 3. The clustering process was carried out in 6 simulations with variations of k medoids as much as 2 to 7. The results of clustering show the best number of clusters obtained in the simulation is cluster 6 with the smallest Davies Bouldin Index (DBI) value of 0.209. The clusters obtained are clusters 0 to cluster 5. The cluster that is prioritized in giving vaccinations is cluster 2 with 67 items because its members are areas in DKI Jakarta and give a high score in cases of COVID-19 compared to other clusters. © 2022 IEEE.
ABSTRACT
The use of private vehicles during the Covid-19 pandemic has increased because private vehicles, especially cars, are considered as the safest mode of transportation to maintain distance and prevent transmission of the Covid-19 virus. Based on data from two different Indonesian secondary car market place, a comparison of a price sample of Car X in the city of Surabaya with the specifications for the 2015 to 2018 car years with car milage under 1000 kilometers, the used cars have a variety of prices hence a used car price prediction system is needed so that people can find out the average price of used cars sold in the market. In this study the author will use the Random Forest Regressor as a machine learning algorithm to predict the price of a used car with a dataset from the AtapData website. The reason for choosing the Random Forest Regressor is because the algorithm has the power to handle large amounts of data with high dimensions with categorical and numerical data types. The evaluation method used in this study is the Root Mean Absolute Error which produces a value of 0.55612 for validation data and 0.56638 for testing data, while the evaluation proceed with Mean Absolute Error produces a value of 0.45208 for validation data and 0.47576 for testing data. © 2022 IEEE.
ABSTRACT
Since the case of the 2019 Coronavirus Disease pandemic or commonly referred to as Covid-19, the use of public transportation has slowly begun to become an option as transportation to reduce the spread of the corona virus cluster, therefore some people prefer to buy private vehicles. However, due to the increasing price of cars, some people prefer to buy used cars. On the used car buying and selling platform, OLX Autos Indonesia, the demand for used cars increased by 15% to 20%. Therefore, this study was conducted to determine the characteristics of the cluster formed from the used car sales dataset taken from AtapData (atapdata.ai). AtapData is an open data site in Indonesia that can be used for research related to Data Science. This cluster model was created using the K-Prototypes algorithm, Silhouette Score and Davies Bouldin Index to evaluate the resulting cluster results. This clustering model will produce three clusters. The results of the three clusters will have one thing in common, namely brands that dominate sales, including Toyota, Honda, Daihatsu, Nissan, and Mitsubishi. Clustering evaluation using the Silhouette Score method produces a value of 0.7744140503593034. And for the evaluation of the Davies-Bouldin Index it produces a value of 0.4999221950856398. © 2022 IEEE.