Names of tumor-related diseases were retrieved from a database platform of ancient TCM books using Selenium WebDriver, a browser automation framework, under Python 3.8. The etree module of lxml was used to parse the data. Statistics on the "classification", "authors", "completion time" and "summary" of the relevant ancient books were compiled automatically. After the data were checked and processed, Tableau 2019.2 was used for data visualization and analysis. In addition, ancient Chinese medicine literature relating to tumors was consulted manually in the database. Taking the dynasties as the thread and focusing on symptoms, etiology, pathogenesis and prognosis, this paper explores the development of TCM oncology.
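The parsing and counting step described above can be illustrated with a minimal sketch. It is not the authors' script: the markup, the class names, the sample records and the helpers `parse_records` and `count_by` are all hypothetical, since the structure of the database pages is not given here. The sketch assumes each result page exposes one element per book with the four fields named above.

```python
from collections import Counter

try:
    from lxml import etree  # the parser named in the text
except ImportError:
    # Standard-library fallback; it offers the same fromstring/findall/findtext
    # calls used below, so the sketch runs without lxml installed.
    import xml.etree.ElementTree as etree

# Invented sample markup standing in for one page of search results.
# The real page structure and field names are assumptions.
SAMPLE_PAGE = """
<div id="results">
  <div class="book">
    <span class="classification">Surgery</span>
    <span class="author">Author A</span>
    <span class="completion-time">Ming</span>
    <span class="summary">Placeholder summary.</span>
  </div>
  <div class="book">
    <span class="classification">Surgery</span>
    <span class="author">Author B</span>
    <span class="completion-time">Qing</span>
    <span class="summary">Placeholder summary.</span>
  </div>
  <div class="book">
    <span class="classification">Formulas</span>
    <span class="author">Author C</span>
    <span class="completion-time">Qing</span>
    <span class="summary">Placeholder summary.</span>
  </div>
</div>
"""

FIELDS = ("classification", "author", "completion-time", "summary")


def parse_records(page_source):
    """Return one dict per book element, keyed by the four field names."""
    root = etree.fromstring(page_source.strip())
    records = []
    for book in root.findall(".//div[@class='book']"):
        record = {}
        for field in FIELDS:
            # findtext returns None when the field is missing; store "" instead.
            text = book.findtext(f"span[@class='{field}']")
            record[field] = (text or "").strip()
        records.append(record)
    return records


def count_by(records, field):
    """Tally how many records share each non-empty value of a field."""
    return Counter(r[field] for r in records if r[field])


if __name__ == "__main__":
    books = parse_records(SAMPLE_PAGE)
    print(count_by(books, "completion-time"))
    print(count_by(books, "classification"))
```

In a real run the page text would come from Selenium's `driver.page_source`. Live pages are rarely well-formed XML, so lxml's `etree.HTML` would likely be the more forgiving entry point. The resulting counts could then be exported, for example as CSV, for charting in Tableau.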
It was found that TCM oncology originated in the pre-Qin period, was improved in the Han and Tang dynasties, matured in the Song and Ming dynasties, and was completed in the Qing dynasty and the Republic of China period. The data visualization method, which integrates an automation framework with parsing tools, helps to analyze the subdivision characteristics of ancient TCM literature. It is convenient, efficient and innovative, and is expected to provide a classic reference for contemporary TCM studies.