Hao Wu
Associate Professor Supervisor of Doctorate Candidates Supervisor of Master's Candidates
Name (Simplified Chinese):Hao Wu
Name (English):Hao Wu
Name (Pinyin):Wu Hao
E-Mail:haowu@sdu.edu.cn
Date of Employment:2020-10-12
School/Department:Shandong University
Administrative Position:Associate Professor
Education Level:With Certificate of Graduation for Doctorate Study
Business Address:215a Office of scientific research building in Software Park Campus of Shandong University No.1500, Shunhua Road, high tech Zone, Jinan City, Shandong Province
Gender:Male
Contact Information:Email: haowu@sdu.edu.cn or haowu@nwsuaf.edu.cn Wechat: bioinformatics01
Degree:Doctor
Status:Employed
Academic Titles:Teaching and research, master's degree Supervisor
Other Post:Senior member of China Computer Association (CCF); Young and general project evaluation experts of NSFC; National Master's thesis evaluation expert; More than ten reviewers and guest editors of SCI Journals
Alma Mater:Xidian University
Whether on the job:0
Discipline:Other Majors of Software Engineering
Computer Applications Technology
Computer Science and Technology
Academic Honor:
Honors and Titles:
2021-12-20 山东大学软件学院优秀教师
2020-12-20 山东大学教师教学创新大赛三等奖
2018-07-10 西北农林科技大学信息工程学院本科毕业设计优秀指导教师
2016-12-25 西北农林科技大学青年教师讲课比赛二等奖,比赛现场评分第一名
2016-12-30 西北农林科技大学信息工程学院“特殊课时津贴”奖
2013-06-10 教改项目《计算机网络应用技术课程建设改革与实践》荣获西北农林科技大学教学成果二等奖
2009-10-10 西北农林科技大学 “优秀班主任”
2008-12-30 西北农林科技大学“校级优秀教师”
Hits:
Affiliation of Author(s):软件学院
Journal:Briefings in Bioinformatics
Key Words:topologically associating domains (TADs), ensemble learning, machine learning, sequence analysis
Abstract:Chromosome is composed of many distinct chromatin domains, referred to variably as topological domains or topologically associating domains (TADs). The domains are stable across different cell types and highly conserved across species, thus these chromatin domains have been considered as the basic units of chromosome folding and regarded as an important secondary structure in chromosome organization. However, the identification of TAD boundaries is still a great challenge due to the high cost and low resolution of Hi-C data or experiments. In this study, we propose a novel ensemble learning framework, termed as StackTADB, for predicting the boundaries of TADs. StackTADB integrates four base classifiers including Random Forest, Logistic Regression, K-NearestNeighbor and Support Vector Machine. From the analysis of a series of examinations on the data set in the previous study, it is concluded that StackTADB has optimal performance in six metrics, AUC, Accuracy, MCC, Precision, Recall and F1 score, and it is superior to the existing methods. In addition, the comparison of the performance of multiple features shows that Kmers-based features play an essential role in predicting TADs boundaries of fruit flies, and we also apply the SHapley Additive exPlanations (SHAP) framework to interpret the predictions of StackTADB to identify the reason why Kmers-based features are vital. The experimental results show that the subsequences matching the BEAF-32 motif play a crucial role in predicting the boundaries of TADs. The source code is freely available at https://github.com/HaoWuLab-Bioinformatics/StackTADB and the webserver of StackTADB is freely available at http://hwtad.sdu.edu.cn:8002/StackTADB.
All the Authors:Pengyu Zhang,Zhaoheng Ai,Leyi Wei,Hongming Zhang*,Fan Yang*,Lizhen Cui*
First Author:Hao Wu*
Indexed by:Journal paper
Document Code:24DDA04061DE4D03B344C2068828930D
Discipline:Engineering
First-Level Discipline:Software Engineering
Document Type:J
Volume:24
Issue:1
Page Number:bbac023
Number of Words:12
Translation or Not:no
Date of Publication:2022-02-01
Included Journals:SCI
Attachments: