Developing a Hybrid Model to Predict Student First Year Retention and Academic Success in STEM Disciplines Using Neural Networks
by Alkhasawneh, Ruba, Ph.D., VIRGINIA COMMONWEALTH UNIVERSITY, 2011, 134 pages; 3473939

Abstract:

Understanding the reasoning behind the low enrollment and retention rates of Underrepresented Minority (URM) students (African Americans, Hispanic Americans, and Native Americans) in the disciplines of science, technology, engineering, and mathematics (STEM) has concerned many researchers for decades. Numerous studies have used traditional statistical methods to identify factors that affect and predict student retention. Recently, researchers have relied on using data mining techniques for modeling student retention in higher education [1].

This research has used neural networks for performance modeling in order to obtain an adequate understanding of factors related to first year academic success and retention of URM at Virginia Commonwealth University.

This research used feed forward back-propagation architecture for modeling. The student retention model was developed based on fall to fall retention in STEM majors. The overall freshman year GPA was used to model student academic success. Each model was built in two different ways: the first was built using all available student inputs, and the second using an optimized subset of student inputs. The optimized subset of the most relevant features that comes with the student, such as demographic attributes, high school rank, and SAT test scores was formed using genetic algorithms.

A further step towards understanding the retention of URM groups in STEM fields was taken by conducting a series of focus groups with participants of an intervention program at VCU. Focus groups were designed to elicit responses from participants for identifying factors that affect their retention the most and provide more knowledge about their first year experiences, academically and socially. Results of the genetic algorithm and focus groups were incorporated into building a hybrid model using the most relevant student inputs.

The developed hybrid model is shown to be a valuable tool in analyzing and predicting student academic success and retention. In particular, we have shown that identifying the most relevant student inputs from the student's perspective can be incorporated with quantitative methodologies to build a tool that can be used and interpreted effectively by people who are related to the field of STEM retention and education. Further, the hybrid model performed comparable to the model developed using the optimized set of inputs that resulted from the genetic algorithm. The GPA prediction hybrid model was tested to determine how well it would predict the GPA for all students, majority students and URM students. The root mean squared error (RMSE) on a 4.0 scale was 0.45 for all students, 0.47 for majority students, and 0.45 for URM students. The hybrid retention model was able to predict student retention correctly for 74% of all students, 79% of majority students and 60% of URM students. The hybrid model's accuracy was increased 3% compared to the model which used the optimized set of inputs.

 
AdviserRosalyn S. Hobson
SchoolVIRGINIA COMMONWEALTH UNIVERSITY
SourceDAI/B 72-12, p. , Oct 2011
Source TypeDissertation
SubjectsEducation; Engineering
Publication Number3473939
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3473939
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.