Utilization of semantic linking and visualization techniques to facilitate knowledge creation from textual software artifacts
by Makovoz, Michael, Ph.D., NOVA SOUTHEASTERN UNIVERSITY, 2008, 191 pages; 3297657

Abstract:

In this dissertation, the researcher investigated how to promote knowledge creation and sharing among software team members by automatically linking existing textual knowledge artifacts via a centralized framework based on information retrieval. Software maintenance and support are important day-to-day tasks in IT departments and they cannot be performed properly without complete understanding of the system layout. During the project life cycle, different professionals might require different types of project knowledge. For instance, system management might be interested in overall system knowledge, while component developers most likely would express interest in more detailed and low level system knowledge. In this case, information sharing without taking into consideration levels of abstraction could be overwhelming.

Since overall knowledge about software projects is often distributed among different information sources such as source code, documentation and manuals, etc.; the researcher concluded that information is most often conveyed through the medium of text. Based on above conclusion, textual artifacts were the primary subjects of study during this research. The most important of them was the software source code because, due to programming language syntax, the application source code contains important structural and semantic information.

In this dissertation, the researcher concentrated on a centralized knowledge creation and sharing framework based on specialized information filtering techniques and information retrieval strategies. It helped to link existing textual artifacts and to facilitate knowledge discovery and sharing among team members. In order to construct semantic similarity links between different textual artifacts, the researcher used the LSI algorithm - a specialized information retrieval technique based on the vector space model and linear algebra. It was capable of providing a much cheaper and more flexible way to automate and to identify semantic links between textual artifacts based on their linguistic similarities.

The objectives of this research were met by developing a framework (and an experimental application based on it) that allowed one to link existing software artifacts and presented users with the concise list of the most relevant materials necessary to get answers about the desired topic. The framework employed different pre-processing techniques and visualized forms to discover and to present users with artifacts and their parts based on their semantic closeness and the desired sensitivity. The presented solution helped to recreate an indirect socialization stage and to promote explicit and tacit knowledge generation and transfer.

 
AdviserFrank J. Mitropoulos
SchoolNOVA SOUTHEASTERN UNIVERSITY
SourceDAI/B 69-02, p. , Jun 2008
Source TypeDissertation
SubjectsArtificial intelligence; Computer science
Publication Number3297657
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3297657
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.