Develop a Vertical Search Engine - Report Writing - IT Assignment Help

Download Solution Order New Solution
Assignment Task

 

Task:
Develop a vertical search engine similar to Google Scholar that only retrieves papers/books published by a member of Coventry University. That is, at least one of the co-authors must be from CU. To that end, you crawl Google Scholar profiles of academic staff at CU and index their papers in their profiles. The seed page for your crawler, i.e. the first page to crawl, is the Google Scholar page for Coventry University.
Your system crawls this page and the links provided for each member of staff there to access their Google Scholar profiles. Then for each profile, it goes through the publications and construct the inverted index using the information about those publications. Because of low rate of changes to this information, your crawler may be scheduled to look for new information, say, once per week, but it should ideally be able to do so automatically, as a scheduled task.
From the user’s point of view, your system has an interface that is similar to the Google Scholar main page, where the user can type in their queries/keywords about the resources they want to find. Then, your system will display the results, sorted by relevance, in a similar way Google Scholar does. However, only publications with at least one co-author from CU are retrieved. You may further specialise your search engine to a specific field, e.g., computer science, mechanical engineering, bioinformatics or whatever you would like. In addition, whether as a separate program or integrated with search engine, a subject classification functionality is needed. More specifically, the input is a scientific text and the output is its subject among zero or more of the cases: Health, Engineering, Business, Art. You can use any general purpose programming language of your choice although Python is recommended because of its rich library and sample codes developed in the labs. In case of ambiguity, make reasonable assumptions and/or let me know.
Please note that to show that your system meets each of the above-mentioned requirements, your report must provide sufficient evidence including clear description, complete source code, and complete screenshots where applicable.

 

 

This IT Assignment has been solved by our IT experts at My Uni Paper. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.
Be it a used or new solution, the quality of the work submitted by our assignment experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose a new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.

Get It Done! Today

Country
Applicable Time Zone is AEST [Sydney, NSW] (GMT+11)
+

Every Assignment. Every Solution. Instantly. Deadline Ahead? Grab Your Sample Now.