Towards Building a Scholarly Big Data Platform: Challenges, Lessons and Opportunities
| Content Provider | CiteSeerX |
|---|---|
| Abstract | We introduce a big data platform that provides various services for harvesting scholarly information and enabling efficient scholarly applications. The core architecture of the platform is built on a secured private cloud; it crawls data using a scholarly-focused crawler that leverages a dynamic scheduler, processes data using a MapReduce-based crawl-extraction-ingestion (CEI) workflow, and stores data in distributed repositories and databases. Services such as scholarly data harvesting, information extraction, and user information and log data analytics are integrated into the platform and provided through OAI and RESTful APIs. We also introduce a set of scholarly applications built on top of this platform, including citation recommendation and collaborator discovery. |
| File Format | |
| Access Restriction | Open |
| Subject Keyword | Scholarly Big Data Platform; Citation Recommendation; Scholarly Information; Various Services; Collaborator Discovery; Distributed Repository; Dynamic Scheduler; Secured Private Cloud; Log Data Analytics; Scholarly Data Harvesting; RESTful APIs; Core Architecture; Efficient Scholarly Application; User Information; Store Data; Scholarly Application; Information Extraction; Big Data Platform |
| Content Type | Text |