Loading...
Please wait, while we are loading the content...
Similar Documents
A preprocessing method of internet search data for prediction improvement: application to Chinese stock market
| Content Provider | ACM Digital Library |
|---|---|
| Author | Peng, Geng Lv, Benfu Yuan, Qingyu Liu, Ying |
| Abstract | The correlations between Internet search data and socio-economic Indicators have been proved in many studies, but the basis work of these studies - data preprocessing, determining the quality of the result, has lacked a systematic methodology. In this paper, we develop a comprehensive method for Internet search data preprocessing, which includes the critical steps: (a) keywords selection, (b) time difference measurement, and (c) leading index composition. Applying our method to study Chinese stock market price, we can get the leading keywords index with stable leading relation and high degree of fit. Specifically, the correlation coefficient between our leading keywords index and Shanghai Composite Index reaches 98.7%, and Granger test confirms that keywords index has significant prediction ability for Shanghai Composite Index. Adding keywords index to the AR model can reduce the MAPE from 3.8% to 1.4%, and each percentage point change of keywords index is correlated with 0.136 percentage point move in the same direction of Shanghai Composite Index in next period. |
| Starting Page | 1 |
| Ending Page | 7 |
| Page Count | 7 |
| File Format | |
| ISBN | 9781450315517 |
| DOI | 10.1145/2462130.2462133 |
| Language | English |
| Publisher | Association for Computing Machinery (ACM) |
| Publisher Date | 2012-08-12 |
| Publisher Place | New York |
| Access Restriction | Subscribed |
| Subject Keyword | Leading keywords index Stepwise composition Time difference measurement Internet search data Preprocessing method |
| Content Type | Text |
| Resource Type | Article |