Loading...
Please wait, while we are loading the content...
Similar Documents
Toward Evaluation that Leads to Best Practices: Reconciling Dialog Evaluation in Research and Industry
| Content Provider | Semantic Scholar |
|---|---|
| Author | Paek, Tim |
| Copyright Year | 2007 |
| Abstract | Dialog evaluation is approached in different ways by research and industry. While researchers have sought commensurable evaluation metrics that allow for comparison of disparate systems with varying tasks and domains, industry engineers have focused mostly on best practices and delivering a return-on-investment to customers. In this paper, we contend that the problem of finding commensurable metrics also applies to commercial evaluation, and critically survey four candidate metrics for commensurability. Finally, in light of the problems faced by the candidate metrics, we advocate a collaborative agenda for dialog evaluation based on using statistical meta-analysis for empirically establishing best practices from any evaluation metric. |
| Starting Page | 40 |
| Ending Page | 47 |
| Page Count | 8 |
| File Format | PDF HTM / HTML |
| DOI | 10.3115/1556328.1556334 |
| Alternate Webpage(s) | http://www.aclweb.org/anthology/W/W07/W07-0306.pdf |
| Alternate Webpage(s) | http://aclweb.org/anthology/W/W07/W07-0306.pdf |
| Alternate Webpage(s) | http://aclweb.org/anthology/W07-0306 |
| Alternate Webpage(s) | http://anthology.aclweb.org/W/W07/W07-0306.pdf |
| Alternate Webpage(s) | http://wing.comp.nus.edu.sg/~antho/W/W07/W07-0306.pdf |
| Alternate Webpage(s) | http://www.aclweb.org/anthology/W07-0306 |
| Alternate Webpage(s) | http://aclweb.org/anthology//W/W07/W07-0306.pdf |
| Alternate Webpage(s) | https://doi.org/10.3115/1556328.1556334 |
| Journal | HLT-NAACL 2007 |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |