NDLI: The Epoch-Greedy algorithm for contextual multi-armed bandits

Content Provider	Semantic Scholar
Author	Langford, John Zhang, Tong
Copyright Year	2007
Abstract	We present Epoch-Greedy, an algorithm for contextual multi-armed bandits (also known as bandits with side information). Epoch-Greedy has the following properties: 1. No knowledge of a time horizon T is necessary. 2. The regret incurred by Epoch-Greedy is controlled by a sample complexity bound for a hypothesis class. 3. The regret scales as O(T2/3S1/3) or better (sometimes, much better). Here S is the complexity term in a sample complexity bound for standard supervised learning.
Starting Page	817
Ending Page	824
Page Count	8
File Format	PDF HTM / HTML
Alternate Webpage(s)	http://www.stat.rutgers.edu/home/tzhang/papers/nips07-bandits.pdf
Alternate Webpage(s)	http://stat.rutgers.edu/home/tzhang/papers/nips07-bandits.pdf
Alternate Webpage(s)	http://hunch.net/~jl/projects/RL/sidebandits/bandit.pdf
Alternate Webpage(s)	http://courses.cms.caltech.edu/cs101.2/slides/cs101.2-05-contextual-bandits.pdf
Alternate Webpage(s)	http://hunch.net/~jl/projects/interactive/sidebandits/bandit.pdf
Alternate Webpage(s)	http://papers.nips.cc/paper/3178-the-epoch-greedy-algorithm-for-multi-armed-bandits-with-side-information.pdf
Alternate Webpage(s)	http://books.nips.cc/papers/files/nips20/NIPS2007_0785.pdf
Alternate Webpage(s)	http://cseweb.ucsd.edu/~kamalika/teaching/CSE291W11/feb28.pdf
Alternate Webpage(s)	http://machinelearning.wustl.edu/mlpapers/paper_files/NIPS2007_785.pdf
Journal	NIPS 2007
Language	English
Access Restriction	Open
Content Type	Text
Resource Type	Article

Central Library (ISO-9001:2015 Certified)
Indian Institute of Technology Kharagpur
Kharagpur, West Bengal, India | PIN - 721302

See location in the Map
03222 282435
Mail: support@ndl.gov.in

Sl.	Authority	Responsibilities	Communication Details
1	Ministry of Education (GoI), Department of Higher Education	Sanctioning Authority	https://www.education.gov.in/ict-initiatives
2	Indian Institute of Technology Kharagpur	Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project	https://www.iitkgp.ac.in
3	National Digital Library of India Office, Indian Institute of Technology Kharagpur	The administrative and infrastructural headquarters of the project	Dr. B. Sutradhar bsutra@ndl.gov.in
4	Project PI / Joint PI	Principal Investigator and Joint Principal Investigators of the project	Dr. B. Sutradhar bsutra@ndl.gov.in Prof. Saswat Chakrabarti will be added soon
5	Website/Portal (Helpdesk)	Queries regarding NDLI and its services	support@ndl.gov.in
6	Contents and Copyright Issues	Queries related to content curation and copyright issues	content@ndl.gov.in
7	National Digital Library of India Club (NDLI Club)	Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach	clubsupport@ndl.gov.in
8	Digital Preservation Centre (DPC)	Assistance with digitizing and archiving copyright-free printed books	dpc@ndl.gov.in
9	IDR Setup or Support	Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops	idr@ndl.gov.in

The epoch-greedy algorithm for contextual multi-armed bandits.

The epoch-greedy algorithm for contextual multi-armed bandits (2008)

Contextual multi-armed bandits.

Contextual multi-armed bandits — appendix.

A time and space efficient algorithm for contextual linear bandits.

Contextual multi-armed bandits for web server defense

2 Contextual Multi-armed Bandits for the Prevention of Spam in VoIP Networks

Contextual Multi-armed Bandits for Web Server Defense

An Empirical Study of Human Behavioral Agents in Bandits, Contextual Bandits and Reinforcement Learning

The Epoch-Greedy algorithm for contextual multi-armed bandits

Similar Documents

The epoch-greedy algorithm for contextual multi-armed bandits.

The epoch-greedy algorithm for contextual multi-armed bandits (2008)

Contextual multi-armed bandits.

Contextual multi-armed bandits — appendix.

A time and space efficient algorithm for contextual linear bandits.

Contextual multi-armed bandits for web server defense

2 Contextual Multi-armed Bandits for the Prevention of Spam in VoIP Networks

Contextual Multi-armed Bandits for Web Server Defense

An Empirical Study of Human Behavioral Agents in Bandits, Contextual Bandits and Reinforcement Learning

The Epoch-Greedy algorithm for contextual multi-armed bandits