Faster Machine Learning on Big Data Sets

Event details
Date | 17.06.2014
Hour | 09:00
Speaker | James KWOK, The Hong Kong University of Science and Technology, Hong Kong
Location |
Category | Conferences - Seminars |
On big data sets, it is often challenging to learn the parameters of a machine learning model. A popular technique is stochastic gradient descent, which computes the gradient on a single sample instead of over the whole data set. An alternative is distributed processing, which is particularly natural when a single computer cannot store or process the whole data set. In this talk, some recent extensions of both approaches will be presented. For stochastic gradient, instead of using information from only one sample, we incrementally approximate the full gradient by also reusing old gradient values computed at the other samples. The resulting method enjoys the same computational simplicity as existing stochastic algorithms, but converges faster. Existing distributed machine learning algorithms, on the other hand, are often fully synchronized, so the system can move forward only at the pace of the slowest worker. I will present an asynchronous algorithm that requires only partial synchronization, allowing the master to incorporate updates from the faster workers more often.
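For concreteness, the incremental-gradient idea can be illustrated with a minimal Python sketch in the spirit of stochastic average gradient methods: a table of the most recent gradient for each sample is kept, and each step moves along the average of the stored gradients rather than along a single sample's gradient. The least-squares loss, function name, and step size below are illustrative assumptions, not details from the talk.

```python
import numpy as np


def sag_sketch(X, y, n_epochs=10, lr=0.01):
    """Minimal sketch of an incremental (SAG-style) gradient method for
    least-squares regression: keep the last gradient computed for every
    sample and step along the running average of the stored gradients."""
    n, d = X.shape
    w = np.zeros(d)
    grad_table = np.zeros((n, d))   # most recent gradient seen for each sample
    grad_sum = np.zeros(d)          # running sum of the stored gradients

    for _ in range(n_epochs):
        for _ in range(n):
            i = np.random.randint(n)
            # fresh gradient at sample i (least-squares loss assumed)
            g_new = (X[i] @ w - y[i]) * X[i]
            # swap out the stale gradient of sample i in the running sum
            grad_sum += g_new - grad_table[i]
            grad_table[i] = g_new
            # step along the averaged (approximate full) gradient
            w -= lr * grad_sum / n
    return w


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(500, 10))
    y = X @ rng.normal(size=10) + 0.1 * rng.normal(size=500)
    print("learned weights:", sag_sketch(X, y))
```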
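The partial-synchronization idea can likewise be illustrated with a toy parameter server in which worker updates are applied as soon as they arrive, and a fast worker is blocked only when it runs too far ahead of the slowest one. The class name, staleness bound, loss, and update rule below are assumptions made for illustration, not the speaker's algorithm.

```python
import threading
import numpy as np


class PartialSyncMaster:
    """Toy parameter server with partial synchronization: updates are
    applied as soon as they arrive, and a worker is blocked only when it
    runs more than `max_staleness` steps ahead of the slowest worker."""

    def __init__(self, dim, n_workers, max_staleness=2, lr=0.01):
        self.w = np.zeros(dim)
        self.clock = [0] * n_workers      # per-worker iteration counters
        self.max_staleness = max_staleness
        self.lr = lr
        self.cond = threading.Condition()

    def push(self, worker_id, grad):
        with self.cond:
            # a fast worker waits until the slowest one is close enough
            while self.clock[worker_id] - min(self.clock) > self.max_staleness:
                self.cond.wait()
            self.w -= self.lr * grad      # incorporate the update immediately
            self.clock[worker_id] += 1
            self.cond.notify_all()

    def pull(self):
        with self.cond:
            return self.w.copy()


def worker(master, worker_id, X, y, n_steps=100):
    """Each worker repeatedly pulls the current model, computes a
    least-squares gradient on its own data shard, and pushes it back."""
    rng = np.random.default_rng(worker_id)
    for _ in range(n_steps):
        w = master.pull()
        i = rng.integers(len(y))
        grad = (X[i] @ w - y[i]) * X[i]
        master.push(worker_id, grad)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X, y = rng.normal(size=(200, 5)), rng.normal(size=200)
    master = PartialSyncMaster(dim=5, n_workers=4)
    threads = [threading.Thread(target=worker, args=(master, k, X[k::4], y[k::4]))
               for k in range(4)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    print("final parameters:", master.pull())
```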
Practical information
- General public
- Free
Organizer
- Boi Faltings
Contact
- Sylvie Thomet