Virtualizing China’s Biggest Online Marketplace for Training Reinforcement Learning

This article is part of the Academic Alibaba series and is taken from the paper entitled “Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning.” by Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen and An-Xiang Zeng. The full paper can be read here.

Image for post
Image for post
Image for post
Image for post
Taobao search in engine view and in customer view
Image for post
Image for post
The customer distributions between Taobao and the Virtual Taobao
Image for post
Image for post
The R2P distributions between Taobao and the Virtual Taobao
Image for post
Image for post

Alibaba Tech

First-hand and in-depth information about Alibaba’s latest technology → Search “Alibaba Tech” on Facebook

First-hand & in-depth information about Alibaba's tech innovation in Artificial Intelligence, Big Data & Computer Engineering. Follow us on Facebook!

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store