SvgAI - Training Methods Analysis of Artificial Intelligent Agent to use SVG Editor

Anh H. Dang, Wataru Kameyama

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Deep reinforcement learning has been successfully used to train artificial intelligence (AI) agents that outperform humans in many tasks. The objective of this research is to train an AI agent to draw SVG images using a scalable vector graphics (SVG) editor with deep reinforcement learning, where the agent is to draw SVG images that are as similar as possible to given target raster images. In this paper, we propose a framework to train the AI agent with value-function-based Q-learning and policy-gradient-based learning methods. With the Q-learning-based method, we find that it is crucial to divide the action space into two sets and to apply a different exploration policy to each set during training. Evaluations show that our proposed dual ϵ-greedy exploration policy greatly stabilizes the training process and increases the accuracy of the AI agent. On the other hand, policy-gradient-based training does not depend on an external reward function. However, it is hard to implement, especially in an environment with a large action space. To overcome this difficulty, we propose a strategy similar to dynamic programming that allows the agent to generate training samples by itself. In our evaluation, the highest score is achieved by the agent trained with this proposed method. SVG images produced by the proposed AI agent are also of superior quality compared with popular raster-to-SVG conversion software.
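
The abstract does not specify how the action space is split or how the two exploration rates are combined, so the following is only an illustrative sketch of one possible reading of a dual ϵ-greedy policy, not the authors' implementation. The function name `dual_epsilon_greedy`, the action sets `set_a` and `set_b`, and the rates `eps_a` and `eps_b` are all hypothetical. The sketch assumes the two sets are disjoint lists of action indices and that each set has its own probability of being explored at random; otherwise the greedy action over all Q-values is taken.

```python
import random


def dual_epsilon_greedy(q_values, set_a, set_b, eps_a, eps_b, rng=random):
    """Pick an action index using a separate exploration rate per action set.

    Assumption (not from the paper): with probability eps_a a random action
    from set_a is taken, with probability eps_b a random action from set_b,
    and otherwise the action with the highest Q-value is chosen.
    """
    if eps_a < 0 or eps_b < 0 or eps_a + eps_b > 1.0:
        raise ValueError("eps_a and eps_b must be non-negative and sum to at most 1")

    u = rng.random()  # uniform in [0, 1)
    if u < eps_a:
        return rng.choice(set_a)  # explore inside the first set
    if u < eps_a + eps_b:
        return rng.choice(set_b)  # explore inside the second set
    # Exploit: greedy action over the whole action space.
    return max(range(len(q_values)), key=lambda a: q_values[a])


if __name__ == "__main__":
    q = [0.1, 0.9, 0.3, 0.2]
    # Hypothetical split: actions 0-1 in one set, actions 2-3 in the other.
    print(dual_epsilon_greedy(q, [0, 1], [2, 3], eps_a=0.05, eps_b=0.30))
```

Keeping the two rates separate lets one set be explored aggressively while the other stays close to greedy, which is the kind of control a single shared ϵ cannot provide.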

Original language: English
Title of host publication: 21st International Conference on Advanced Communication Technology
Subtitle of host publication: ICT for 4th Industrial Revolution!, ICACT 2019 - Proceeding
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1159-1166
Number of pages: 8
ISBN (Electronic): 9791188428021
Publication status: Published - 2019 Apr 29
Event: 21st International Conference on Advanced Communication Technology, ICACT 2019 - Pyeongchang, Korea, Republic of
Duration: 2019 Feb 17 to 2019 Feb 20

Publication series

Name: International Conference on Advanced Communication Technology, ICACT
Volume: 2019-February
ISSN (Print): 1738-9445

Conference

Conference: 21st International Conference on Advanced Communication Technology, ICACT 2019
Country: Korea, Republic of
City: Pyeongchang
Period: 19/2/17 to 19/2/20

Keywords

  • Exploration Policy
  • Q-learning
  • Reinforcement Learning
  • SVG

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

