Issue | E3S Web Conf., Volume 491 (2024): International Conference on Environmental Development Using Computer Science (ICECS’24)
---|---
Article Number | 01009
Number of page(s) | 9
Section | Energy Management for Sustainable Environment
DOI | https://doi.org/10.1051/e3sconf/202449101009
Published online | 21 February 2024
Towards efficiently solving the Rubik’s cube with deep reinforcement learning and recursion
1 Student, Department of Computer Science and Engineering, AAA College of Engineering and Technology, Amathur, Sivakasi, Tamil Nadu.
2 Student, Department of Computer Science and Engineering, AAA College of Engineering and Technology, Amathur, Sivakasi, Tamil Nadu.
3 Student, Department of Computer Science and Engineering, AAA College of Engineering and Technology, Amathur, Sivakasi, Tamil Nadu.
4 Student, Department of Computer Science and Engineering, AAA College of Engineering and Technology, Amathur, Sivakasi, Tamil Nadu.
5 Professor, Department of Computer Science and Engineering, AAA College of Engineering and Technology, Amathur, Sivakasi, Tamil Nadu.
1 Corresponding author: mahindraroshan413@gmail.com
2 Corresponding author: jhemalathakumar@gmail.com
The Rubik’s cube is a prototypical combinatorial puzzle with a large state space and a single goal state. The goal state is unlikely to be reached by sequences of randomly generated moves, which poses unique challenges for machine learning. The proposed work aims to solve the Rubik’s cube with recursion and with DeepCubeA, a deep reinforcement learning approach that learns to solve increasingly difficult states in reverse from the goal state without any domain-specific knowledge. DeepCubeA solves 100% of all test patterns, finding a shortest path to the goal state 60.3% of the time. DeepCubeA also generalizes to other combinatorial puzzles: it solves the 15 puzzle, 24 puzzle, 35 puzzle, 48 puzzle, Lights Out and Sokoban, finding a shortest path in the majority of verifiable cases. These models were trained with 1-4 GPUs and 20-30 CPUs; the allocation varied throughout training, as training was often stopped and restarted to make room for other processes. Further, our experiments compare Rubik’s cube solving with recursion against DeepCubeA, as well as against state-of-the-art models. Later, we intend to develop a new deep learning model with an application.
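As a minimal sketch of the "learn in reverse from the goal state" idea described above: training states can be sampled by applying a bounded number of random moves to the solved configuration, with the scramble depth raised over time so states become increasingly difficult. The snippet below is illustrative only and is not the paper's code; it uses the 15-puzzle (also mentioned in the abstract) purely because its move rules are short to implement, and the function names (`neighbours`, `scramble_from_goal`) are assumptions of this sketch.

```python
# Illustrative sketch: sampling training states by scrambling backwards
# from the goal, in the style of DeepCubeA-like curricula. Uses the
# 15-puzzle only for brevity; the same idea applies to the Rubik's cube.
import random

GOAL = tuple(range(1, 16)) + (0,)  # 0 marks the blank tile; this is the solved 15-puzzle


def neighbours(state):
    """Return all states reachable by sliding one tile into the blank."""
    s = list(state)
    b = s.index(0)
    r, c = divmod(b, 4)
    out = []
    for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < 4 and 0 <= nc < 4:
            n = nr * 4 + nc
            t = s[:]
            t[b], t[n] = t[n], t[b]  # slide the neighbouring tile into the blank
            out.append(tuple(t))
    return out


def scramble_from_goal(max_depth, rng=random):
    """Sample a training state by applying 1..max_depth random moves to the goal.

    Deeper scrambles give harder states, so increasing max_depth during
    training yields the "increasingly difficult states" curriculum.
    """
    state = GOAL
    depth = rng.randint(1, max_depth)
    for _ in range(depth):
        state = rng.choice(neighbours(state))
    return state, depth  # depth is an upper bound on the distance back to the goal


if __name__ == "__main__":
    for d in (1, 5, 30):
        s, k = scramble_from_goal(d)
        print(f"scramble depth <= {d}: state after {k} moves: {s}")
```

In such a setup, the scramble depth also provides a cheap upper bound on the cost-to-go, which is one way reverse-from-goal generation supplies supervision without any hand-coded solver knowledge.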
Key words: Cubics / recursion / Deep Learning model / reinforcement learning / training / GPU
© The Authors, published by EDP Sciences, 2024
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.