Reinforcement learning approach for disassembly