Placement in Integrated Circuits using Cyclic Reinforcement Learning and Simulated Annealing