A multi-asset quantitative trading system that trains a PPO (Proximal Policy Optimization) reinforcement learning agent in a realistic market environment. Built with transaction costs, slippage ...
# all below 3 task are combined with opcode and having index 43 onwards for all of them in the function TN2int() # 'KERNEL_NETWORK_TASK_UDPIP'#index 43 # 42(opcode ...