The infrastructure 100 can transmit a user's input commands from the ULB 105 to the datacenter 116 and inject the commands into the remote device 112 and into the software 103 running on the remote device 112. Injecting the user input commands can include translating them into an input format compatible with the remote device 112. Furthermore, the user's input commands are entered on a replica display of the remote device 112, generated on the ULB 105, as will be described herein. The infrastructure 100 can provide, to the ULB 105, a video stream of the display of the remote device 112 and the display responses of the software 103. The user can inspect the responses of the software 103 on the ULB 105. A variety of software and hardware components of the infrastructure 100 can provide the functionality of transmitting user input commands to the software 103 running on the remote device 112, and of generating a replica or mirrored display of the remote device 112 on the ULB 105. In some embodiments, the video stream of the display of the remote device 112 is used to generate the replica or mirrored display of the remote device 112 on the ULB 105. The user input commands can be received in the form of interactions with the replica display on the ULB 105. For example, when the remote device 112 is a smart phone with a touch screen, the user can input taps and touch-screen gestures via mouse clicks, on-screen keyboard inputs, a laptop or desktop keyboard, and so forth. The infrastructure 100 can capture the user interactions with the replica display and transmit them to the remote device 112.
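By way of a non-limiting illustration, one possible form of the translation step is sketched below: a mouse click captured on the replica display is rescaled to a tap at the corresponding pixel of the display of the remote device 112. The names DisplaySize, TapCommand and translate_click, and the assumption that the replica display is a uniformly scaled, unrotated copy of the device display, are hypothetical and are introduced only for this example; they do not describe the actual implementation of the infrastructure 100.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DisplaySize:
    """Pixel dimensions of a display (hypothetical helper type)."""
    width: int
    height: int


@dataclass(frozen=True)
class TapCommand:
    """A tap expressed in the remote device's own pixel coordinates."""
    x: int
    y: int


def translate_click(click_x: int, click_y: int,
                    replica: DisplaySize, device: DisplaySize) -> TapCommand:
    """Map a click on the replica display to a tap on the remote device.

    Assumes the replica is a uniformly scaled, unrotated copy of the
    device display, so each axis can be rescaled independently.
    """
    inside = 0 <= click_x < replica.width and 0 <= click_y < replica.height
    if not inside:
        raise ValueError("click falls outside the replica display")
    # Integer rescaling keeps the result within the device's pixel grid.
    x = click_x * device.width // replica.width
    y = click_y * device.height // replica.height
    return TapCommand(x, y)


if __name__ == "__main__":
    replica = DisplaySize(width=360, height=640)    # replica shown on the ULB
    device = DisplaySize(width=1080, height=1920)   # remote device display
    print(translate_click(180, 320, replica, device))
```

In this sketch the resulting TapCommand would then be transmitted to the datacenter 116 for injection into the remote device 112; the transport and the injection mechanism are outside the scope of the example.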