Edge Computing Reinforcement Learning: Optimizing AI Model Deployments
Keywords:
reinforcement learning, edge computing, resource constraints, model optimizationAbstract
Edge computing installs AI models for data sources to reduce the latency, decision making and bandwidth. This AI model deployment and administration is limited by processing power, memory, and energy. Deployment of this model benefits dynamic and adaptive reinforcement learning which improves resource allocation job scheduling and energy management with environmental requirement. This research paper uses Deep Q Network and proximal policy optimization to merge RL with resource constraint edge computing.
Downloads
References
Shi, W., Cao, J., Zhang, Q., Li, Y., & Xu, L. (2016). Edge computing: Vision and challenges. IEEE Internet of Things Journal, 3(5), 637-646.
Satyanarayanan, M. (2017). The emergence of edge computing. Computer, 50(1), 30-39.
He, K., et al. (2016). Deep residual learning for image recognition. Proceedings of CVPR, 770-778.
Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction. MIT Press.
Mnih, V., et al. (2015). Human-level control through deep reinforcement learning. Nature, 518(7540), 529-533.
Silver, D., et al. (2016). Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), 484-489.
Wang, X., et al. (2019). RL-based task offloading in mobile edge computing. IEEE Access, 7, 29498-29510.
Schulman, J., et al. (2017). Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347.
Zhang, C., et al. (2020). Dynamic model partitioning with RL. ACM Transactions on Edge Computing, 12(4), 123-135.
S. Kumari, “Kanban and AI for Efficient Digital Transformation: Optimizing Process Automation, Task Management, and Cross-Departmental Collaboration in Agile Enterprises”, Blockchain Tech. & Distributed Sys., vol. 1, no. 1, pp. 39–56, Mar. 2021
Sivaraman, Hariprasad. (2020). Integrating Large Language Models for Automated Test Case Generation in Complex Systems.
Singu, Santosh Kumar. "Real-Time Data Integration: Tools, Techniques, and Best Practices." ESP Journal of Engineering & Technology Advancements 1.1 (2021): 158-172.
S. Kumari, “Kanban-Driven Digital Transformation for Cloud-Based Platforms: Leveraging AI to Optimize Resource Allocation, Task Prioritization, and Workflow Automation”, J. of Artificial Int. Research and App., vol. 1, no. 1, pp. 568–586, Jan. 2021
S. Kumari, “Kanban and Agile for AI-Powered Product Management in Cloud-Native Platforms: Improving Workflow Efficiency Through Machine Learning-Driven Decision Support Systems”, Distrib Learn Broad Appl Sci Res, vol. 5, pp. 867–885, Aug. 2019
S. Kumari, “Digital Transformation Frameworks for Legacy Enterprises: Integrating AI and Cloud Computing to Revolutionize Business Models and Operational Efficiency ”, Journal of AI-Assisted Scientific Discovery, vol. 1, no. 1, pp. 186–204, Jan. 2021
Sivaraman, Hariprasad. (2020). Intelligent Deployment Orchestration Using ML for Multi-Environment CI/CD Pipelines.
S. Kumari, “AI-Powered Cybersecurity in Agile Workflows: Enhancing DevSecOps in Cloud-Native Environments through Automated Threat Intelligence ”, J. Sci. Tech., vol. 1, no. 1, pp. 809–828, Dec. 2020.
S. Kumari, “Cloud Transformation and Cybersecurity: Using AI for Securing Data Migration and Optimizing Cloud Operations in Agile Environments”, J. Sci. Tech., vol. 1, no. 1, pp. 791–808, Oct. 2020.
Singu, Santosh Kumar. "Designing scalable data engineering pipelines using Azure and Databricks." ESP Journal of Engineering & Technology Advancements 1.2 (2021): 176-187.
Sivaraman, Hariprasad. (2021). INTELLIGENT AUTOMATION FOR SERVICE DEGRADATION PREDICTION USING LLMS AND OBSERVABILITY DATA. International Journal of Engineering Management. 6. 10.5281/zenodo.14342920.
S. Kumari, “AI-Powered Cloud Security for Agile Transformation: Leveraging Machine Learning for Threat Detection and Automated Incident Response ”, Distrib Learn Broad Appl Sci Res, vol. 6, pp. 467–488, Oct. 2020
Mao, Y., et al. (2017). A survey on mobile edge computing. IEEE Communications Surveys & Tutorials, 19(4), 2322-2358.
Han, S., et al. (2016). Deep compression: Compressing deep neural networks. Proceedings of ICLR.
Zhao, J., et al. (2021). RL for latency-aware edge AI deployments. IEEE Transactions on Neural Networks and Learning Systems, 32(5), 1234-1246.
Chen, T., et al. (2018). Energy-efficient edge computing. Proceedings of ACM IoTDI, 28-37.
Li, Y., et al. (2020). Energy-aware task scheduling in IoT. IEEE Internet of Things Journal, 7(8), 7487-7498.
Lin, X., et al. (2019). Task scheduling in multi-access edge computing using RL. IEEE Communications Magazine, 57(3), 62-69.
Kumar, S., et al. (2020). RL-based resource optimization in healthcare IoT. Computers & Security, 90, 101711.
Papernot, N., et al. (2018). Technical challenges of ML in cybersecurity. Proceedings of ACM CCS, 62-75.
Hinton, G., et al. (2015). Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531.
Li, W., et al. (2020). Distributed RL in edge computing. IEEE Transactions on Neural Networks and Learning Systems, 31(10), 4025-4035.
McMahan, H. B., et al. (2017). Federated learning: Collaborative machine learning. arXiv preprint arXiv:1712.07557.
Yang, Z., et al. (2018). RL for autonomous vehicles in edge environments. IEEE Transactions on Intelligent Vehicles, 3(2), 198-207.
Fang, X., et al. (2021). RL-based smart grid energy management. IEEE Transactions on Power Systems, 36(5), 4123-4132.
Tran, T., et al. (2020). Adaptive RL for telemedicine applications. IEEE Transactions on Multimedia, 22(12), 3015-3026.