Draft:Edge intelligence

Edge intelligence

Edge intelligence (or Edge AI) is a paradigm in distributed computing that integrates artificial intelligence (AI) capabilities directly into edge devices and edge servers, enabling data processing and decision-making close to the data source [1]. It combines concepts from edge computing and machine learning to reduce latency, improve privacy, and enable real-time analytics in resource-constrained environments.

Overview

Traditional cloud-based AI systems rely on centralized data processing, where raw data is transmitted from end devices to remote data centers. In contrast, edge intelligence shifts computation toward the network edge—such as sensors, mobile devices, and embedded systems—allowing models to operate locally or collaboratively across distributed nodes.[1]

This approach is particularly useful in applications that require:[1]

  • Low latency (e.g., autonomous driving, industrial automation)
  • Bandwidth efficiency (reducing data transmission to the cloud)
  • Privacy preservation (keeping sensitive data local)

Architecture

A typical edge intelligence system consists of three layers:[2]

  1. Device layer Includes sensors, IoT devices, and mobile endpoints that collect and sometimes process data locally.
  2. Edge layer Comprises edge servers or gateways that perform intermediate processing, model inference, or aggregation.
  3. Cloud layer Provides large-scale training, global coordination, and long-term storage.

These layers often collaborate through distributed learning frameworks such as federated learning or split learning.[2]

Key techniques

Model compression and optimization

Edge devices have limited computational resources. Techniques such as pruning, quantization, and knowledge distillation are used to reduce model size and complexity.[3]

Distributed learning

Learning is distributed across multiple devices[4]:

  • Federated learning enables local model training without sharing raw data.
  • Split learning divides a model between client and server to balance computation and privacy.

On-device inference

Models are deployed directly on edge devices for real-time decision-making, avoiding cloud latency.[3]

Edge-cloud collaboration

Hybrid approaches dynamically partition tasks between edge and cloud depending on resource availability and task requirements.[2]

Applications

Edge intelligence is widely used in [1]:

  • Autonomous vehicles: real-time perception and decision-making
  • Smart cities: traffic monitoring and infrastructure management
  • Healthcare: wearable devices and remote diagnostics
  • Industrial IoT: predictive maintenance and process optimization
  • Smart homes: voice assistants and security systems

Advantages

Benefits for end users [1]:

  • Low latency: real-time processing without cloud dependency
  • Improved privacy: sensitive data remains on-device
  • Reduced bandwidth usage: less data transmitted to the cloud
  • Scalability: distributed computation across many devices

Challenges

Research Challenges [2] :

  • Resource constraints: limited memory, compute, and power
  • Model heterogeneity: varying device capabilities
  • Security and privacy risks: potential leakage from models or communication
  • System complexity: coordination across distributed nodes
  • Non-IID data: data distributions differ across devices

Research directions

Current research in edge intelligence focuses on:

  • Efficient distributed training under resource constraints
  • Privacy-preserving and secure learning methods
  • Adaptive model partitioning and scheduling
  • Robustness against adversarial attacks
  • Integration with 5G/6G and next-generation networks

See also

References

  1. Raghubir Singh, Sukhpal Singh Gill, Edge AI: A survey, Internet of Things and Cyber-Physical Systems, Volume 3, 2023, Pages 71-92, ISSN 2667-3452, https://doi.org/10.1016/j.iotcps.2023.02.004.
  2. Yin Zhang, Chi Jiang, Binglei Yue, Jiafu Wan, Mohsen Guizani, Information fusion for edge intelligence: A survey, Information Fusion, Volume 81, 2022, Pages 171-186, ISSN 1566-2535, https://doi.org/10.1016/j.inffus.2021.11.018.

  1. ^ a b c d e "What is edge AI?". ibm.com. IBM. Retrieved 3/25/2026. {{cite web}}: Check date values in: |access-date= and |date= (help)
  2. ^ a b c d Javier Mendez, Kay Bierzynski, M. P. Cuéllar, and Diego P. Morales. 2022. Edge Intelligence: Concepts, Architectures, Applications, and Future Directions. ACM Trans. Embed. Comput. Syst. 21, 5, Article 48 (September 2022), 41 pages. https://doi.org/10.1145/3486674
  3. ^ a b Wang, Xubin; Jia, Weijia (4 Jan 2025). "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies". arXiv:2501.03265v1 [cs.LG].
  4. ^ Prabath Abeysekara, Hai Dong, A.K. Qin (24 Oct 2022). "Deep Edge Intelligence: Architecture, Key Features, Enabling Technologies and Challenges". arXiv.

Content Disclaimer

Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.

  1. The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
  2. There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
  3. It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
  4. Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
  5. Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.