Preserving the Integrity of Large Language Models: Strategies Against Adversarial Attacks and Input Distortions
Keywords: Large Language Models, Robustness, Adversarial Attacks, Input Perturbations, Adversarial Training, Robust Optimization, Input Preprocessing, Vulnerabilities

Abstract
Large language models (LLMs) have demonstrated unprecedented performance across diverse natural language processing tasks, yet their vulnerability to adversarial attacks and input distortions raises concerns about their integrity and reliability. This paper investigates strategies for preserving the integrity of LLMs against adversarial attacks and input distortions, including adversarial training, robust optimization, and input preprocessing. By addressing this crucial issue of integrity preservation, the research contributes to the development of trustworthy and dependable large language models for real-world applications.
Published: 01-05-2024
Section: Articles
How to Cite
Preserving the Integrity of Large Language Models: Strategies Against Adversarial Attacks and Input Distortions. (2024). Asian American Research Letters Journal, 1(1). https://aarlj.com/index.php/AARLJ/article/view/10