Malicious actors are targeting LLM prompt flow and response handling to bypass safety controls.
LLM Jacking is emerging as a real threat to enterprise Generative AI deployments. Rather than attacking the model itself, adversaries are exploiting the broader prompt and output processing chain to make models act against policy, leak secrets, or execute unauthorized actions.
This attack class includes prompt injection, hidden-prompt manipulation, tool abuse, and outcome hijacking. Security teams must now protect the entire model execution pipeline, not just the endpoint.
LLM Jacking involves attackers changing the way the model is instructed or how its outputs are handled. Common tactics include:
Not all AI systems are equally vulnerable. LLM Jacking is especially critical where models are chained with external tools, action runners, or prompt-based workflows. The biggest risk is when binary decisions and automated workflows depend directly on generated text.
LLM Jacking is a practical example of attackers treating AI as part of the software supply chain. If you are using LLMs for automation, decision support, or data access, assume that attackers will test prompt and output pathways for weak points.
Enterprise teams should harden prompt flows, add verification around AI outputs, and treat every model integration as a potential attack surface.