[COMPASS-7111] Track and categorize instances of abuse Created: 14/Aug/23 Updated: 27/Oct/23 Resolved: 04/Oct/23 |
|
| Status: | Closed |
| Project: | Compass |
| Component/s: | GAI |
| Affects Version/s: | None |
| Fix Version/s: | No version |
| Type: | Task | Priority: | Major - P3 |
| Reporter: | Alena Khineika | Assignee: | Unassigned |
| Resolution: | Works as Designed | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Issue Links: |
|
||||||||
| Story Points: | 3 | ||||||||
| Documentation Changes: | Not Needed | ||||||||
| Description |
|
The prompt injection is out of scope of this project, but we should consider reviewing user prompts / responses for injection attempts to fingerprint for possible attacker behavior. https://arxiv.org/abs/2302.12173v2 |
| Comments |
| Comment by Jessica Sigafoos [ 04/Oct/23 ] |
|
We feel covered by our existing rate limiting. |