69 lines
2.1 KiB
Markdown
69 lines
2.1 KiB
Markdown
---
|
|
date: '2022-12-19T14:55:52'
|
|
hypothesis-meta:
|
|
created: '2022-12-19T14:55:52.384335+00:00'
|
|
document:
|
|
title:
|
|
- My AI Safety Lecture for UT Effective Altruism
|
|
flagged: false
|
|
group: __world__
|
|
hidden: false
|
|
id: O7YUan-tEe29vjfmuBFMKQ
|
|
links:
|
|
html: https://hypothes.is/a/O7YUan-tEe29vjfmuBFMKQ
|
|
incontext: https://hyp.is/O7YUan-tEe29vjfmuBFMKQ/scottaaronson.blog/?p=6823
|
|
json: https://hypothes.is/api/annotations/O7YUan-tEe29vjfmuBFMKQ
|
|
permissions:
|
|
admin:
|
|
- acct:ravenscroftj@hypothes.is
|
|
delete:
|
|
- acct:ravenscroftj@hypothes.is
|
|
read:
|
|
- group:__world__
|
|
update:
|
|
- acct:ravenscroftj@hypothes.is
|
|
tags:
|
|
- explainability
|
|
- nlproc
|
|
target:
|
|
- selector:
|
|
- endContainer: /div[2]/div[2]/div[2]/div[1]/p[95]
|
|
endOffset: 193
|
|
startContainer: /div[2]/div[2]/div[2]/div[1]/p[95]
|
|
startOffset: 0
|
|
type: RangeSelector
|
|
- end: 38138
|
|
start: 37945
|
|
type: TextPositionSelector
|
|
- exact: So then to watermark, instead of selecting the next token randomly, the
|
|
idea will be to select it pseudorandomly, using a cryptographic pseudorandom
|
|
function, whose key is known only to OpenAI.
|
|
prefix: 'of output tokens) each time.
|
|
|
|
|
|
|
|
|
|
'
|
|
suffix: " That won\u2019t make any detectable"
|
|
type: TextQuoteSelector
|
|
source: https://scottaaronson.blog/?p=6823
|
|
text: Watermarking by applying cryptographic pseudorandom functions to the model
|
|
output instead of true random (true pseudo-random)
|
|
updated: '2022-12-19T14:55:52.384335+00:00'
|
|
uri: https://scottaaronson.blog/?p=6823
|
|
user: acct:ravenscroftj@hypothes.is
|
|
user_info:
|
|
display_name: James Ravenscroft
|
|
in-reply-to: https://scottaaronson.blog/?p=6823
|
|
tags:
|
|
- explainability
|
|
- nlproc
|
|
- hypothesis
|
|
type: annotation
|
|
url: /annotations/2022/12/19/1671461752
|
|
|
|
---
|
|
|
|
|
|
|
|
<blockquote>So then to watermark, instead of selecting the next token randomly, the idea will be to select it pseudorandomly, using a cryptographic pseudorandom function, whose key is known only to OpenAI.</blockquote>Watermarking by applying cryptographic pseudorandom functions to the model output instead of true random (true pseudo-random) |