Add 'brainsteam/content/annotations/2022/12/19/1671461752.md'
continuous-integration/drone/push Build is passing Details

This commit is contained in:
ravenscroftj 2022-12-19 15:00:10 +00:00
parent 293d26a1be
commit a29cf0aa40
1 changed files with 69 additions and 0 deletions

View File

@ -0,0 +1,69 @@
---
date: '2022-12-19T14:55:52'
hypothesis-meta:
created: '2022-12-19T14:55:52.384335+00:00'
document:
title:
- My AI Safety Lecture for UT Effective Altruism
flagged: false
group: __world__
hidden: false
id: O7YUan-tEe29vjfmuBFMKQ
links:
html: https://hypothes.is/a/O7YUan-tEe29vjfmuBFMKQ
incontext: https://hyp.is/O7YUan-tEe29vjfmuBFMKQ/scottaaronson.blog/?p=6823
json: https://hypothes.is/api/annotations/O7YUan-tEe29vjfmuBFMKQ
permissions:
admin:
- acct:ravenscroftj@hypothes.is
delete:
- acct:ravenscroftj@hypothes.is
read:
- group:__world__
update:
- acct:ravenscroftj@hypothes.is
tags:
- explainability
- nlproc
target:
- selector:
- endContainer: /div[2]/div[2]/div[2]/div[1]/p[95]
endOffset: 193
startContainer: /div[2]/div[2]/div[2]/div[1]/p[95]
startOffset: 0
type: RangeSelector
- end: 38138
start: 37945
type: TextPositionSelector
- exact: So then to watermark, instead of selecting the next token randomly, the
idea will be to select it pseudorandomly, using a cryptographic pseudorandom
function, whose key is known only to OpenAI.
prefix: 'of output tokens) each time.
'
suffix: " That won\u2019t make any detectable"
type: TextQuoteSelector
source: https://scottaaronson.blog/?p=6823
text: Watermarking by applying cryptographic pseudorandom functions to the model
output instead of true random (true pseudo-random)
updated: '2022-12-19T14:55:52.384335+00:00'
uri: https://scottaaronson.blog/?p=6823
user: acct:ravenscroftj@hypothes.is
user_info:
display_name: James Ravenscroft
in-reply-to: https://scottaaronson.blog/?p=6823
tags:
- explainability
- nlproc
- hypothesis
type: annotation
url: /annotations/2022/12/19/1671461752
---
<blockquote>So then to watermark, instead of selecting the next token randomly, the idea will be to select it pseudorandomly, using a cryptographic pseudorandom function, whose key is known only to OpenAI.</blockquote>Watermarking by applying cryptographic pseudorandom functions to the model output instead of true random (true pseudo-random)