Add 'brainsteam/content/annotations/2022/12/19/1671461752.md'
continuous-integration/drone/push Build is passing
Details
continuous-integration/drone/push Build is passing
Details
This commit is contained in:
parent
293d26a1be
commit
a29cf0aa40
|
@ -0,0 +1,69 @@
|
||||||
|
---
|
||||||
|
date: '2022-12-19T14:55:52'
|
||||||
|
hypothesis-meta:
|
||||||
|
created: '2022-12-19T14:55:52.384335+00:00'
|
||||||
|
document:
|
||||||
|
title:
|
||||||
|
- My AI Safety Lecture for UT Effective Altruism
|
||||||
|
flagged: false
|
||||||
|
group: __world__
|
||||||
|
hidden: false
|
||||||
|
id: O7YUan-tEe29vjfmuBFMKQ
|
||||||
|
links:
|
||||||
|
html: https://hypothes.is/a/O7YUan-tEe29vjfmuBFMKQ
|
||||||
|
incontext: https://hyp.is/O7YUan-tEe29vjfmuBFMKQ/scottaaronson.blog/?p=6823
|
||||||
|
json: https://hypothes.is/api/annotations/O7YUan-tEe29vjfmuBFMKQ
|
||||||
|
permissions:
|
||||||
|
admin:
|
||||||
|
- acct:ravenscroftj@hypothes.is
|
||||||
|
delete:
|
||||||
|
- acct:ravenscroftj@hypothes.is
|
||||||
|
read:
|
||||||
|
- group:__world__
|
||||||
|
update:
|
||||||
|
- acct:ravenscroftj@hypothes.is
|
||||||
|
tags:
|
||||||
|
- explainability
|
||||||
|
- nlproc
|
||||||
|
target:
|
||||||
|
- selector:
|
||||||
|
- endContainer: /div[2]/div[2]/div[2]/div[1]/p[95]
|
||||||
|
endOffset: 193
|
||||||
|
startContainer: /div[2]/div[2]/div[2]/div[1]/p[95]
|
||||||
|
startOffset: 0
|
||||||
|
type: RangeSelector
|
||||||
|
- end: 38138
|
||||||
|
start: 37945
|
||||||
|
type: TextPositionSelector
|
||||||
|
- exact: So then to watermark, instead of selecting the next token randomly, the
|
||||||
|
idea will be to select it pseudorandomly, using a cryptographic pseudorandom
|
||||||
|
function, whose key is known only to OpenAI.
|
||||||
|
prefix: 'of output tokens) each time.
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
'
|
||||||
|
suffix: " That won\u2019t make any detectable"
|
||||||
|
type: TextQuoteSelector
|
||||||
|
source: https://scottaaronson.blog/?p=6823
|
||||||
|
text: Watermarking by applying cryptographic pseudorandom functions to the model
|
||||||
|
output instead of true random (true pseudo-random)
|
||||||
|
updated: '2022-12-19T14:55:52.384335+00:00'
|
||||||
|
uri: https://scottaaronson.blog/?p=6823
|
||||||
|
user: acct:ravenscroftj@hypothes.is
|
||||||
|
user_info:
|
||||||
|
display_name: James Ravenscroft
|
||||||
|
in-reply-to: https://scottaaronson.blog/?p=6823
|
||||||
|
tags:
|
||||||
|
- explainability
|
||||||
|
- nlproc
|
||||||
|
- hypothesis
|
||||||
|
type: annotation
|
||||||
|
url: /annotations/2022/12/19/1671461752
|
||||||
|
|
||||||
|
---
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
<blockquote>So then to watermark, instead of selecting the next token randomly, the idea will be to select it pseudorandomly, using a cryptographic pseudorandom function, whose key is known only to OpenAI.</blockquote>Watermarking by applying cryptographic pseudorandom functions to the model output instead of true random (true pseudo-random)
|
Loading…
Reference in New Issue