Hacker News
RC_ITR on May 15, 2023 | on: Brex’s Prompt Engineering Guide
My understanding of both this and Apple's AFT (Attention Free Transformer) is that they are trained with attention, but inference is then done as an RNN.
Is your understanding different?
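To make concrete what I mean by "attention at training time, RNN at inference time", here is a minimal sketch. It is only an illustration under my own assumptions: a simplified causal variant in the spirit of AFT-simple (no position bias, one scalar channel), not Apple's or anyone else's actual implementation. The names `parallel_form` and `recurrent_form` are made up for this example. The point is that the same outputs can be computed either from the whole sequence at once or one token at a time with a fixed-size state.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def parallel_form(q, k, v):
    """Training-style view: each position t is computed independently from
    the full prefix 0..t, so all positions could be evaluated in parallel."""
    out = []
    for t in range(len(q)):
        weights = [math.exp(k[i]) for i in range(t + 1)]  # causal: i <= t
        num = sum(w * v[i] for i, w in enumerate(weights))
        den = sum(weights)
        out.append(sigmoid(q[t]) * num / den)
    return out

def recurrent_form(q, k, v):
    """Inference-style view: one token at a time, carrying only two running
    sums (num, den) as the recurrent state."""
    num, den = 0.0, 0.0
    out = []
    for q_t, k_t, v_t in zip(q, k, v):
        w = math.exp(k_t)
        num += w * v_t   # running weighted sum of values
        den += w         # running sum of weights
        out.append(sigmoid(q_t) * num / den)
    return out

# Arbitrary toy inputs (hypothetical values, chosen only for the demo).
q = [0.1, -0.4, 0.7, 0.2]
k = [0.3, 0.0, -0.5, 1.0]
v = [1.0, 2.0, -1.0, 0.5]

a = parallel_form(q, k, v)
b = recurrent_form(q, k, v)
assert all(math.isclose(x, y, rel_tol=1e-9, abs_tol=1e-12) for x, y in zip(a, b))
```

In the recurrent version the state does not grow with sequence length, which is the property that makes RNN-style inference attractive. Real models add per-channel vectors, learned position or decay terms, and numerical-stability tricks that this sketch leaves out.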