fell into a lesswrong rabbit hole after re-reading paul christiano's 'what failure looks like' (it came up at the curve and i am struggling to understand 'loss of control' scenarios) and man all the decision theory stuff just instantly runs into the halting problem right away
determining whether a given program will output a particular symbol for particular inputs is semi-decidable and therefore equivalent to the halting problem. any 'read the other agent's source code'-based decision theory falls apart because of this fact.
in fact any *reflective* decision theory falls apart because of this fact. tdt et. al. only work in situations where agents are not turing complete. not very useful.
the logical uncertainty stuff holds up pretty well though
@azsantosk actually here’s a wild and unsubstantiated take; adding probability takes us from a discrete to a continuous setting and that’s what does away with undecidability that would be such a beautiful theorem
@azsantosk so either we are above or below turing
@azsantosk either omega has hypercomputation (is a wizard) or our agents aren’t very interesting
@azsantosk certainly then i am not capable of self-modification and many other things presupposed by the various decision theories on less wrong dot com
1.05万
32
本页面内容由第三方提供。除非另有说明,欧易不是所引用文章的作者,也不对此类材料主张任何版权。该内容仅供参考,并不代表欧易观点,不作为任何形式的认可,也不应被视为投资建议或购买或出售数字资产的招揽。在使用生成式人工智能提供摘要或其他信息的情况下,此类人工智能生成的内容可能不准确或不一致。请阅读链接文章,了解更多详情和信息。欧易不对第三方网站上的内容负责。包含稳定币、NFTs 等在内的数字资产涉及较高程度的风险,其价值可能会产生较大波动。请根据自身财务状况,仔细考虑交易或持有数字资产是否适合您。