preasket@lemy.lol to

Showerthoughts@lemmy.world · 2 years ago

The problem with AI alignment is that humans aren't aligned

69

The problem with AI alignment is that humans aren't aligned

preasket@lemy.lol to

Showerthoughts@lemmy.world · 2 years ago

I’m sure there are some AI peeps here. Neural networks scale with size because the number of combinations of parameter values that work for a given task scales exponentially (or, even better, factorially if that’s a word???) with the network size. How can such a network be properly aligned when even humans, the most advanced natural neural nets, are not aligned? What can we realistically hope for?

Here’s what I mean by alignment:

Ability to specify a loss function that humanity wants
Some strict or statistical guarantees on the deviation from that loss function as well as potentially unaccounted side effects

Chat

Quatity_Control@lemm.ee
link
fedilink
arrow-up
1·
2 years ago
Align means two very different things here, despite being the same word.
- preasket@lemy.lolOP
  link
  fedilink
  arrow-up
  4·
  2 years ago
  Does it? People act in all sorts of sensible and crazy ways even though the basic principle of operation is the same
  - Quatity_Control@lemm.ee
    link
    fedilink
    arrow-up
    1·
    2 years ago
    What loss function do you want AI to align on?
    
    If I have a language model AI and an AI designed to function as a nurse, what are they going to align on?

Showerthoughts@lemmy.world

showerthoughts@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !showerthoughts@lemmy.world

A “Showerthought” is a simple term used to describe the thoughts that pop into your head while you’re doing everyday things like taking a shower, driving, or just daydreaming. The most popular seem to be lighthearted clever little truths, hidden in daily life.

Here are some examples to inspire your own showerthoughts:

Rules

All posts must be showerthoughts
The entire showerthought must be in the title
No politics
- If your topic is in a grey area, please phrase it to emphasize the fascinating aspects, not the dramatic aspects. You can do this by avoiding overly politicized terms such as “capitalism” and “communism”. If you must make comparisons, you can say something is different without saying something is better/worse.
- A good place for politics is c/politicaldiscussion
Posts must be original/unique
Adhere to Lemmy’s Code of Conduct and the TOS

If you made it this far, showerthoughts is accepting new mods. This community is generally tame so its not a lot of work, but having a few more mods would help reports get addressed a little sooner.

Whats it like to be a mod? Reports just show up as messages in your Lemmy inbox, and if a different mod has already addressed the report, the message goes away and you never worry about it.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

894 users / day
4.11K users / week
6.7K users / month
15.4K users / 6 months
407 local subscribers
39K subscribers
3.13K Posts
96.9K Comments
Modlog