Can you fine-tune on localized steering of an LLM?

hok@lemmy.dbzer0.com · edited 7 days ago

Can you fine-tune on localized steering of an LLM?

iii@mander.xyz · 7 days ago

Would you call token (N+1), given tokens (1 to N), a ground truth?
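
To make the question concrete, here is a minimal sketch of how next-token targets fall out of raw text. It assumes plain whitespace tokenization, and the helper name `next_token_pairs` is hypothetical, not taken from any library. Each target is read straight from the sequence itself, so no human annotation is involved.

```python
def next_token_pairs(tokens):
    """Build (context, target) pairs: tokens 1..N are the context, token N+1 is the target."""
    return [(tokens[:n], tokens[n]) for n in range(1, len(tokens))]


# Assumption: whitespace splitting stands in for a real tokenizer.
tokens = "the cat sat on the mat".split()
pairs = next_token_pairs(tokens)

for context, target in pairs:
    print(context, "->", target)
```

Whether you call that target a "ground truth" is the terminology question being asked here; the sketch only shows where the target comes from.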

hok@lemmy.dbzer0.com · 7 days ago

No, in that case there’s no labelling required. That would be unsupervised learning.

https://en.wikipedia.org/wiki/Unsupervised_learning

Conceptually, unsupervised learning divides into the aspects of data, training, algorithm, and downstream applications. Typically, the dataset is harvested cheaply “in the wild”, such as massive text corpus obtained by web crawling, with only minor filtering (such as Common Crawl). This compares favorably to supervised learning, where the dataset (such as the ImageNet1000) is typically constructed manually, which is much more expensive.

iii@mander.xyz · 7 days ago

So supervised vs unsupervised, according to you, is a property of the dataset?

hok@lemmy.dbzer0.com · 7 days ago

Sorry, I really don’t care to continue talking about the difference between supervised and unsupervised learning. It’s a pattern used to describe how you are doing ML. It’s not a property of a dataset (you wouldn’t call Dataset A “unsupervised”). Read the Wikipedia articles for more details.

iii@mander.xyz · edited 7 days ago

It’s alright :)