• 2 Posts
  • 32 Comments
Joined 2 years ago
cake
Cake day: June 7th, 2023

help-circle

  • v_krishna@lemmy.mltoAnnouncements@lemmy.mlLemmy AMA March 2025
    link
    fedilink
    English
    arrow-up
    13
    ·
    4 months ago

    This generally goes against security best practices as it can be used for attempted user enumeration. A better version would be “we’ll send you an email with your account status if this user exists” but obviously that results in a fair amount more complexity (and cost) to implement



  • v_krishna@lemmy.mltoOpen Source@lemmy.mlProton's biased article on Deepseek
    link
    fedilink
    English
    arrow-up
    24
    arrow-down
    2
    ·
    5 months ago

    In deep learning generally open source doesn’t include actual training or inference code. Rather it means they publish the model weights and parameters (necessary to run it locally/on your own hardware) and publish academic papers explaining how the model was trained. I’m sure Stallman disagrees but from the standpoint of deep learning research DeepSeek definitely qualifies as an “open source model”

















  • Re thumb-key do you have recommended tutorials for getting comfortable with it? I found trying to do touch typing tutorials didn’t really help, both because they are generally made for desktop environments and they are geared towards qwerty layout (e.g., get comfortable with home row first etc). I tried forcing myself to use it for a full 24 hours as the concept makes a ton of sense to me, but got very frustrated with myself and then dug into the world of which layout to choose, got overwhelmed, and switched back to whatever this qwerty layout that samsung one ui provides on galaxys.


  • I think that is overly simplistic. Embeddings used for LLMs do definitely include a concept of what things mean and the relationship of things to other things.

    E.g., compare the embeddings of Paris, Athens, and London to other cities and they will have small cosine distance between them. Compare France, Greece, and England and same. Then very interestingly, look at Paris - France, Athens - Greece, London - England and you’ll find the resulting vectors all align (fundamentally the vector operation seems to account for the relationship “is the capital of”). Then go a step further, compare those vector to Paris - US, Athens - US, London - Canada. You’ll see the previous set are not aligned with these nearly as much but these are aligned with each other (relationship being something like “is a smaller city in this countrry, named after a famous city in some other country”)

    The way attention works there is a whole bunch of semantic meaning baked into embeddings, and by comparing embeddings you can get to pragmatic meaning as well.