mozz@mbin.grits.dev to

Technology@beehaw.org · 9 months ago

"No, seriously. All those things Google couldn't find anymore? Top of the search pile. Queries that generated pages of spam in Google results? Fucking pristine on Kagi – the right answers, over and ov

pluralistic.net

cross-posted to:
[email protected]

161

Pluralistic: Too big to care (04 Apr 2024) – Pluralistic: Daily links from Cory Doctorow

pluralistic.net


Drinvictus@discuss.tchncs.de
arrow-up
81·
edit-2
9 months ago
Downvote me into oblivion but Kagi ain’t shit. It’s a glorified Google frontend. The author is right that the web is filled with AI generated articles and fake reviews and lists but Kagi is not immune to this enshittification.

I even tried the same query the author was bitching about. Here are Kagi’s first two links for top 10 air purifiers. Notice how the first result is a BS website called top10.com and the second one is one of the “fake review” websites.

And here is Google’s. First result is Wirecutter, and this might be subjective but I trust Wirecutter reviews on most things.

Rest of the Google results are exactly what the author was mentioning. But Kagi was no different.

So $10/month to get the same shit? No thank you. I agree that Google turned to shit compared to what it was but it is still the best search engine out there. Now if the article was about privacy concerns then they would have a point. Which is what Kagi is all about anyway. So let’s stop the fucking act.
- Melody Fwygon@beehaw.org
  arrow-up
  39·
  9 months ago
  I pay nothing for running SearXNG locally on my machine.
  - 1984@lemmy.today
    arrow-up
    12·
    9 months ago
    It’s not even comparable in quality. It’s like almost trolling to even suggest they are in the same league. If you don’t want to spend 10 dollars, fine, but maybe stop pretending that your instance is somehow the best quality search engine that exists… :)
    - kattenluik@feddit.nl
      arrow-up
      6·
      9 months ago
      On top of that they’re still paying using their time (and power).
    - Melody Fwygon@beehaw.org
      arrow-up
      2·
      8 months ago
      Your argument clearly shows that you fail to see the benefits of doing it yourself. I get quality results from my local instance due to my persistence and work put in to adjust the settings necessary. I’ve balanced the privacy and functionality of the instance to fit my needs and it costs me nothing but a few minutes of my time each week to do so.
      
      Kagi doing it for $10 a month sounds like they’re turning a neat profit off of you; and you’re refusing to accept that I have achieved levels of search competence that Kagi has without paying for Kagi or even using their free searches or service.
      
      Whether or not it makes sense to you value-wise to pay or not pay for Kagi does not matter in this discussion. It only matters that none of the things Kagi can do that I find useful are things that cannot be done with SearXNG.
  - Drinvictus@discuss.tchncs.de
    arrow-up
    7·
    9 months ago
    That is the way to go.
    - Melody Fwygon@beehaw.org
      arrow-up
      11·
      9 months ago
      The nice thing is that I can customize it however I like too; change weights, choose which engines to pull from always, or even from search to search; so I’m not getting cruft.
      
      SearXNG always rearranges the crap most engines serve to the bottom without fail.
  - Zworf@beehaw.org
    arrow-up
    3·
    9 months ago
    TIL of this. Thanks.
  - Fubarberry@sopuli.xyz
    English
    arrow-up
    2·
    9 months ago
    My understanding is that a locally hosted SearXNG instance doesn’t really give you any privacy, unless you “dilute” your searches by letting others do searches from your instance too.
    - MalReynolds@slrpnk.net
      English
      arrow-up
      5·
      9 months ago
      Route it through a vpn with gluetun and it does…
    - Melody Fwygon@beehaw.org
      arrow-up
      1·
      8 months ago
      To be honest the “Privacy” aspect can be taken care of in other ways; like using a VPN for query dilution, for example. You don’t have to recruit 100 mechanical turks to do junk searches for you; although there are browser addons that can in fact do this automated searching for you…I’ve run them before.
      
      SearXNG is a front-end that protects your privacy still. Hosting it locally dilutes it some; but provides maximal control; as you can use VPNs and control things much more tightly than you could if you hosted it elsewhere.
- Atemu@lemmy.ml
  arrow-up
  26·
  9 months ago
  Your search results look very different to mine:
  
  Did you disable Grouped Results?
  
  All the LLM-generated “top 10” listicles are grouped into one large block I can safely ignore. (I could hide them entirely but the visual grouping allows for easy mental filtering, so I haven’t bothered.) Your weird top10 fake site does not show up.
  
  But yes, as the linked article says, Kagi is primarily a proxy for Google with some extra on top. This is, unfortunately, a feature as Google’s index still reigns supreme for general purpose search. It absolutely is bad and getting worse but sadly still the best you can get. Using only non-Google indices would just result in bad search results.
  The Google-ness is somewhat mitigated by Kagi-exclusive features such as the LLM garbage grouping.
  
  What Google also cannot do is highlighted in my screenshot: You can customise filtering and ranking.
  The first search result is a Reddit thread with some decent discussion because I configured Kagi to prefer Reddit search results. In the case of household appliances, this doesn’t do a whole lot as I have not researched trusted/untrusted sources in this field yet but it’s very noticeable in fields like programming where I have manually ranked sites.
  
  Kagi is not “all about” privacy. It’s a factor, sure but ultimately you still have to trust a U.S. company. Better than “trusting” a known abuser (Google, M$) but without an external audit, I wouldn’t put too much weight into this.
  The index ain’t it either as it’s mostly Google though sometimes a bit better.
  What really sets it apart is the features. Customised ranking as well as blocking some sites outright (bye bye pinterest and userbenchmark) are immensely useful. So are filtering garbage results that Google still likes to return.
  - Drinvictus@discuss.tchncs.de
    arrow-up
    5·
    9 months ago
    I didn’t change any settings. fresh account
    - Atemu@lemmy.ml
      arrow-up
      2·
      9 months ago
      Is “Grouped Results” disabled in settings?
- mozz@mbin.grits.devOP
  arrow-up
  11·
  edit-2
  9 months ago
  I wouldn’t use “air purifier” as a metric, since it was already a big public story, so surely any search engine that’s even half paying attention would have made sure its results for it are good. Probably some other consumer good is better for an un-preannounced test run.
  
  (Also I’m not sure that searching “top 10 air purifier” and complaining that you got a top result of top10.com/air-purifiers and that’s not what you wanted makes a ton of sense. FWIW, I did try “air purifier” just out of curiosity and saw a very clear result that DDG had the best results, Google second, and Kagi third.)
  
  I repeated it for “good wireless router” and saw different results; for them, the outcomes were fairly similar with Kagi somewhat better (returning Wirecutter as the top result, and an obsolete Stack Exchange answer as the 2nd, which okay it’s not right but I get where you’re coming from sir), and Google and DDG as secondary (returning PCMag and CNet at the top and Wirecutter only further down below).
- thejevans@lemmy.ml
  arrow-up
  9·
  9 months ago
  I searched both Cory Doctorow’s post and the linked 404media article in his post for “air purifier” and found nothing. What author are you referencing?
  - Drinvictus@discuss.tchncs.de
    arrow-up
    9·
    9 months ago
    https://pluralistic.net/2024/02/21/im-feeling-unlucky/#not-up-to-the-task
    - thejevans@lemmy.ml
      arrow-up
      3·
      9 months ago
      Thanks.
- AnonStoleMyPants@sopuli.xyz
  arrow-up
  6·
  9 months ago
  Dunno, I got entirely different results. Reddit, homeairguides, forbes, a bunch of listicles like consumerreports, wired, ny times, cnet and whatnot, and other websites.
noodlejetski@lemm.ee
arrow-up
42·
9 months ago
I’m still steering clear of Kagi after how they handled criticism after they started including Brave’s index
- Atemu@lemmy.ml
  arrow-up
  27·
  9 months ago
  That whole situation was such an overblown idiotic mess. Kagi has always used indices from companies that do far more unethical things than committing the extreme crime of having a CEO who has stupid opinions on human rights.
  I 100% agree with Vlad’s response to this whole thing and anyone who thinks otherwise should question what exactly it is they’re criticising.
  
  I don’t like Brave (super shady IMHO) and certainly not their CEO but I didn’t sign up for a 100% ethically correct search engine, I signed up for a search engine with innovative features and good search results. The only viable alternatives are to use 100% not ethically correct search indices with meh (Google) to bad (Bing, DDG) search results. If you’re going to tell me how Google and M$ are somehow ethical, I’m going to have to laugh at you.
  
  The whole argument amounts to whining about the status quo and bashing the one company that tries anything to change it. The only way to get away from the Google monopoly is alternative indices. Yes those alternatives may not be much more ethical than friggin Google. So what.
  - FIash Mob #5678@beehaw.org
    arrow-up
    14·
    edit-2
    9 months ago
    You can’t really engage as a consumer without enabling shitty practices on some level, and that’s particularly true of electronics.
    
    The phone you’re using to access Beehaw? Assembled by child labor or wage slaves somewhere in Asia. Even if you assembled it yourself, the parts were manufactured unethically.
    
    It’s not just Amazon or Nestle. You might as well criticize someone for breathing because unethical consumption, on some level, is inevitable, particularly so if you live in a capitalist country.
    
    I use Brave because its ad block feature works better than the others I’ve tried, plain and simple.
    
    But, by all means, people can still be as holier than thou as they like.
    - noodlejetski@lemm.ee
      arrow-up
      8·
      9 months ago
      
      The phone you’re using to access Beehaw? Assembled by child labor or wage slaves somewhere in Asia. Even if you assembled it yourself, the parts were manufactured unethically.
      
      which is one of the reasons why I own a Fairphone.
      
      and sure, you can’t avoid all bad choices, but everyone draws a line somewhere. and when a techbro makes a techbroy post about how eVErYThiNg iS pOLiTiCiZeD ThESe dAyS and how that’s supposedly stopping innovation, because people like me don’t want him to work with a guy with a history of opposing our rights, then I stop having confidence in him and cancel my subscription because I don’t want to support him financially anymore.
      - FIash Mob #5678@beehaw.org
        arrow-up
        5·
        9 months ago
        Don’t get me wrong. I do the best I can to be ethical in my choices, but it’s just a pet peeve of mine to see people behaving in a holier-than-thou way when it’s simply impossible to achieve what they’re pretending to achieve.
      - TehPers@beehaw.org
        English
        arrow-up
        4·
        edit-2
        9 months ago
        
        people like me don’t want him to work with a guy with a history of oppressing our rights
        
        Maybe I’m misunderstanding, but what search do you use that relies on a more ethical index? I don’t use SearXNG, but as far as I can tell, even something like that relies on other indices, like Google, Bing, and Brave (which seems to be configurable). Are you suggesting you use a different index that comes from a more ethical company, or do you just not like Vlad’s response?
        
        noodlejetski@lemm.ee
        arrow-up
        2·
        9 months ago
        
        Are you suggesting you use a different index that comes from a more ethical company
        
        I don’t, I use a mixture of Startpage and DDG, but I’m always on the lookout for other options and also don’t sign up for a subscription to throw my money at them every month.
  - TehPers@beehaw.org
    English
    arrow-up
    3·
    edit-2
    9 months ago
    To add, from what I understand at least, Kagi does build its own index for accessing smaller sites. To some extent, results are also served by a custom index, meaning some percentage of results do not come from [your disliked companies] and instead come directly from Kagi. It doesn’t seem like a significant percentage of results come from that index, but it supposedly is still >0%.
    
    Personally I mostly use Kagi for the ability to put Reddit on the bottom of the results, MDN to the top, and otherwise prioritize sites in ways that I want but which I know are purely based on my own opinions. It works well for my usecase, and I don’t have to scroll through a bunch of sponsored links before finding my search results. Also, the recent integration with Wolfram|Alpha has been convenient with a couple of searches, like one where I needed the prime factors of some numbers.
- Melody Fwygon@beehaw.org
  arrow-up
  12·
  edit-2
  9 months ago
  I genuinely won’t even use Brave indexes on my SearXNG instance; I have the engines disabled. My search quality has not suffered; as most of my results end up being DDG or Yahoo anyways; and Brave was only ever duplicating results from other engines anyways.
- mozz@mbin.grits.devOP
  arrow-up
  4·
  9 months ago
  Just curious, do you buy things from Amazon?
  
  I’m not trying at all to disagree with the idea of being ethical in how you send your dollars, but I’m curious how much you prioritize actual harm to suffering people in the real world when you do this.
  - noodlejetski@lemm.ee
    arrow-up
    18·
    9 months ago
    
    Just curious, do you buy things from Amazon?
    
    no I don’t. and to answer your next whatabout question, I don’t buy from brands owned by Nestle, either.
    - towerful@programming.dev
      arrow-up
      13·
      9 months ago
      Good work doing what you can!
      Dodging shitty companies is difficult.
    - mozz@mbin.grits.devOP
      arrow-up
      7·
      9 months ago
      Got it, makes sense.
      
      And ha, yeah sorry if it was overly snarky, I just see some people where “oh my GAWD you said the wrong thing in your press release” is their only barometer of ethical behavior and was just wanting to poke at you if that was the case. 🙂
greysemanticist@lemmy.one
English
arrow-up
29·
9 months ago
One of my best monthly expenses. I also appreciate being able to block low-quality domains from my search results.
- Melody Fwygon@beehaw.org
  arrow-up
  10·
  9 months ago
  I can do everything Kagi does for free…using SearXNG.
  - kandykarter@lemmy.ca
    arrow-up
    5·
    edit-2
    9 months ago
    How do you duplicate this feature in SearXNG? https://help.kagi.com/kagi/features/website-info-personalized-results.html
    
    It’s basically the major thing keeping me with Kagi.
    - notfromhere@lemmy.ml
      arrow-up
      5·
      edit-2
      9 months ago
      That seems like a crutch instead of a real feature. I hate even just thinking about having to manage that. What if you want info from sites you do not already know about? Seems like finding new things through search is basically dead anymore.
      - kandykarter@lemmy.ca
        arrow-up
        6·
        edit-2
        9 months ago
        Websites I’ve never heard of come up very frequently. The feature I find highly useful involves a) hiding shitty disinformation websites as I encounter them, or blocking sites like Pinterest, and b) elevating results from websites I like, like having letterboxd results rank higher than IMDB when I search for movies or whatEVER.
        
        Being able to customize my own results to favour what I’m actually looking for is such a crutch.
    - Zworf@beehaw.org
      arrow-up
      4·
      9 months ago
      Isn’t that exactly what the “weight” in SearXNG does?
      - kandykarter@lemmy.ca
        arrow-up
        1·
        9 months ago
        
        weight
        
        Where can I find that setting? I don’t see anything like that anywhere in the UI. If it’s in the config files and not in the UI, that isn’t particularly useful to me.
        
        Zworf@beehaw.org
        arrow-up
        2·
        edit-2
        9 months ago
        It’s in the UI on the Engines tab. However, you can only see it there, you still have to use the config file to actually change it, sorry. That’s not hard at all though.
        
        If you don’t see this option, perhaps you’re running an older version? I’m running the latest docker.
        
        kandykarter@lemmy.ca
        arrow-up
        1·
        9 months ago
        Oh, I think this is different to what I’m talking about. Seems like the weight in SearXNG impacts search engines used, where what I’m talking about is in regards to the actual results. So I could prioritize or deprioritize websites which tend to produce good or bad results, block domains, pin favourites, etc. The example I used elsewhere is like having letterboxd results rank higher than IMDB when I search for movies
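
For illustration, the per-site preferences described here (block a domain, pin a favourite, raise or lower a site) could be sketched as a post-processing step over already-merged results. This is a hypothetical sketch, not how Kagi or SearXNG actually implement anything: the `Result` type, the `BLOCKED`/`PINNED`/`WEIGHTS` tables and the `rerank` function are all made-up names, and the upstream `score` is assumed to be "higher is better".

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Hypothetical per-domain preferences; names and values are illustrative only.
BLOCKED = {"pinterest.com"}      # never show these
PINNED = {"letterboxd.com"}      # always float these to the top
WEIGHTS = {"imdb.com": 0.5}      # multiplier: <1 lowers a site, >1 raises it


@dataclass
class Result:
    url: str
    score: float  # assumed upstream relevance score, higher is better


def domain_of(url: str) -> str:
    """Return the hostname, with a leading 'www.' stripped (no other subdomain handling)."""
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def rerank(results: list[Result]) -> list[Result]:
    """Drop blocked domains, then sort pinned domains first and the rest by weighted score."""
    kept = [r for r in results if domain_of(r.url) not in BLOCKED]

    def sort_key(r: Result):
        d = domain_of(r.url)
        # False sorts before True, so pinned results come first;
        # the negated weighted score puts higher scores earlier.
        return (d not in PINNED, -r.score * WEIGHTS.get(d, 1.0))

    return sorted(kept, key=sort_key)


results = [
    Result("https://www.imdb.com/title/x", 1.0),
    Result("https://letterboxd.com/film/x", 0.4),
    Result("https://pinterest.com/pin/1", 0.9),
    Result("https://example.org/x", 0.6),
]
ranked_urls = [r.url for r in rerank(results)]
```

In this toy run the pinned letterboxd result comes first, the blocked pinterest result is dropped, and IMDB falls below example.org because its score is halved. The point is that the preference applies to result domains, not to which upstream engines get queried.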
        
        Zworf@beehaw.org
        arrow-up
        1·
        9 months ago
        Ah I see. I didn’t really understand the requirement. That would indeed be a nice one though pretty hard to configure for general search because the results can come from so many sources.
        
        As well as that, for special-purpose things like movies it does in fact have a ranking for those by querying common sites like IMDB directly as an engine. So in that case you can use the weighting system to show preference. It doesn’t seem to support letterboxd as a source but it does some others:
  - Steve@communick.news
    English
    arrow-up
    3·
    9 months ago
    So I only have to find one thing Kagi can do, that SearXNG can’t?
    Then what?
    - Melody Fwygon@beehaw.org
      arrow-up
      4·
      9 months ago
      SearXNG is probably at the very core of Kagi; with a bunch of extra code and UI on top of it. Bangs work as well as lenses, including external bangs. If you can dream of an engine; you can use it with SearXNG; creating an engine with it is documented well.
      
      I promise any single thing Kagi can do that SearXNG can’t or won’t; won’t matter. SearXNG is superior to Kagi in every way by cost, and by being free, open source, and functionally all you need.
      - Steve@communick.news
        English
        arrow-up
        4·
        9 months ago
        So it doesn’t actually matter if Kagi can do something SearXNG can’t.
        It’s cheaper. No argument with that.
      - Zworf@beehaw.org
        arrow-up
        1·
        9 months ago
        Yeah I always thought Kagi used that money to build their complete own index.
        
        Now I hear that their index is only tiny and it’s mostly just delegated searches? Why does it cost so much then?
  - greysemanticist@lemmy.one
    English
    arrow-up
    1·
    9 months ago
    I have the big SearXNG portal bookmarked ( https://searx.space/ ) but I don’t find that I ever reach for it that often. Not being able to cull lower quality sites is just a little bit of extra toil I’m happy to pay to go away.
    - Melody Fwygon@beehaw.org
      arrow-up
      1·
      8 months ago
      You do have to host it yourself or run your own personal instance to get the power of SearXNG; if you’ve not tried this, please do not write it off.
      
      If hosting it yourself or even running it locally in a container on your machine at home is too technical for you; nobody is going to blame you for that. In fact there are several guides and videos out there that might help you if you’re inclined to learn.
      
      If not; you’re also free to continue consuming as you do.
- Imprudent3449@lemm.ee
  arrow-up
  10·
  9 months ago
  I am fucking sick of monthly subs… Happily pay for kagi. It’s really great at just getting you the results you need sorted right at the top.
Danterious@lemmy.dbzer0.com
arrow-up
25·
9 months ago
If kagi is just an aggregate of other search engines why not just use a searx instance instead? It’s open source and customizable.
- kandykarter@lemmy.ca
  arrow-up
  8·
  9 months ago
  
  SearXNG
  
  I’d consider it if they had some of the features Kagi has like raising/lowering/pinning/excluding certain websites from results, but every time I try it it still feels very light on features.
- 1984@lemmy.today
  arrow-up
  8·
  edit-2
  9 months ago
  Why not use a bicycle instead of a car? Because it’s 10 miles to work.
  
  One search engine is a hobby project, the other one competes with Google and gives better results and ability to filter them and prioritize them as you want.
- Steve@communick.news
  English
  arrow-up
  6·
  9 months ago
  Because it’s not just that.
millie@beehaw.org
English
arrow-up
19·
9 months ago
Okay, but it doesn’t know where I am. When I type ‘dunkin’, Google doesn’t just know I want hours for a dunkin donuts, it knows which two or three stores I’m probably looking at hours for and it does it without me having to specify.

If I’m looking stuff up on my phone or just want a quick answer, I actually do want the context of all that data on me. I like that when I type the word ‘glamour’ it knows I’m probably thinking of the bard subclass, and that when I type ‘Conan’ it knows I probably mean Exiles, not O’Brien. I mean like, I know it doesn’t know these things, but it fills in that gap much faster.

I do like the way their search is laid out for doing something more complex, though. It really is a better designed search engine, but I feel like a search engine is the one place I want data collection of some kind, literally because it benefits me.
- mozz@mbin.grits.devOP
  arrow-up
  6·
  9 months ago
  On Kagi you type ‘Dunkin’ and then click ‘maps’
  
  I want to see a screenshot of your ‘glamour’ and ‘Conan’ searches working the way you’re describing
  - millie@beehaw.org
    English
    arrow-up
    2·
    edit-2
    9 months ago
    
    Sounds like an extra step.
    
    Damn. If only you knew how to ask politely.
- flora_explora@beehaw.org
  arrow-up
  2·
  9 months ago
  Wow, it’s been ages since I’ve used google without a layer of privacy in between and haven’t realized how comfortable it would be with all its spying power enabled. But anyways, I find it scary that companies like google try to get so much information about you that they then sell to third parties. I’d rather have less comfortability if it means I have control over my own data. And I guess Kagi could be better in this regard if they value your privacy while still having some data on you.
  - millie@beehaw.org
    English
    arrow-up
    2·
    edit-2
    9 months ago
    Honestly, if I could get Kagi to slurp up all my Google data and use it without any extra clicks, I’d probably switch. I don’t like having it sold to third parties, but it saves a ton of time when used for the actual reason they ought to have it in the first place.
    
    I also don’t imagine that stopping using google would have a tremendous effect on the amount of data gathered on me at this point. Like, I’ve taken my personal projects off of google so they won’t scrape the data, but half the internet is gathering metadata. It’s not going to stop all the sites I visit from gathering it all together. Admittedly, uBlock might in cases where tracking is built into ads.
    
    But like, weighing it against making my ability to move through the world more functional, my spite for Google’s information vacuum isn’t so great that I can’t just like, use the thing anyway. There’s no ethical consumption under capitalism, and I’m not really sure there’s such a thing as a moral or ethical human society. There’s already a lot of other shit I’m forced to tolerate out of necessity just to be a human being, and while I don’t love something else being added to the pile, at least it’s not like tortured animals or intentionally bombing children.
    
    To be clear, I would love an alternative and have actively sought one. Within the past 6 months I’ve tried both DuckDuckGo and Kagi. I stopped using both out of frustration with the need to constantly specify my location.
    - flora_explora@beehaw.org
      arrow-up
      2·
      9 months ago
      Yeah, I get that. While “no ethical consumption under capitalism” shouldn’t be used to justify passivity, each individual person has their own limits to what they can reasonably achieve. Sometimes when I’m traveling and my anxiety peaks, I also eat dairy products/eggs because I cannot mentally afford to search for vegan alternatives. It’s so hard to always keep the balance between doing what you can and trying to stay sane.
      
      I’ve been using startpage for years and don’t really miss the missing location features. But I hardly leave the house anyways.
- mozz@mbin.grits.devOP
  arrow-up
  1·
  9 months ago
  deleted by creator
👍Maximum Derek👍@discuss.tchncs.de
English
arrow-up
19·
9 months ago
I started using Kagi a few months before $10 became unlimited queries.

When I first switched I’d still, occasionally, swap back to google using bangs because I had to unlearn all the hacks I had to make Google turn up useful things. Now I can’t go back, Google is unusable without those hacks. It’s barely usable with them.

Plus Kagi has a “fediverse forum” lens that lets me search Lemmy much more effectively than Lemmy’s search.
- Melody Fwygon@beehaw.org
  arrow-up
  3·
  8 months ago
  SearXNG has fediverse search functionality too.
- greysemanticist@lemmy.one
  English
  arrow-up
  2·
  9 months ago
  Ok, you piqued me: Got a link to a guide on using Kagi for the fediverse?
  - 👍Maximum Derek👍@discuss.tchncs.de
    English
    arrow-up
    1·
    9 months ago
    There’s not much to it. Under settings in Kagi there’s a tab for “Lenses.” Make sure “Fediverse Forums” is turned on.
    
    Then, after you search, you can filter from a broad web search to any of your enabled lenses.
    - greysemanticist@lemmy.one
      English
      arrow-up
      3·
      9 months ago
      Oh that’s awesome. The drop-down arrow “disappeared” with my mental blinders-- I was thinking it was only a toggle for PDFs.
Zworf@beehaw.org
arrow-up
16·
edit-2
9 months ago
10 bucks is too much though for a search engine, at least for me. Especially now that I use LLMs to replace most of the usecases of web searches.

I never used Google much anyway the last few years, I use duckduckgo which isn’t quite as bad as google is now. Yeah I know it’s just Microsoft Bing with a lick of paint but they didn’t enshittify as much as google. But $10 + VAT is just a lot of money in Spain.

Maybe I’ll try the $5 plan though, I never come even close to 300 searches a month anyway.

Edit: SearXNG sounds much better actually, thanks!! <3

Edit2: I installed SearXNG and love it <3 Really thanks for the tips here.
- greysemanticist@lemmy.one
  English
  arrow-up
  5·
  9 months ago
  This is a useful take: I too will use LLMs for search-- but not for search for journal articles with data and evidence. LLMs too easily confabulate these.
  
  LLM-as-search is fantastic when you want a no-bullshit statistical result for what you’re looking for when you’re wanting an overview or interactive tutorial.
  - Ilandar@aussie.zone
    arrow-up
    3·
    9 months ago
    As long as it has footnoting so I can see where each piece of information was sourced from, AI chat has its use cases. Without that I genuinely do not see the point at all. It’s like when people “ask Google” something and just blindly trust the highlighted “answer” as infallible truth. It’s just a really, really bad habit to develop and I wish more people understood this.
    - Zworf@beehaw.org
      arrow-up
      1·
      edit-2
      9 months ago
      Not infallible truth. But very often it’s something that is just for personal use.
      
      Some things I’ve asked it recently were like “Which torch is smaller out of these 5 models?”. Once I find which one I want it’s easy to verify. Or “what does this Spanish expression mean?” or “how do I do …”.
      
      Not everyone uses it to try and write authoritative stuff. And Google is full of clickbaity “comparison sites” that are nothing but fake advertising.
      - Ilandar@aussie.zone
        arrow-up
        1·
        9 months ago
        All of those questions you asked it return authoritative answers which you take on face value, unless you spend extra time fact checking them yourself.
        
        Zworf@beehaw.org
        arrow-up
        1·
        edit-2
        9 months ago
        Yeah but accuracy isn’t a given with the other methods either. If I ask some randos on reddit I won’t get a perfect answer either. If I google specs or reviews online they are often biased, wrong (think the magical Chinese lumens of torches) or even literally fraudulent paid reviews too.
        
        So yeah for me the LLM output is more than good enough with a bit of verification if necessary.
        
        I don’t really understand why people are suddenly hung up about holding LLMs up to this lofty ideal of an unbiased super-truth. Where did that requirement come from all of a sudden? It’s not really realistic and not something we’ve ever had in the past.
        
        I feel the same about self-driving systems. People get all hung up if they crash once in a while, expecting them to be 100% perfect in all situations. But ignoring the concept that they already might be a hell of a lot safer than human drivers. They fail in different situations generally but why do we suddenly demand perfection?
        
        Ilandar@aussie.zone
        link
        fedilink
        arrow-up
        1·
        edit-2
        9 months ago
        I’m sorry, but citing other examples of bad research practices does not magically make AI reliable. That is a whataboutism.
- pacoboyd@lemm.ee
  link
  fedilink
  arrow-up
  4·
  9 months ago
  Self hosted searxng is where it’s at. Seriously love it and have replaced my search engines on all my computers and phone.
  
  I use this along with Vivaldi browser that will let me switch engines quickly with “search shortcuts” for those few times I need local Google results.
  - Zworf@beehaw.org
    link
    fedilink
    arrow-up
    2·
    9 months ago
    Yep I hosted it on my VPN server so I can reach it from all my devices. Love it. Learned about it here and I’m really happy I did.
  - flora_explora@beehaw.org
    link
    fedilink
    arrow-up
    1·
    9 months ago
    I’d love to do that, too. But I’m a bit overwhelmed with setting it up :/
    - mozz@mbin.grits.devOP
      link
      fedilink
      arrow-up
      3·
      edit-2
      9 months ago
      Plenty of public instances which are probably 100% privacy preserving in practice.
- thegreekgeek@midwest.social
  link
  fedilink
  English
  arrow-up
  2·
  9 months ago
  Stract.com also looks promising.
  - Pixel@beehaw.org
    link
    fedilink
    arrow-up
    2·
    9 months ago
    stract has same issue mentioned above where search engines are actually the only time i want data on me to be easily accessible. not being able to search “food near me” is frustrating, and no privacy-centered google alternative i’ve been happy with has had that feature. im fine with my location and other relevant metadata on me getting used in a search, as long as that metadata is in a black box restricted to me that doesn’t create a profile for advertising companies
    - thegreekgeek@midwest.social
      link
      fedilink
      English
      arrow-up
      1·
      8 months ago
      Yeah that makes sense. My comment had more to do with the potential of an open source search engine/crawler than anything it currently does. Though I feel the optics feature might be able to account for that eventually.
  - Areldyb [he/him]@beehaw.org
    link
    fedilink
    English
    arrow-up
    1·
    9 months ago
    Promising, but not ready for primetime. I spent the last two days using it as my phone’s default search after you mentioned it, and… well, I went back to Google, at least for now.
  - Zworf@beehaw.org
    link
    fedilink
    arrow-up
    1·
    edit-2
    9 months ago
    I put in the name of my home town and the first 2 links were escort sites offering ladies in that town… Seriously.
    
    Not really impressed so far.
Buelldozer@lemmy.today
link
fedilink
arrow-up
10·
9 months ago
How is it that Cory Doctorow hadn’t heard of Kagi until March of 2024? It’s been widely discussed in tech spaces for quite a while now!
- The Doctor@beehaw.org
  link
  fedilink
  English
  arrow-up
  16·
  9 months ago
  Maybe it’s taken him this long to kick the tires and develop an opinion from daily use. There’s nothing wrong with that.
  - Buelldozer@lemmy.today
    link
    fedilink
    arrow-up
    7·
    9 months ago
    Sure but in the article he says that he hadn’t even heard of it until some friends mentioned it “last month”, which would have been March of 2024. Taking a few weeks to feel it out is one thing but to have not even known it existed until last month is wild.
    - The Doctor@beehaw.org
      link
      fedilink
      English
      arrow-up
      15·
      9 months ago
      We don’t know what Cory does all day. We know he has a family, I think he has a kid, that means that he has responsibilities that don’t involve blogging. For all we know, at the end of the day he curls up with a dead tree book and unplugs to relax. He might not be as online as his overall style might make him appear and we don’t know what all circles of people he runs with, so it’s entirely possible that he just heard about it.
      - FarceOfWill@infosec.pub
        link
        fedilink
        arrow-up
        8·
        9 months ago
        He writes all day. He can’t have time to listen to anyone with his volume of output :D
        
        pop@lemmy.ml
        link
        fedilink
        arrow-up
        1·
        9 months ago
        Writing also needs a vast amount of research, which involves a search engine, and at some point you get fed up with the spam, and maybe, just maybe, if you’re actually as smart as people think you are, you begin to look for an alternative.
        
        but I also don’t know what to tell people who believe everything they read on the internet.
      - pop@lemmy.ml
        link
        fedilink
        arrow-up
        4·
        9 months ago
        Exactly, we don’t know what he does all day, and it’s creepy that you have this weird imagination going. Like, seriously? He could just as well be checking the balance that Kagi sent him for this post.
        
        Why are people so obsessed with internet personalities just because they say “nice” things you like? This is what every one of them does.
        
        The Doctor@beehaw.org
        link
        fedilink
        English
        arrow-up
        1·
        9 months ago
        Parasocial relationships are weird.
      - Revan343@lemmy.ca
        link
        fedilink
        arrow-up
        2·
        9 months ago
        
        I think he has a kid
        
        She’d be about 16, so probably fairly independent by now, but not entirely so.
        
        But he also has actual books to write, in addition to blog articles. I’d imagine he’s pretty busy
thejevans@lemmy.ml
link
fedilink
arrow-up
9·
9 months ago
I use Kagi, stract, and a self-hosted searx-ng instance. Kagi is so well polished that it’s what I use most of the time, but I keep an eye on the other two and continually ask myself if I’m ready to drop Kagi to get away from financially supporting Google and Microsoft.
Ilandar@aussie.zone
link
fedilink
arrow-up
9·
9 months ago
No matter how many times people claim “X search engine” is universally better than Google, it has just never been true in my experience. And this is coming from someone who puts up with the frustrations of other search engines to avoid Google’s data harvesting. Like Google’s search engine can have rapidly deteriorated and still be miles better than the competition. Both can be true, but people always seem to act like the SEO spam has made it unusable and that is just not reality at all.
- Echo Dot@feddit.uk
  link
  fedilink
  arrow-up
  4·
  9 months ago
  Most of the time they’re either just Google in a different skin or Bing in a different skin, which is why they are never any better.
anothermember@lemmy.zip
link
fedilink
arrow-up
9·
edit-2
9 months ago

Remember the first time you used Google search? It was like magic. After years of progressively worsening search quality from Altavista and Yahoo, Google was literally stunning, a gateway to the very best things on the internet.

No, I’m not having that! That’s rewriting of history. I remember when Google came out, it was pretty much as good as Altavista and no more. It had the additional appeal that it looked (for the time) unique and fresh and had a weird name, I remember getting my friends to try this “weird new search engine that might someday beat Altavista” but it never revolutionised anything in terms of search results at the time.

Also Altavista was not getting progressively worse, I still remember the days when you could type a simple dictionary word into a search engine and have it return 0 results. Altavista is what changed that, not Google.
- bitwolf@lemmy.one
  link
  fedilink
  arrow-up
  6·
  9 months ago
  I remember the competition on speed. And Google publishing the response time on the results page as a way to showcase its speed over the competitors.
bloup@lemmy.sdf.org
link
fedilink
arrow-up
9·
edit-2
9 months ago
I personally have not found Kagi’s default search results to be all that impressive, contrary to what most users seem to feel. I don’t know. When ddg and Google fail me, I will try Kagi and I think maybe only once or twice has it actually made finding what I’m looking for any easier.

I will mention though, you can do a lot of personalization on the results unlike other engines. So maybe if I took the time to customize, I might feel differently.
- Atemu@lemmy.ml
  link
  fedilink
  arrow-up
  1·
  9 months ago
  
  I personally have not found Kagi’s default search results to be all that impressive
  
  At their worst, they’re as bad as Google’s. For me however, this is a great improvement over using bing/Google proxies which would be the alternative.
  
  maybe if I took the time to customize, I might feel differently.
  
  That’s the killer feature IMHO.
- 1984@lemmy.today
  link
  fedilink
  arrow-up
  1·
  edit-2
  9 months ago
  Well yeah, once your trusted sites are at the top, and all the shitty ones don’t even appear, it’s a very nice feeling to search for anything.
HelixDab2@lemm.ee
link
fedilink
arrow-up
8·
edit-2
9 months ago
Requires a log-in. That means that there’s absolutely no way to anonymize your searches. At least if I want to do an anonymous search, I can open my laptop, boot up in Tails, and search DDG on Tor. With a required log-in (and billing that presumably doesn’t include a Monero option), you can’t make that work.

Pass.
- Atemu@lemmy.ml
  link
  fedilink
  arrow-up
  6·
  edit-2
  9 months ago
  Whether this is bad depends on your threat model. Additionally, you must also consider that other search engines are able to easily identify you without you explicitly identifying yourself. If you can’t fool https://abrahamjuliot.github.io/creepjs/, you certainly can’t fool Google for instance. And that’s even ignoring the immense identifying potential of user behaviour.
  
  Billing supports OpenNode AFAICT which I guess you could funnel your Moneros through but meh.
  
  Edit: Phrasing.
  - HelixDab2@lemm.ee
    link
    fedilink
    arrow-up
    1·
    9 months ago
    When I open Tor browser on my desktop–not in Tails–that site doesn’t even load. So either it is fooling it, or it requires javascript to run, which I have turned off by default in Tor.
    
    I’m going to leave it at that.
- jet@hackertalks.com
  link
  fedilink
  English
  arrow-up
  1·
  9 months ago
  It would be nice to have a Kagi crypto proxy you load with a few monies and can do searches per session.
  
  I.e. no big deal until you close the browser.
Irdial@lemmy.sdf.org
link
fedilink
English
arrow-up
8·
9 months ago
My issue with Kagi is that it relies on aggregate results from other search engine indices
- 👍Maximum Derek👍@discuss.tchncs.de
  link
  fedilink
  English
  arrow-up
  9·
  9 months ago
  So do DDG and a lot of other search engines. In addition to the time and cost of running a spider and maintaining a database (for little to no technological benefit these days), a lot of server admins will block crawlers that aren’t googlebot or msnbot/bingbot.
- Dark Arc@social.packetloss.gg
  link
  fedilink
  English
  arrow-up
  7·
  9 months ago
  It has its own index in addition to aggregating results.
- mozz@mbin.grits.devOP
  link
  fedilink
  arrow-up
  5·
  9 months ago
  Why is that an issue?
  - Irdial@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    9·
    9 months ago
    I don’t get the impression that Kagi intends to compete with major search engines. It is clearly marketed toward privacy-focused, tech-minded individuals. You can take that one of two ways. Either you are frustrated with the erosion of search engine quality due to advertising, or you disagree with the predatory practices, such as data mining, that come along with such advertising. In both cases, the only real way to signal to major search engines that you disagree with these practices is to stop using their services (including their APIs).
    
    For example, I have been using DuckDuckGo for decades. At first, I had to compromise on search result quality, but now it has enough users and support that results are on par with the likes of Google.
    
    I do not think that Kagi is bad or that people should not use it. It simply isn’t for me, because it does not actually address the reasons I do not use search engines like Google.
    - Nia_The_Cat@beehaw.org
      link
      fedilink
      arrow-up
      17·
      edit-2
      9 months ago
      deleted by creator
      - Irdial@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        5·
        9 months ago
        Yes, and I largely disagree with it :/
        
        Atemu@lemmy.ml
        link
        fedilink
        arrow-up
        2·
        9 months ago
        I think you’re underestimating how huge of an undertaking a half-decent search index is, much less a good one.
        
        Nia_The_Cat@beehaw.org
        link
        fedilink
        arrow-up
        2·
        edit-2
        9 months ago
        deleted by creator
        
        thejevans@lemmy.ml
        link
        fedilink
        arrow-up
        4·
        9 months ago
        https://stract.com/ is the new kid on the block
        
        Zworf@beehaw.org
        link
        fedilink
        arrow-up
        2·
        edit-2
        9 months ago
        Lol. I typed the name of my hometown and the two first results were escort sites from that area.
        
        I mean, either it knows me really well and their privacy claims are wrong 🤭 Or it has a funny way of prioritising indexes.
        
        Blaze@lemmy.zip
        link
        fedilink
        arrow-up
        1·
        9 months ago
        Thanks
        
        Nia_The_Cat@beehaw.org
        link
        fedilink
        arrow-up
        1·
        9 months ago
        deleted by creator
        
        Zworf@beehaw.org
        link
        fedilink
        arrow-up
        1·
        9 months ago
        On the other hand, it doesn’t really matter so much anymore.
        
        LLM is the new search. I can ask it the actual question I have and it will give me the answer. If it’s not exactly what I need I can ask it to specify further.
        
        Contrast that with a search engine that just gives me a ton of bookmarks to sift through to see if they actually might answer my question or are just clickbait.
        
        Of course there’s still some times when you need search, like when you need to find an actual website, or when you need a source reference. But really the need for me is greatly reduced now.
        
        TehPers@beehaw.org
        link
        fedilink
        English
        arrow-up
        2·
        9 months ago
        Be careful relying on LLMs for “searching”. I’m speaking from experience here - getting actually accurate results from the current generation of LLMs, even with RAG, is difficult. You might get accurate results most of the time (even 80% or more), but it can be difficult to identify the inaccurate ones, because models present hallucinated output with the same confidence as everything else.
        
        Also, if your LLM isn’t doing retrieval-augmented generation (RAG), then it isn’t actually a search and won’t find results more recent than the data it was trained off of.
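
To make the RAG distinction concrete, here is a toy sketch of the retrieval step only. Everything in it is an assumption for illustration: the three-sentence `DOCUMENTS` list stands in for a real index, word overlap stands in for a real ranking function, and `build_prompt` only shows what would be handed to a model; no actual LLM is called.

```python
import re

# Hypothetical mini-corpus standing in for a real, up-to-date search index.
DOCUMENTS = [
    "Kagi is a paid search engine that requires a login.",
    "SearXNG is a self-hostable metasearch engine.",
    "Stract is an open source search engine with its own crawler.",
]


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query, documents, k=2):
    """Return up to k documents sharing at least one word with the query,
    ordered by how many words they share (a crude stand-in for ranking)."""
    query_words = tokenize(query)
    scored = [(len(query_words & tokenize(doc)), doc) for doc in documents]
    scored = [(score, doc) for score, doc in scored if score > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


def build_prompt(query, documents):
    """Build the text a model would receive, or None if nothing was retrieved.
    Returning None lets the caller say "no sources found" instead of letting
    the model answer from its training data alone."""
    sources = retrieve(query, documents)
    if not sources:
        return None
    numbered = "\n".join(f"[{i}] {doc}" for i, doc in enumerate(sources, 1))
    return (
        "Answer using only these sources and cite them by number.\n"
        f"{numbered}\nQuestion: {query}"
    )
```

The point of the numbered sources is that the answer can be checked against them afterwards; without the retrieval step there is nothing to check against, which is the gap described above.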
    - mozz@mbin.grits.devOP
      link
      fedilink
      arrow-up
      7·
      edit-2
      9 months ago
      I think it’s not that complicated – Kagi’s search results are just far more useful. I think it’s marketed at people who want good search results, not anything dealing with privacy (although, Kagi doesn’t log your searches, so it’s fully private for most everyday definitions) – your viewpoint for you makes perfect sense to me and sure I respect it, but I don’t think it’s right to say that people are linking their credit cards to do a have-to-be-logged-in-first search on Kagi chiefly for reasons of privacy focus.
      
      (I just tried the same experiment Doctorow tried, of searching for something that I’d been unable to find through Google, and Kagi did the same thing for me that it did for him (i.e. found it). That’s actually not important enough for me to pay for Kagi, but “Google is shit now” is no fringe opinion and it’s pretty easy to verify that Kagi does in practice work markedly better.)
some_guy@lemmy.sdf.org
link
fedilink
arrow-up
6·
9 months ago
I’ve wanted to test out Kagi for some time, but I don’t really do a hell of a lot of deep-dive searches anymore. I mostly passively read a combination of RSS, stuff I see on Lemmy, and videos that I still watch on the old site while not logged in. And some YouTube stuff.

When I do search for something, it’s typically something that I hear about in a podcast and I wanna know more and in the moment I just use my default (goog). I’m not thinking of Kagi cause I’m listening to something already and thinking about that.
