• 4 Posts
  • 1.29K Comments
Joined 2 years ago
cake
Cake day: March 22nd, 2024

help-circle


  • Vllm is a bit better with parallelization. All the kv cache sits in a single “pool”, and it uses as many slots as will fit. If it gets a bunch of short requests, it does many in parallel. If it gets a long context request, it kinda just does that one.

    You still have to specify a maximum context though, and it is best to set that as low as possible.

    …The catch is it’s quite vram inefficient. But it can split over multiple cards reasonably well, better than llama.cpp can, depending on your PCIe speeds.

    You might try TabbyAPI exl2s as well. It’s very good with parallel calls, thoughts I’m not sure how well it supports MI50s.


    Another thing to tweak is batch size. If you are actually making a bunch of 47K context calls, you can increase the prompt processing batch size a ton to load the MI50 better, and get it to process the prompt faster.


    EDIT: Also, now that I think about it, I’m pretty sure ollama is really dumb with parallelization. Does it even support paged attention batching?

    The llama.cpp server should be much better, eg use less VRAM for each of the “slots” it can utilize.








  • Lulz.

    It’s an interesting coding exercise, though. Trying to (for example) OCR all the documents, or generate a relations graph between the documents or concepts, is a great into to language modeling (which is not prompt engineering, like most seem to think).

    If you’re like a reporter or something, it’s also the obvious way to comb through the documents looking for clues to actually make headlines. I dunno what techniques they use at big outlets, though.



  • Meme finetunes are nothing new.

    As an example, there are DPO datasets with positive/negative examples intended to train LLMs to respond politely and helpfully (as opposed to the negative response). There are some that include toxic comments plucked from the web as negative examples.

    And the immediate community thought was “…What if I reversed them?”


  • I dunno what the ‘writing style’ would end up as. The bulk of the text seems to be formatted like this:

    ...
    10. Is Epstein cooperating with federal suit against Bear Stearns hedge fund managers Ralph Cioffi
    and Matthew Tannin? Will he testify in their cases?
    
    11. Mr Epstein was deposed on this week, on Thursday. Is it true that he answered almost every
    question by invoking his Fifth Amendment rights?
    
    12. Defense attorney Brad Evans has filed a motion to freeze Mr Epstein’s assets. Has Mr.
    Epstein moved his money from the US offshore or abroad, or does he intend to, in order to
    protect his assets from possible damage claims?
    
    13. What did Mr. Epstein do during his work release program while serving time. Reports have
    said he engaged in “scientific research.” If so, what was he researching?
    ...
    
    Response
    
    "That's because it isn't, and everyone here
    (apparently save one) is rational and objective enough
    to understand that. Physical phenomena, and
    phenomena in general, are
    
    ultimately perceptual in nature and subject to
    observational replication - that's why they call
    physics an empirical science. But consciousness is
    not.
    
    Consciousness cannot be objectively, replicably
    observed. Its putative physical correlates, including
    ...
    
    Bill Clinton identified in lawsuit against his former friend and
    pedophile Jeffrey Epstein who had 'regular' orgies at his Caribbean
    compound that the former president visited multiple times
    
    e The former president was friends with Jeffrey Epstein, a financier who was arrested
    in 2008 for soliciting underage prostitutes
    
    e Anew lawsuit has revealed how Clinton took multiple trips to Epstein's private island
    where he 'kept young women as sex slaves'
    
    e Clinton was also apparently friends with a woman who collected naked pictures of
    underage girls for Epstein to choose from
    
    e He hasn't cut ties with that woman, however, and invited her to Chelsea's wedding
    
    e Comes as friends now fear that if Hillary Clinton runs for president in 2016, all of
    their family's old scandals will be brought to the forefront
    
    e Epstein has a host of famous friends including Prince Andrew who stayed at his New
    
    York mansion AFTER his arrest
    By Daily Mail Reporter
    Published: 09:06 EST, 19 March 2014 | Updated: 21:10 EST, 5 January 2015
    

    I’d have to generate prompt/response wrappers too. But it would definitely bring up Trump and Clinton randomly, heh.

    …There are automated metrics to rank English text by reading level, ‘quality’ and such. I guess it could be filtered to most ‘interesting’ emails and reformatted.