Recent comments in /f/MachineLearning

currentscurrents t1_jdn7spo wrote

Bigger models are more sample-efficient: they squeeze more out of a given amount of data.

Scale is a triangle of three factors: model size, data size, and compute. If you want to make more efficient use of your data, you need to increase the other two.

In practice, LLMs are not data-limited right now; they're limited by compute and model size. That's why you see models like LLaMA that throw huge amounts of data at smaller models.
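
For a rough feel of how the triangle trades off, here's a back-of-envelope sketch using Chinchilla-style heuristics (training compute C ≈ 6·N·D FLOPs, compute-optimal split at roughly 20 training tokens per parameter). The constants are illustrative assumptions, not fitted values:

```python
# Rough sketch of the model/data/compute triangle, using Chinchilla-style
# heuristics: training compute C ~ 6 * N * D FLOPs, and the compute-optimal
# split puts ~20 training tokens per parameter. Constants are illustrative.

def optimal_allocation(compute_flops, tokens_per_param=20.0):
    """Return a rough compute-optimal (params, tokens) for a compute budget."""
    # C = 6 * N * D with D = tokens_per_param * N
    # => N = sqrt(C / (6 * tokens_per_param))
    n_params = (compute_flops / (6.0 * tokens_per_param)) ** 0.5
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens

for c in (1e21, 1e23, 1e25):
    n, d = optimal_allocation(c)
    print(f"compute {c:.0e} FLOPs -> ~{n / 1e9:.0f}B params, ~{d / 1e12:.2f}T tokens")
```

The point is that as the compute budget grows, the optimal model and data sizes grow together; you can't profitably scale one without the other.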

4

pornthrowaway42069l t1_jdn6noe wrote

Not going to deny that GPT-4 looks impressive, but they could set up 10 bajillion-quadrillion parameters; the question is, do they have the data to effectively utilize all of them? Maybe it's time to start looking into decreasing the number of parameters and making more efficient use of the data.
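
For scale, a quick check using the ~20-tokens-per-parameter rule of thumb (an approximation; the parameter counts below are hypothetical) shows how fast the data requirement blows up:

```python
# Back-of-envelope: training data needed to make good use of a given parameter
# count, using the ~20 tokens/parameter rule of thumb. Model sizes are hypothetical.
TOKENS_PER_PARAM = 20

for params in (175e9, 1e12, 10e12):
    tokens_needed = TOKENS_PER_PARAM * params
    print(f"{params / 1e9:,.0f}B params -> ~{tokens_needed / 1e12:.1f}T training tokens")
```

Tens of trillions of curated tokens are a lot harder to come by than more parameters.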

4

ajingnk t1_jdn5uwr wrote

What is the minimum hardware requirement to fine-tune something like Stanford Alpaca? I am thinking of building a workstation to do some DL exploration and fine-tuning work. For fine-tuning, I have around 10k samples.
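
My rough back-of-envelope for GPU memory so far (the bytes-per-parameter counts and the ~0.1% trainable fraction for a LoRA-style run are assumptions, and this ignores activations and framework overhead):

```python
# Very rough GPU memory estimate for fine-tuning. Ignores activations, KV cache,
# and framework overhead; bytes-per-parameter counts are common approximations.

def full_finetune_gb(n_params):
    # fp16 weights (2) + fp16 grads (2) + fp32 Adam moments (4 + 4)
    # + fp32 master weights (4) = 16 bytes per parameter
    return n_params * 16 / 1e9

def lora_finetune_gb(n_params, trainable_fraction=0.001):
    # frozen fp16 base weights (2 bytes/param) plus full optimizer state
    # (16 bytes/param) on the small trainable adapter slice
    return (n_params * 2 + n_params * trainable_fraction * 16) / 1e9

for size in (7e9, 13e9):
    print(f"{size / 1e9:.0f}B model: full ~{full_finetune_gb(size):.0f} GB, "
          f"LoRA ~{lora_finetune_gb(size):.0f} GB (before activations)")
```

If that's in the right ballpark, full fine-tuning a 7B model needs multiple GPUs or heavy offloading, while a LoRA-style run should fit on a single 24 GB card. Does that match people's experience?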

1