Memberships

Data Alchemy

Public • 23.5k • Free

1 contribution to Data Alchemy
Why am I here?
Hi guys, I am a practicing Data Scientist at one of the Big Three. Unfortunately, the title does very little to describe the day-to-day, and I have been looking for ways to quit my job and start out on my own. Well, that's a story for another time.

Currently, I am trying to develop a CSV reader that turns my data into embeddings and then uses LangChain to query an LLM for insights. The theory seems simple enough and there is a lot of material available, so getting it built was not a big deal. But that's where my trouble started.

1) VSCode. I love it, but I am so confused. Whenever I start it, conda doesn't work and I need to activate it every time. Still, that's a minor issue. The bigger problem is that whenever I make a change in one of my custom modules, VSCode behaves as if it never happened until I restart. Please tell me if there is a setting that can solve this.

2) Now coming to the interesting part. The CSV reader I built sucks; it's so bad that I don't have proper words for it. I am beginning to think that the examples on YouTube are cherry-picked. The good news is that I have yet to see it make things up, but it never gets any question right, not even one as simple as: What are the values in the 1st index? I believe a better prompt might be the starting point, but that in itself cannot be the whole story, right? Where do I start to get it from 0% accuracy to anything at this point?

For context, I am using a Kaggle data set as input, a local Llama 2 7B as the LLM, sentence-transformers/all-MiniLM-L6-v2 from Hugging Face for embeddings, and FAISS for the vector store.
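To make the retrieval step of the pipeline above concrete, here is a minimal sketch in plain Python. It is not the code from the post: the CSV contents are made up, a toy bag-of-words vector stands in for all-MiniLM-L6-v2, and a sorted list stands in for FAISS. The function names (`rows_to_chunks`, `embed`, `retrieve`) are hypothetical. The one idea it illustrates is that a question like "values in row 1" can only be matched if the row index is actually part of the text that gets embedded, which may or may not be what is going wrong here.

```python
import csv
import io
import math
import re
from collections import Counter

# Tiny inline CSV standing in for the Kaggle data set (made-up values).
CSV_TEXT = """name,city,score
Alice,Paris,91
Bob,Berlin,78
Cara,Madrid,85
"""

def rows_to_chunks(csv_text):
    """Serialize each row as 'row <i>: col=value, ...' so the index is in the text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        f"row {i}: " + ", ".join(f"{key}={value}" for key, value in row.items())
        for i, row in enumerate(reader)
    ]

def embed(text):
    """Toy bag-of-words vector; a stand-in for a real sentence embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(question, chunks, k=1):
    """Return the k chunks most similar to the question (stand-in for a FAISS search)."""
    query = embed(question)
    ranked = sorted(chunks, key=lambda chunk: cosine(query, embed(chunk)), reverse=True)
    return ranked[:k]

chunks = rows_to_chunks(CSV_TEXT)
top = retrieve("What are the values in row 1?", chunks)
print(top[0])
```

Printing the retrieved chunks before they ever reach the LLM is a cheap way to tell whether a wrong answer comes from retrieval or from the prompt and model.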
New comment Aug '23
1 like • Aug '23
@Dave Ebbelaar I did change some settings in the Workspace section, which seems to have made a difference. I upgraded to Llama 2 13B, but it had no meaningful impact. I have been playing around with the prompt template a bit, but I feel like the whole pipeline is missing something.
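Separately from any editor setting, the "changes to my module are ignored until I restart" symptom from the original post can be reproduced in plain Python. The sketch below assumes the code runs in a long-lived session (a REPL, notebook, or interactive window), which the post does not confirm. The module name `my_custom_module` is hypothetical. Python caches imported modules in `sys.modules`, so a second `import` does not re-read the file; `importlib.reload` does.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Skip .pyc files so the demo always recompiles from the source on disk.
sys.dont_write_bytecode = True

# Create a throwaway module (hypothetical name) in a temp directory.
tmp_dir = tempfile.mkdtemp()
module_path = Path(tmp_dir) / "my_custom_module.py"
module_path.write_text("VALUE = 1\n")
sys.path.insert(0, tmp_dir)

module = importlib.import_module("my_custom_module")
first = module.VALUE

# Edit the file on disk, as an editor would.
module_path.write_text("VALUE = 22\n")

# Importing again returns the cached module object, so the edit is not visible.
stale = importlib.import_module("my_custom_module").VALUE

# Reloading re-executes the source file inside the existing module object.
importlib.invalidate_caches()
importlib.reload(module)
fresh = module.VALUE

print(first, stale, fresh)
```

In IPython-based sessions, the `autoreload` extension (`%load_ext autoreload` followed by `%autoreload 2`) automates the same reload before each cell runs.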
2 likes • Aug '23
@Mattia Vadalà Read your post; it sounds awfully similar to my situation, so it seems like a common problem. Also, I dug around more and found the same thing about the Anaconda variables not being added to the PATH. Apparently, the preferred method is to use the Anaconda Powershell Prompt.
Sarthak Kansal
2
10 points to level up
@sarthak-kansal-9271
Data Scientist, sort of!

Active 485d ago
Joined Aug 21, 2023