Hi All, I have a noob question.
I have been reading about Alpaca and Alpaca Lora. I have a use case in which I want to fine tune/train Alpaca Lora on a large corpus of books which are in the txt format. I know for Alpaca, the data was in "Instruction : Prompt" format. however, my text is huge and is not in that format. It's simply a library of books and journal articles. I want to be able to ask a question and the model answers based on the books I trained it on. I also want to be able to ask general questions for example which books discussed topic x or y.
I have tried OpenAI's API to create embeddings, but I want to use Alpaca.
I have tried OpenAI's API to create embeddings, but I want to use Alpaca.
I really appreciate your help.