Spaces: zej97 / AI-Research-Assistant

zej97 committed on Aug 3, 2023

Commit 4d7183d • 1 Parent(s): 3eaeddb

Upload folder using huggingface_hub

This view is limited to 50 files because it contains too many changes. See raw diff

Files changed (50)

.gitignore +12 -0
LICENSE +21 -0
README.md +70 -8
__pycache__/aira.cpython-311.pyc +0 -0
__pycache__/aira.cpython-39.pyc +0 -0
__pycache__/app.cpython-311.pyc +0 -0
__pycache__/app.cpython-39.pyc +0 -0
__pycache__/components.cpython-311.pyc +0 -0
__pycache__/home.cpython-311.pyc +0 -0
__pycache__/main.cpython-311.pyc +0 -0
__pycache__/main.cpython-39.pyc +0 -0
__pycache__/style.cpython-311.pyc +0 -0
__pycache__/test.cpython-311.pyc +0 -0
__pycache__/test2.cpython-311.pyc +0 -0
__pycache__/test3.cpython-311.pyc +0 -0
actions/__pycache__/duck_search.cpython-311.pyc +0 -0
actions/__pycache__/google_search.cpython-311.pyc +0 -0
actions/__pycache__/web_scrape.cpython-311.pyc +0 -0
actions/__pycache__/web_scrape.cpython-39.pyc +0 -0
actions/__pycache__/web_search.cpython-311.pyc +0 -0
actions/__pycache__/web_search.cpython-39.pyc +0 -0
actions/duck_search.py +11 -0
actions/google_search.py +63 -0
agent/__init__.py +0 -0
agent/__pycache__/__init__.cpython-311.pyc +0 -0
agent/__pycache__/llm_utils.cpython-311.pyc +0 -0
agent/__pycache__/llm_utils.cpython-39.pyc +0 -0
agent/__pycache__/prompts.cpython-311.pyc +0 -0
agent/__pycache__/prompts.cpython-39.pyc +0 -0
agent/__pycache__/research_agent.cpython-311.pyc +0 -0
agent/__pycache__/research_agent.cpython-39.pyc +0 -0
agent/__pycache__/run.cpython-311.pyc +0 -0
agent/__pycache__/run.cpython-39.pyc +0 -0
agent/__pycache__/toolkits.cpython-311.pyc +0 -0
agent/llm_utils.py +39 -0
agent/prompts.py +132 -0
agent/research_agent.py +109 -0
agent/toolkits.py +15 -0
app.py +81 -0
config/__init__.py +9 -0
config/__pycache__/__init__.cpython-311.pyc +0 -0
config/__pycache__/__init__.cpython-39.pyc +0 -0
config/__pycache__/config.cpython-311.pyc +0 -0
config/__pycache__/config.cpython-39.pyc +0 -0
config/__pycache__/singleton.cpython-311.pyc +0 -0
config/__pycache__/singleton.cpython-39.pyc +0 -0
config/config.py +82 -0
config/singleton.py +24 -0
outputs/Should I invest in the Large Language Model industry in 2023/research--2012672616352147449.txt +1 -0
outputs/What are the most recent advancements in the domain of superconductors as of 2023/research--2821165325009188188.txt +1 -0

.gitignore ADDED Viewed

	@@ -0,0 +1,12 @@

+#Ignore env containing secrets
+.env
+#Ignore Virtual Env
+env/
+#Ignore generated outputs
+outputs/
+#Ignore pycache
+**/__pycache__/
+test*.py
+test/
+flagged/

LICENSE ADDED Viewed

	@@ -0,0 +1,21 @@

+MIT License
+Copyright (c) 2023 Ze Jin
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md CHANGED Viewed

@@ -1,12 +1,74 @@
 ---
-title: AI Research Assistant
-emoji: ⚡
-colorFrom: blue
-colorTo: green
-sdk: gradio
-sdk_version: 3.39.0
 app_file: app.py
-pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: AI-Research-Assistant
 app_file: app.py
+sdk: gradio
+sdk_version: 3.38.0
+---
+<div style="width: 100%;">
+    <img src="./statics/title.svg" style="width: 100%;">
+    <div align="right">
+        <a href="./README.md">English</a> |
+        <a href="./statics/README_zh.md">中文</a>
+    </div>
+</div>
+Inspired by [gpt-researcher](https://github.com/assafelovic/gpt-researcher), this project aims to build an AI research assistant that **generates research reports** for researchers. For instance, a researcher can ask the assistant to write a report on *the latest advancements in the field of superconductors as of 2023*, currently a trending topic. The assistant then compiles a report from relevant information found on the internet. AIRA now also supports **academic English polishing**.
+<!-- make a table -->
+| Image1 | Image2 |
+| :----: | :----: |
+| <img src="./statics/example1-1.png"> | <img src="./statics/example1-2.png"> |
+The currently supported agents encompass a wide range of fields, including *finance, business analysis, clinical medicine, basic medicine, travel, academic research and sociology*.
+In addition to the official API, this project can generate research reports through a third-party API. For access to one, please refer to [chimeragpt](https://chimeragpt.adventblocks.cc/) or [GPT-API-free](https://github.com/chatanywhere/GPT_API_free). Before running the project, set the environment variables `OPENAI_API_KEY` and `OPENAI_API_BASE`.
+```shell
+$ export OPENAI_API_KEY=your_api_key
+$ export OPENAI_API_BASE=your_api_base
+```
+Alternatively, set the API key and base in the `.env` file.
+## Installation
+1. Clone the repository
+    ```shell
+    $ git clone git@github.com:paradoxtown/ai_research_assistant.git
+    $ cd ai_research_assistant
+    ```
+2. Install the dependencies
+    ```shell
+    $ pip install -r requirements.txt
+    ```
+3. Export environment variables
+    ```shell
+    $ export OPENAI_API_KEY=your_api_key
+    $ export OPENAI_API_BASE=your_api_base
+    ```
+    or modify the `.env` file.
+4. Run the project
+    ```shell
+    $ python app.py
+    ```
+## TODO
+- [x] Switch Google Search to DuckDuckGo
+- [ ] Literature review
+- [x] Third-party API
+- [ ] Prettify report
+- [x] Add medical agent and social agent
+- [ ] Add option for users to customize the number of words and temperature
 ---
+<div align="center">Happy researching! 🚀</div>
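The README asks for `OPENAI_API_KEY` and `OPENAI_API_BASE` to be set before launch. A minimal sketch of a pre-flight check is shown here; `missing_env_vars` and `REQUIRED_VARS` are hypothetical names used for illustration and are not part of this commit.

```python
import os

# Variable names taken from the README; the helper itself is an assumption.
REQUIRED_VARS = ("OPENAI_API_KEY", "OPENAI_API_BASE")


def missing_env_vars(env=None, required=REQUIRED_VARS):
    """Return the names of required variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]


if __name__ == "__main__":
    missing = missing_env_vars()
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("Environment looks complete.")
```

Passing a plain dictionary as `env` makes the check easy to exercise without touching the real environment.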

__pycache__/aira.cpython-311.pyc ADDED Viewed

Binary file (4.71 kB). View file

__pycache__/aira.cpython-39.pyc ADDED Viewed

Binary file (2.39 kB). View file

__pycache__/app.cpython-311.pyc ADDED Viewed

Binary file (6.08 kB). View file

__pycache__/app.cpython-39.pyc ADDED Viewed

Binary file (2.64 kB). View file

__pycache__/components.cpython-311.pyc ADDED Viewed

Binary file (164 Bytes). View file

__pycache__/home.cpython-311.pyc ADDED Viewed

Binary file (2.27 kB). View file

__pycache__/main.cpython-311.pyc ADDED Viewed

Binary file (3.84 kB). View file

__pycache__/main.cpython-39.pyc ADDED Viewed

Binary file (1.99 kB). View file

__pycache__/style.cpython-311.pyc ADDED Viewed

Binary file (1.93 kB). View file

__pycache__/test.cpython-311.pyc ADDED Viewed

Binary file (1.23 kB). View file

__pycache__/test2.cpython-311.pyc ADDED Viewed

Binary file (390 Bytes). View file

__pycache__/test3.cpython-311.pyc ADDED Viewed

Binary file (1.04 kB). View file

actions/__pycache__/duck_search.cpython-311.pyc ADDED Viewed

Binary file (970 Bytes). View file

actions/__pycache__/google_search.cpython-311.pyc ADDED Viewed

Binary file (3.87 kB). View file

actions/__pycache__/web_scrape.cpython-311.pyc ADDED Viewed

Binary file (10.6 kB). View file

actions/__pycache__/web_scrape.cpython-39.pyc ADDED Viewed

Binary file (6.73 kB). View file

actions/__pycache__/web_search.cpython-311.pyc ADDED Viewed

Binary file (1.31 kB). View file

actions/__pycache__/web_search.cpython-39.pyc ADDED Viewed

Binary file (769 Bytes). View file

actions/duck_search.py ADDED Viewed

	@@ -0,0 +1,11 @@

+from duckduckgo_search import DDGS
+def duckduckgo_search(query, max_search_result=3):
+    with DDGS() as ddgs:
+        responses = list()
+        for i, r in enumerate(ddgs.text(query, region='wt-wt', safesearch='Off', timelimit='y')):
+            if i == max_search_result:
+                break
+            responses.append(r)
+        return responses
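`duckduckgo_search` caps a lazy result stream with `enumerate` and `break`. The same truncation can be sketched with `itertools.islice`; `take_first` and `fake_results` below are hypothetical stand-ins, and no call to the real search library is made.

```python
from itertools import islice


def take_first(results, max_results=3):
    """Return at most max_results items from any iterable, consuming it lazily."""
    return list(islice(results, max_results))


def fake_results():
    """Hypothetical endless result stream standing in for a search generator."""
    n = 0
    while True:
        n += 1
        yield {"title": f"Result {n}", "href": f"https://example.com/{n}"}


top = take_first(fake_results(), max_results=3)
print([item["title"] for item in top])
```

Because `islice` stops pulling from the generator once the limit is reached, no results beyond the cap are ever fetched.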

actions/google_search.py ADDED Viewed

	@@ -0,0 +1,63 @@

+import requests
+from bs4 import BeautifulSoup
+def get_urls(query, proxies=None):
+    # Pass the query via params so requests URL-encodes spaces and symbols.
+    url = "https://www.google.com/search"
+    headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/94.0.4606.61 Safari/537.36'}
+    response = requests.get(url, params={'q': query}, headers=headers, proxies=proxies)
+    soup = BeautifulSoup(response.content, 'html.parser')
+    results = []
+    for g in soup.find_all('div', class_='g'):
+        anchors = g.find_all('a')
+        if anchors:
+            link = anchors[0]['href']
+            if link.startswith('/url?q='):
+                link = link[7:]
+            if not link.startswith('http'):
+                continue
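+            # Skip result blocks that have no <h3> heading instead of raising AttributeError.
+            heading = g.find('h3')
+            if heading is None:
+                continue
+            title = heading.text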
+            item = {'title': title, 'link': link}
+            results.append(item)
+    return results
+def scrape_text(url, proxies=None) -> str:
+    """Scrape text from a webpage
+    Args:
+        url (str): The URL to scrape text from
+    Returns:
+        str: The scraped text
+    """
+    headers = {
+        'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/94.0.4606.61 Safari/537.36',
+        'Content-Type': 'text/plain',
+    }
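+    try:
+        response = requests.get(url, headers=headers, proxies=proxies, timeout=8)
+        # requests falls back to ISO-8859-1 when no charset is declared; use the detected encoding instead.
+        if response.encoding == "ISO-8859-1":
+            response.encoding = response.apparent_encoding
+    except requests.RequestException:
+        return "Unable to connect to the server"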
+    soup = BeautifulSoup(response.text, "html.parser")
+    for script in soup(["script", "style"]):
+        script.extract()
+    text = soup.get_text()
+    lines = (line.strip() for line in text.splitlines())
+    chunks = (phrase.strip() for line in lines for phrase in line.split("  "))
+    text = "\n".join(chunk for chunk in chunks if chunk)
+    return text
+if __name__ == '__main__':
+    txt = "What is LSTM?"
+    proxies = None
+    urls = get_urls(txt, proxies)
+    max_search_result = 10
+    for url in urls[:max_search_result]:
+        print(url)
+        print(scrape_text(url['link'], proxies))
+        print("\n\n")
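The whitespace cleanup at the end of `scrape_text` can be tried in isolation. This sketch lifts those three lines into a hypothetical helper, `clean_text`, and runs it on a made-up string rather than a fetched page.

```python
def clean_text(raw):
    """Trim each line, split on double spaces, and drop empty fragments."""
    lines = (line.strip() for line in raw.splitlines())
    chunks = (phrase.strip() for line in lines for phrase in line.split("  "))
    return "\n".join(chunk for chunk in chunks if chunk)


# Made-up input resembling text extracted from HTML.
sample = "  Title  \n\n   First part  second part \n\n\n Footer"
print(clean_text(sample))
```

Splitting on two spaces separates fragments that were laid out side by side in the page, so each ends up on its own line.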

agent/__init__.py ADDED Viewed

File without changes

agent/__pycache__/__init__.cpython-311.pyc ADDED Viewed

Binary file (150 Bytes). View file

agent/__pycache__/llm_utils.cpython-311.pyc ADDED Viewed

Binary file (1.66 kB). View file

agent/__pycache__/llm_utils.cpython-39.pyc ADDED Viewed

Binary file (2.95 kB). View file

agent/__pycache__/prompts.cpython-311.pyc ADDED Viewed

Binary file (11.3 kB). View file

agent/__pycache__/prompts.cpython-39.pyc ADDED Viewed

Binary file (9.36 kB). View file

agent/__pycache__/research_agent.cpython-311.pyc ADDED Viewed

Binary file (6.66 kB). View file

agent/__pycache__/research_agent.cpython-39.pyc ADDED Viewed

Binary file (7.01 kB). View file

agent/__pycache__/run.cpython-311.pyc ADDED Viewed

Binary file (729 Bytes). View file

agent/__pycache__/run.cpython-39.pyc ADDED Viewed

Binary file (2.17 kB). View file

agent/__pycache__/toolkits.cpython-311.pyc ADDED Viewed

Binary file (853 Bytes). View file

agent/llm_utils.py ADDED Viewed

	@@ -0,0 +1,39 @@

+from __future__ import annotations
+from typing import Optional
+import openai
+from config import Config
+CFG = Config()
+openai.api_key = CFG.openai_api_key
+openai.api_base = CFG.openai_api_base
+def llm_response(model,
+             messages,
+             temperature: float = CFG.temperature,
+             max_tokens: Optional[int] = None):
+    return openai.ChatCompletion.create(
+            model=model,
+            messages=messages,
+            temperature=temperature,
+            max_tokens=max_tokens,
+        ).choices[0].message["content"]
+def llm_stream_response(model,
+                        messages,
+                        temperature: float = CFG.temperature,
+                        max_tokens: Optional[int] = None):
+    response = ""
+    for chunk in openai.ChatCompletion.create(
+            model=model,
+            messages=messages,
+            temperature=temperature,
+            max_tokens=max_tokens,
+            stream=True,
+    ):
+        content = chunk["choices"][0].get("delta", {}).get("content")
+        if content is not None:
+            response += content
+            yield response
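`llm_stream_response` yields the accumulated text after every chunk rather than the individual deltas. The sketch below reproduces that loop over hand-written chunk dictionaries; `accumulate_stream` and `fake_chunks` are illustrative assumptions, and no API call is made.

```python
def accumulate_stream(chunks):
    """Yield the running text after each chunk that carries content."""
    response = ""
    for chunk in chunks:
        content = chunk["choices"][0].get("delta", {}).get("content")
        if content is not None:
            response += content
            yield response


# Hand-written chunks shaped like the dictionaries the loop above reads.
fake_chunks = [
    {"choices": [{"delta": {"role": "assistant"}}]},  # no content yet
    {"choices": [{"delta": {"content": "Hel"}}]},
    {"choices": [{"delta": {"content": "lo"}}]},
    {"choices": [{"delta": {}}]},  # final chunk without content
]

snapshots = list(accumulate_stream(fake_chunks))
print(snapshots)
```

Yielding cumulative snapshots lets a UI component simply replace its contents on every update instead of appending.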

agent/prompts.py ADDED Viewed

	@@ -0,0 +1,132 @@

+def generate_agent_role_prompt(agent):
+    """ Generates the agent role prompt.
+    Args: agent (str): The type of the agent.
+    Returns: str: The agent role prompt.
+    """
+    prompts = {
+        "Finance Agent": "You are a seasoned finance analyst AI assistant. Your primary goal is to compose comprehensive, astute, impartial, and methodically arranged financial reports based on provided data and trends.",
+        "Travel Agent": "You are a world-travelled AI tour guide assistant. Your main purpose is to draft engaging, insightful, unbiased, and well-structured travel reports on given locations, including history, attractions, and cultural insights.",
+        "Academic Research Agent": "You are an AI academic research assistant. Your primary responsibility is to create thorough, academically rigorous, unbiased, and systematically organized reports on a given research topic, following the standards of scholarly work.",
+        "Business Analyst Agent": "You are an experienced AI business analyst assistant. Your main objective is to produce comprehensive, insightful, impartial, and systematically structured business reports based on provided business data, market trends, and strategic analysis.",
+        "Computer Security Analyst Agent": "You are an AI specializing in computer security analysis. Your principal duty is to generate comprehensive, meticulously detailed, impartial, and systematically structured reports on computer security topics. This includes Exploits, Techniques, Threat Actors, and Advanced Persistent Threat (APT) Groups. All produced reports should adhere to the highest standards of scholarly work and provide in-depth insights into the complexities of computer security.",
+        "Clinical Medicine Agent": "You are an AI specializing in clinical medicine analysis. Your primary role is to compose comprehensive, well-researched, impartial, and methodically organized reports on various aspects of clinical medicine. This includes in-depth studies on medical conditions, treatments, medical advancements, patient care, and healthcare practices. Your reports should follow the highest standards of medical research and provide critical insights into the complexities of the clinical medicine field. Whether it's analyzing medical data, conducting literature reviews, or evaluating the efficacy of medical interventions, your goal is to deliver insightful and evidence-based reports to assist medical professionals and researchers in making informed decisions.",
+        "Basic Medicine Agent": "You are an AI specializing in basic medicine. Your goal is to provide comprehensive, unbiased reports on essential healthcare topics. Deliver clear insights into general health practices, common medical conditions, preventive measures, first aid procedures, and healthy lifestyle choices. Aim to be accessible to non-medical professionals and offer evidence-based recommendations for overall well-being.",
+        "Social Science Research Agent": "You are an AI social science research assistant with a focus on providing comprehensive, well-researched, and unbiased reports on various topics within the social sciences. Your primary goal is to delve into the complexities of human behavior, society, and culture to produce insightful and methodically organized reports. Whether it's sociology, psychology, anthropology, economics, or any other social science discipline, you excel in critically analyzing data, academic literature, and historical trends to offer valuable insights into the subject matter. Your reports are crafted to meet the highest standards of scholarly work, adhering to objectivity and academic rigor while presenting information in a clear and engaging manner. With your expertise, you can delve into societal issues, cultural dynamics, economic trends, and other relevant areas within the realm of social sciences.",
+        "Default Agent": "You are an AI critical thinker research assistant. Your sole purpose is to write well written, critically acclaimed, objective and structured reports on given text."
+    }
+    return prompts.get(agent, "No such agent")
+def generate_report_prompt(question, research_summary):
+    """ Generates the report prompt for the given question and research summary.
+    Args: question (str): The question to generate the report prompt for
+          research_summary (str): The research summary to generate the report prompt for
+    Returns: str: The report prompt for the given question and research summary
+    """
+    return f'"""{research_summary}""" Using the above information, answer the following'\
+           f' question or topic: "{question}" in a detailed report --'\
+           " The report should focus on the answer to the question, should be well structured, informative, detailed" \
+           " in depth, with facts and numbers if available, a minimum of 2,400 words and with markdown syntax and apa format. "\
+            "Write all source urls at the end of the report in apa format."
+def generate_search_queries_prompt(question):
+    """ Generates the search queries prompt for the given question.
+    Args: question (str): The question to generate the search queries prompt for
+    Returns: str: The search queries prompt for the given question
+    """
+    return f'Write 5 google search queries to search online that form an objective opinion from the following: "{question}"\n'\
+           'You must respond with a list of strings in the following json format: {"Q1": query1, "Q2": query2, "Q3": query3, "Q4": query4, "Q5": query5}'
+def generate_resource_report_prompt(question, research_summary):
+    """Generates the resource report prompt for the given question and research summary.
+    Args:
+        question (str): The question to generate the resource report prompt for.
+        research_summary (str): The research summary to generate the resource report prompt for.
+    Returns:
+        str: The resource report prompt for the given question and research summary.
+    """
+    return f'"""{research_summary}""" Based on the above information, generate a bibliography recommendation report for the following' \
+           f' question or topic: "{question}". The report should provide a detailed analysis of each recommended resource,' \
+           ' explaining how each source can contribute to finding answers to the research question.' \
+           ' Focus on the relevance, reliability, and significance of each source.' \
+           ' Ensure that the report is well-structured, informative, in-depth, and follows Markdown syntax.' \
+           ' Include relevant facts, figures, and numbers whenever available.' \
+           ' The report should have a minimum length of 1,200 words.'
+def generate_outline_report_prompt(question, research_summary):
+    """ Generates the outline report prompt for the given question and research summary.
+    Args: question (str): The question to generate the outline report prompt for
+            research_summary (str): The research summary to generate the outline report prompt for
+    Returns: str: The outline report prompt for the given question and research summary
+    """
+    return f'"""{research_summary}""" Using the above information, generate an outline for a research report in Markdown syntax'\
+           f' for the following question or topic: "{question}". The outline should provide a well-structured framework'\
+           ' for the research report, including the main sections, subsections, and key points to be covered.' \
+           ' The research report should be detailed, informative, in-depth, and a minimum of 1,200 words.' \
+           ' Use appropriate Markdown syntax to format the outline and ensure readability.'
+def generate_concepts_prompt(question, research_summary):
+    """ Generates the concepts prompt for the given question.
+    Args: question (str): The question to generate the concepts prompt for
+            research_summary (str): The research summary to generate the concepts prompt for
+    Returns: str: The concepts prompt for the given question
+    """
+    return f'"""{research_summary}""" Using the above information, generate a list of 5 main concepts to learn for a research report'\
+           f' on the following question or topic: "{question}". The outline should provide a well-structured framework. '\
+           'You must respond with a list of strings in the following format: ["concepts 1", "concepts 2", "concepts 3", "concepts 4", "concepts 5"]'
+def generate_lesson_prompt(concept):
+    """
+    Generates the lesson prompt for the given concept.
+    Args:
+        concept (str): The concept to generate the lesson prompt for.
+    Returns:
+        str: The lesson prompt for the given concept.
+    """
+    prompt = f'generate a comprehensive lesson about {concept} in Markdown syntax. This should include the definition '\
+    f'of {concept}, its historical background and development, its applications or uses in different '\
+    f'fields, and notable events or facts related to {concept}.'
+    return prompt
+def get_report_by_type(report_type):
+    report_type_mapping = {
+        'Research Report': generate_report_prompt,
+        'Resource Report': generate_resource_report_prompt,
+        'Outline Report': generate_outline_report_prompt
+    }
+    return report_type_mapping[report_type]
+def generate_english_polishing_prompt(content):
+    """ Generates the english polishing prompt for the given content.
+    Inspired by project gpt_academic
+    Args: content (str): The paragraph to polish
+    Returns: str: The english polishing prompt for the given content
+    """
+    return f'Below is a paragraph from an academic paper. Polish the writing to meet the academic style and improve the spelling, grammar, clarity, concision, and overall readability.  When necessary, rewrite the whole sentence. Furthermore, list all modifications and explain the reasons for doing so in the markdown table. \n {content}'
+def generate_summarize_prompt(content):
+    """ Generates the summarize prompt for the given content.
+    Inspired by project gpt_academic
+    Args: content (str): The crawled text to clean and summarize
+    Returns: str: The summarize prompt for the given content
+    """
+    return f'The following information is crawled from the Internet and will be used in writing the research report. Please clear the junk information and summarize the useful information in depth. Include all factual information, numbers, stats etc if available. \n {content}'
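The prompts in this file are assembled from adjacent string literals, which Python joins with no separator, so a fragment that ends without a trailing space fuses with the next word. The sketch below demonstrates this and one way to avoid it; `join_fragments` is a hypothetical helper, not part of this commit.

```python
def join_fragments(*fragments):
    """Join prompt fragments with single spaces, ignoring stray whitespace."""
    return " ".join(part.strip() for part in fragments if part.strip())


concept = "LSTM"

# Adjacent literals are concatenated as-is, so the two words run together.
implicit = ('This should include the definition'
            f'of {concept}.')

# Joining explicitly restores the word boundary.
explicit = join_fragments('This should include the definition',
                          f'of {concept}.')

print(implicit)
print(explicit)
```

An explicit join keeps spacing correct regardless of how each fragment ends, at the cost of normalizing any deliberate leading or trailing whitespace.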

agent/research_agent.py ADDED Viewed

	@@ -0,0 +1,109 @@

+import json
+from actions.duck_search import duckduckgo_search
+from processing.text import read_txt_files
+from agent.llm_utils import llm_response, llm_stream_response
+from config import Config
+from agent import prompts
+import os
+import string
+CFG = Config()
+class ResearchAgent:
+    def __init__(self, question, agent):
+        """ Initializes the research assistant with the given question.
+            Args: question (str): The question to research
+            Returns: None
+        """
+        self.question = question
+        self.agent = agent
+        self.visited_urls = set()
+        self.search_summary = ""
+        self.directory_name = ''.join(c for c in question if c.isascii() and c not in string.punctuation)[:100]
+        self.dir_path = os.path.dirname(f"./outputs/{self.directory_name}/")
+    def call_agent(self, action):
+        messages = [{
+            "role": "system",
+            "content": prompts.generate_agent_role_prompt(self.agent),
+        }, {
+            "role": "user",
+            "content": action,
+        }]
+        return llm_response(
+                    model=CFG.fast_llm_model,
+                    messages=messages,
+                )
+    def call_agent_stream(self, action):
+        messages = [{
+            "role": "system",
+            "content": prompts.generate_agent_role_prompt(self.agent),
+        }, {
+            "role": "user",
+            "content": action,
+        }]
+        yield from llm_stream_response(
+                model=CFG.fast_llm_model,
+                messages=messages
+            )
+    def create_search_queries(self):
+        """ Creates the search queries for the given question.
+        Args: None
+        Returns: dict[str, str]: The search queries keyed "Q1" to "Q5", as requested in the prompt
+        """
+        result = self.call_agent(prompts.generate_search_queries_prompt(self.question))
+        return json.loads(result)
+    def search_single_query(self, query):
+        """ Runs a DuckDuckGo search for the given query.
+        Args: query (str): The query to search for
+        Returns: list[dict]: Up to three search results for the given query
+        """
+        return duckduckgo_search(query, max_search_result=3)
+    def run_search_summary(self, query):
+        """ Runs the search summary for the given query.
+        Args: query (str): The query to run the search summary for
+        Returns: list[dict]: The search results for the given query, also saved to disk as JSON
+        """
+        responses = self.search_single_query(query)
+        print(f"Searching for {query}")
+        query = hash(query)
+        file_path = f"./outputs/{self.directory_name}/research-{query}.txt"
+        os.makedirs(os.path.dirname(file_path), exist_ok=True)
+        with open(file_path, "w") as f:
+            json.dump(responses, f)
+            print(f"Saved {query} to {file_path}")
+        return responses
+    def search_online(self):
+        """ Conducts the search for the given question.
+        Args: None
+        Returns: str: The search results for the given question
+        """
+        self.search_summary = read_txt_files(self.dir_path) if os.path.isdir(self.dir_path) else ""
+        if not self.search_summary:
+            search_queries = self.create_search_queries()
+            for _, query in search_queries.items():
+                search_result = self.run_search_summary(query)
+                self.search_summary += f"=Query=:\n{query}\n=Search Result=:\n{search_result}\n================\n"
+        return self.search_summary
+    def write_report(self, report_type):
+        """ Writes the report for the given question.
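+        Args: report_type (str): One of the keys accepted by prompts.get_report_by_type
+        Returns: Generator[str]: A status message followed by the report text as it streams in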
+        """
+        yield "Searching online..."
+        report_type_func = prompts.get_report_by_type(report_type)
+        yield from self.call_agent_stream(report_type_func(self.question, self.search_online()))
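`create_search_queries` passes the model's reply straight to `json.loads`, which raises if the reply contains anything besides the JSON object. A more tolerant parse is sketched below under that assumption; `parse_queries` is a hypothetical helper and not part of this commit.

```python
import json
import re


def parse_queries(reply, expected=5):
    """Extract query strings from a reply that should contain one JSON object."""
    # Grab everything from the first "{" to the last "}" in the reply.
    match = re.search(r"\{.*\}", reply, flags=re.DOTALL)
    if match is None:
        return []
    try:
        data = json.loads(match.group(0))
    except json.JSONDecodeError:
        return []
    if not isinstance(data, dict):
        return []
    queries = [value for value in data.values()
               if isinstance(value, str) and value.strip()]
    return queries[:expected]


reply = 'Sure! {"Q1": "LSTM basics", "Q2": "LSTM vs GRU"} Hope this helps.'
print(parse_queries(reply))
```

Returning an empty list on failure lets the caller decide whether to retry the request or fall back to searching with the original question.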

agent/toolkits.py ADDED Viewed

	@@ -0,0 +1,15 @@

+from agent import prompts, llm_utils
+from config import Config
+CFG = Config()
+def english_polishing(content):
+    prompt = prompts.generate_english_polishing_prompt(content)
+    messages = [{
+        "role": "user",
+        "content": prompt,
+    }]
+    yield from llm_utils.llm_stream_response(
+        model=CFG.fast_llm_model,
+        messages=messages)

app.py ADDED Viewed

	@@ -0,0 +1,81 @@

+import gradio as gr
+from config import check_openai_api_key
+from agent.research_agent import ResearchAgent
+from agent.toolkits import english_polishing
+from statics.style import *
+theme = gr.themes.Soft(
+    primary_hue=gr.themes.Color(c100="#e0e7ff", c200="#c7d2fe", c300="#a5b4fc", c400="#818cf8", c50="#eef2ff", c500="#6366f1", c600="#5e5aaa", c700="#4338ca", c800="#3730a3", c900="#312e81", c950="#2b2c5e"),
+    font_mono=[gr.themes.GoogleFont('Fira Code'), 'ui-monospace', 'Consolas', 'monospace']
+)
+check_openai_api_key()
+def run_agent(task, agent, report_type):
+    assistant = ResearchAgent(task, agent)
+    yield from assistant.write_report(report_type)
+with gr.Blocks(theme=gr.themes.Base(),
+               title="AI Research Assistant",
+               css=css) as demo:
+    gr.HTML(top_bar)
+    with gr.Tab(label="Report"):
+        with gr.Column():
+            gr.HTML(research_report_html)
+            research_report = gr.Markdown(value="&nbsp;&nbsp;**Research report will appear here...**",
+                                          elem_classes="output")
+            with gr.Row():
+                agent_type = gr.Dropdown(label="# Agent Type",
+                                         value="Default Agent",
+                                         interactive=True,
+                                         allow_custom_value=False,
+                                         choices=["Default Agent",
+                                                 "Business Analyst Agent",
+                                                 "Finance Agent",
+                                                 "Travel Agent",
+                                                 "Academic Research Agent",
+                                                 "Computer Security Analyst Agent",
+                                                 "Clinical Medicine Agent",
+                                                 "Basic Medicine Agent",
+                                                 "Social Science Research Agent"])
+                report_type = gr.Dropdown(label="# Report Type",
+                                         value="Research Report",
+                                         interactive=True,
+                                         allow_custom_value=False,
+                                         choices=["Research Report",
+                                                 "Resource Report",
+                                                 "Outline Report"])
+            input_box = gr.Textbox(label="# What would you like to research next?", placeholder="Enter your question here")
+            submit_btn = gr.Button("Generate Report")
+            submit_btn.click(run_agent, inputs=[input_box, agent_type, report_type],
+                                        outputs=research_report)
+            gr.Examples(["Should I invest in the Large Language Model industry in 2023?",
+                         "Is it advisable to make investments in the electric car industry during the year 2023?",
+                         "What constitutes the optimal approach for investing in the Bitcoin industry during the year 2023?",
+                         "What are the most recent advancements in the domain of superconductors as of 2023?"],
+                         inputs=input_box)
+    with gr.Tab("English Polishing"):
+        gr.HTML(english_polishing_html)
+        polished_result = gr.Markdown("&nbsp;&nbsp;**Polished result will appear here...**", elem_classes="output")
+        sentences = gr.Textbox(label="# What would you like to polish?", placeholder="Enter your sentence here")
+        with gr.Row():
+            polish_btn = gr.Button("Polish")
+            save_btn = gr.Button("Save")
+        polish_btn.click(english_polishing, inputs=[sentences], outputs=polished_result)
+        def save_result(history, origin, result):
+            history += f"\n**Origin** : {origin}\n\n**Polished Result** : {result}"
+            return history
+        gr.HTML(history_result_html)
+        history_result = gr.Markdown("&nbsp;&nbsp;**History result will appear here...**")
+        save_btn.click(save_result, inputs=[history_result, sentences, polished_result], outputs=history_result)
+    with gr.Tab("Literature Review"):
+        pass
+demo.queue().launch()

config/__init__.py ADDED

	@@ -0,0 +1,9 @@

+from config.config import Config, check_openai_api_key
+from config.singleton import AbstractSingleton, Singleton
+__all__ = [
+    "check_openai_api_key",
+    "AbstractSingleton",
+    "Config",
+    "Singleton",
+]

config/__pycache__/__init__.cpython-311.pyc ADDED

Binary file (407 Bytes).

config/__pycache__/__init__.cpython-39.pyc ADDED

Binary file (334 Bytes).

config/__pycache__/config.cpython-311.pyc ADDED

Binary file (5.13 kB).

config/__pycache__/config.cpython-39.pyc ADDED

Binary file (3.51 kB).

config/__pycache__/singleton.cpython-311.pyc ADDED

Binary file (1.46 kB).

config/__pycache__/singleton.cpython-39.pyc ADDED

Binary file (1.04 kB).

config/config.py ADDED

	@@ -0,0 +1,82 @@

+"""Configuration class that stores settings and flags shared across scripts."""
+import os
+import openai
+from colorama import Fore
+from dotenv import load_dotenv
+from config.singleton import Singleton
+load_dotenv(verbose=True)
+class Config(metaclass=Singleton):
+    """
+    Configuration class that stores settings and flags shared across scripts.
+    """
+    def __init__(self) -> None:
+        """Initialize the Config class"""
+        self.debug_mode = False
+        self.allow_downloads = False
+        self.selenium_web_browser = os.getenv("USE_WEB_BROWSER", "chrome")
+        self.fast_llm_model = os.getenv("FAST_LLM_MODEL", "gpt-3.5-turbo")
+        self.smart_llm_model = os.getenv("SMART_LLM_MODEL", "gpt-4")
+        self.fast_token_limit = int(os.getenv("FAST_TOKEN_LIMIT", 8000))
+        self.smart_token_limit = int(os.getenv("SMART_TOKEN_LIMIT", 8000))
+        self.browse_chunk_max_length = int(os.getenv("BROWSE_CHUNK_MAX_LENGTH", 8192))
+        self.openai_api_key = os.getenv("OPENAI_API_KEY")
+        self.openai_api_base = os.getenv("OPENAI_API_BASE", openai.api_base)
+        self.temperature = float(os.getenv("TEMPERATURE", "1"))
+        self.user_agent = os.getenv(
+            "USER_AGENT",
+            "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/537.36"
+            " (KHTML, like Gecko) Chrome/83.0.4103.97 Safari/537.36",
+        )
+        self.memory_backend = os.getenv("MEMORY_BACKEND", "local")
+        # Initialize the OpenAI API client
+        openai.api_key = self.openai_api_key
+    def set_fast_llm_model(self, value: str) -> None:
+        """Set the fast LLM model value."""
+        self.fast_llm_model = value
+    def set_smart_llm_model(self, value: str) -> None:
+        """Set the smart LLM model value."""
+        self.smart_llm_model = value
+    def set_fast_token_limit(self, value: int) -> None:
+        """Set the fast token limit value."""
+        self.fast_token_limit = value
+    def set_smart_token_limit(self, value: int) -> None:
+        """Set the smart token limit value."""
+        self.smart_token_limit = value
+    def set_browse_chunk_max_length(self, value: int) -> None:
+        """Set the browse_website command chunk max length value."""
+        self.browse_chunk_max_length = value
+    def set_openai_api_key(self, value: str) -> None:
+        """Set the OpenAI API key value and pass it to the openai client."""
+        self.openai_api_key = value
+        openai.api_key = value
+    def set_debug_mode(self, value: bool) -> None:
+        """Set the debug mode value."""
+        self.debug_mode = value
+def check_openai_api_key() -> None:
+    """Check if the OpenAI API key is set in .env or as an environment variable."""
+    cfg = Config()
+    if not cfg.openai_api_key:
+        print(
+            Fore.RED
+            + "Please set your OpenAI API key in .env or as an environment variable."
+        )
+        print("You can get your key from https://platform.openai.com/account/api-keys")
+        raise SystemExit(1)
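
Note on config/config.py: `Config.__init__` reads each setting from the environment once, with a fallback default, and converts numeric values with `int` or `float`. Because `Config` is a singleton, environment changes made after the first `Config()` call are not picked up. A minimal sketch of the same read-with-default pattern follows; the helper name `read_limits` and its `env` parameter are hypothetical and not part of this commit.

```python
import os
from typing import Mapping, Optional


def read_limits(env: Optional[Mapping[str, str]] = None) -> dict:
    """Read two settings with the defaults used in config/config.py.

    `env` is a hypothetical injection point so the lookup can be
    exercised without touching the real process environment.
    """
    source = os.environ if env is None else env
    return {
        # int() accepts both the string from the environment and the int default.
        "fast_token_limit": int(source.get("FAST_TOKEN_LIMIT", 8000)),
        # float("1") yields 1.0, matching the TEMPERATURE default.
        "temperature": float(source.get("TEMPERATURE", "1")),
    }


overridden = read_limits({"FAST_TOKEN_LIMIT": "2000"})
defaults = read_limits({})
```

Passing a plain dict in place of `os.environ` keeps the sketch side-effect free; the committed code calls `os.getenv` directly.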

config/singleton.py ADDED

	@@ -0,0 +1,24 @@

+"""The singleton metaclass for ensuring only one instance of a class."""
+import abc
+class Singleton(abc.ABCMeta, type):
+    """
+    Singleton metaclass for ensuring only one instance of a class.
+    """
+    _instances = {}
+    def __call__(cls, *args, **kwargs):
+        """Call method for the singleton metaclass."""
+        if cls not in cls._instances:
+            cls._instances[cls] = super(Singleton, cls).__call__(*args, **kwargs)
+        return cls._instances[cls]
+class AbstractSingleton(abc.ABC, metaclass=Singleton):
+    """
+    Abstract singleton class for ensuring only one instance of a class.
+    """
+    pass
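
Note on config/singleton.py: the metaclass overrides `__call__`, so the first `Cls(...)` call builds and caches the instance and every later call returns that cached object, ignoring any new constructor arguments. A minimal sketch of that behavior follows. It copies the metaclass from the diff, while the `Settings` class is a hypothetical stand-in for `Config`.

```python
import abc


class Singleton(abc.ABCMeta, type):
    """Metaclass that caches one instance per class, as in config/singleton.py."""

    _instances = {}

    def __call__(cls, *args, **kwargs):
        # Construct the instance only on the first call for this class.
        if cls not in cls._instances:
            cls._instances[cls] = super().__call__(*args, **kwargs)
        return cls._instances[cls]


class Settings(metaclass=Singleton):
    """Hypothetical stand-in for Config, used only for illustration."""

    def __init__(self, temperature: float = 1.0) -> None:
        self.temperature = temperature


first = Settings(temperature=0.2)
# The cached instance is returned; the new argument is silently ignored.
second = Settings(temperature=0.9)
```

This is why the `Config` class exposes setter methods: once the instance exists, calling `Config()` again cannot change its values.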

outputs/Should I invest in the Large Language Model industry in 2023/research--2012672616352147449.txt ADDED

	@@ -0,0 +1 @@

+ [{"title": "Top Investing Trends For 2023 - Forbes Advisor", "href": "https://www.forbes.com/advisor/investing/top-investing-trends-2023/", "body": "With 2022 drawing to a close, the S&P 500 has clawed its way out of bear market territory but remains down 17% as of this writing. As we look ahead to 2023, here are nine investing trends that can ..."}, {"title": "Global Risks Report 2023: the biggest risks facing the world", "href": "https://www.weforum.org/agenda/2023/01/these-are-the-biggest-risks-facing-the-world-global-risks-2023/", "body": "Davos 2023. The World Economic Forum's latest Global Risks Report identifies the key risks facing the world over the next decade. In the next two years, the cost-of-living crisis is seen as the biggest risk, while over the next 10 years environmental risks dominate. The interconnectedness of global risks and crises is giving rise to the threat ..."}, {"title": "Global Risks Report 2023: We know what the risks are - here's what ...", "href": "https://www.weforum.org/agenda/2023/01/global-risks-report-2023-experts-davos2023/", "body": "The urgency of a cost of living crisis dominates 2023's Global Risks Report, which is in danger of deprioritizing other risks. Experts at the World Economic Forum give their insights into how their sectors are seeking to manage risks, build resilience and use new opportunities to shore up defences in 2023."}]

outputs/What are the most recent advancements in the domain of superconductors as of 2023/research--2821165325009188188.txt ADDED

	@@ -0,0 +1 @@

+ [{"title": "Physicists discover a new switch for superconductivity - ScienceDaily", "href": "https://www.sciencedaily.com/releases/2023/06/230622120822.htm", "body": "June 22, 2023 Source: Massachusetts Institute of Technology Summary: A study sheds surprising light on how certain superconductors undergo a 'nematic transition' -- unlocking new,..."}, {"title": "Physicists discover a new switch for superconductivity", "href": "https://news.mit.edu/2023/physicists-discover-new-switch-superconductivity-0622", "body": "June 22, 2023 Press Inquiries Caption When some ultrathin materials undergo a \"nematic transition,\" their atomic lattice structure stretches in ways that unlock superconductivity (as this conceptual image shows). MIT physicists have identified how this essential nematic switch occurs in one class of superconductors. Credits Image: iStock"}, {"title": "New Room-Temperature Superconductor Discovered by Scientists - The New ...", "href": "https://www.nytimes.com/2023/03/08/science/room-temperature-superconductor-ranga-dias.html", "body": "New Room-Temperature Superconductor Discovered by Scientists - The New York Times New Room-Temperature Superconductor Offers Tantalizing Possibilities The breakthrough could one day..."}]