Py之PandasAI:PandasAI库的简介、安装、使用方法之详细攻略

Py之PandasAI:PandasAI库的简介、安装、使用方法之详细攻略

目录

PandasAI库的简介

PandasAI库的安装

PandasAI库的使用方法

1、基础用法

(1)、在DataFrame中找到所有列值大于5的行

(2)、绘图

2、进阶用法

(1)、将多个数据框架传递给PandasAI并询问与之相关的问题


PandasAI库的简介

PandasAI被设计为与pandas一起使用。它使pandas具有对数据进行自然语言提问的能力。PandasAI是一个为pandas添加生成式人工智能功能的Python库,pandas是一种流行的数据分析和处理工具。它被设计为与pandas一起使用,并不是其替代品。

GitHub官网GitHub - gventuri/pandas-ai: Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational

PandasAI库的安装

pip install pandasai



pip install -i https://pypi.tuna.tsinghua.edu.cn/simple pandasai

PandasAI库的使用方法

1、基础用法

(1)、在DataFrame中找到所有列值大于5的行

可以向PandasAI询问,在DataFrame中找到所有列值大于5的行,并返回只包含这些行的DataFrame

import pandas as pd
from pandasai import PandasAI

# Sample DataFrame
df = pd.DataFrame({
    "country": ["United States", "United Kingdom", "France", "Germany", "Italy", "Spain", "Canada", "Australia", "Japan", "China"],
    "gdp": [19294482071552, 2891615567872, 2411255037952, 3435817336832, 1745433788416, 1181205135360, 1607402389504, 1490967855104, 4380756541440, 14631844184064],
    "happiness_index": [6.94, 7.16, 6.66, 7.07, 6.38, 6.4, 7.23, 7.22, 5.87, 5.12]
})

# Instantiate a LLM
from pandasai.llm.openai import OpenAI
llm = OpenAI(api_token="YOUR_API_TOKEN")

pandas_ai = PandasAI(llm)
pandas_ai(df, prompt='Which are the 5 happiest countries?')

(2)、绘图

pandas_ai(
    df,
    "Plot the histogram of countries showing for each the gdp, using different colors for each bar",
)

2、进阶用法

(1)、将多个数据框架传递给PandasAI并询问与之相关的问题

import pandas as pd
from pandasai import PandasAI

employees_data = {
    'EmployeeID': [1, 2, 3, 4, 5],
    'Name': ['John', 'Emma', 'Liam', 'Olivia', 'William'],
    'Department': ['HR', 'Sales', 'IT', 'Marketing', 'Finance']
}

salaries_data = {
    'EmployeeID': [1, 2, 3, 4, 5],
    'Salary': [5000, 6000, 4500, 7000, 5500]
}

employees_df = pd.DataFrame(employees_data)
salaries_df = pd.DataFrame(salaries_data)


llm = OpenAI()
pandas_ai = PandasAI(llm)
pandas_ai([employees_df, salaries_df], "Who gets paid the most?")

猜你喜欢

转载自blog.csdn.net/qq_41185868/article/details/131277998