Reading the results of a custom stored procedure into a pandas DataFrame

Given:

CREATE PROCEDURE my_procedure
    @Param INT
AS
    SELECT Col1, Col2
    FROM Table
    WHERE Col2 = @Param

I would like to be able to use this as:

import pandas as pd
import pyodbc

query = 'EXEC my_procedure @Param = {0}'.format(my_param)
conn = pyodbc.connect(my_connection_string)

df = pd.read_sql(query, conn)

But this causes an error:

ValueError: Reading a table with read_sql is not supported for a DBAPI2 connection. Use an SQLAlchemy engine or specify an sql query

SQLAlchemy does not work:

import sqlalchemy
engine = sqlalchemy.create_engine(my_connection_string)
df = pd.read_sql(query, engine)

Throws:

ValueError: Could not init table 'my_procedure'

I can actually execute the statement using pyodbc directly:

cursor = conn.cursor()
cursor.execute(query)
results = cursor.fetchall()
df = pd.DataFrame.from_records(results)

Is there a way to send these procedure results directly to a DataFrame?

+6
4 answers

https://code.google.com/p/pyodbc/wiki/StoredProcedures

I am not a Python expert, but SQL Server sometimes returns row counts alongside the result set when executing statements. For example, an UPDATE will tell you how many rows were updated.

Try putting "SET NOCOUNT ON;" at the start of the batch. It suppresses those informational counts, so the only thing returned to the client is the procedure's actual result set.

The same applies to ad-hoc SQL statements sent from Python.
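A sketch of how that fix fits together (my_procedure and my_connection_string come from the question; the ? placeholder is my addition so pyodbc binds the parameter rather than formatting it into the string):

```python
# Prepend SET NOCOUNT ON so SQL Server suppresses the
# "n rows affected" messages and returns only the result set.
sql = "SET NOCOUNT ON; EXEC my_procedure @Param = ?;"

# With a live connection this would be (not opened here):
# import pandas as pd
# import pyodbc
# conn = pyodbc.connect(my_connection_string)
# df = pd.read_sql(sql, conn, params=[my_param])
```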

+3

Use read_sql_query().

As @joris noted in the comments (+1), pandas' SQL support is built on SQLAlchemy, with a fallback mode for SQLite. read_sql() is just a convenience wrapper that dispatches to read_sql_query() or read_sql_table() depending on what you pass it. A string beginning with "EXEC" is not a plain SELECT, so pandas can end up treating it as a table name; call read_sql_query() explicitly instead (read_sql_table() is only for reading whole tables).

import pandas as pd
import sqlalchemy

query = 'EXEC my_procedure @Param = {0}'.format(my_param)
engine = sqlalchemy.create_engine(my_connection_string)
df = pd.read_sql_query(query, engine)
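One caveat with the snippet above: building the statement with str.format invites SQL injection. read_sql_query() takes a params argument that binds values safely instead. A self-contained illustration using an in-memory SQLite database (SQLite stands in here only because it needs no server; the same binding works over pyodbc/SQL Server with ? placeholders):

```python
import sqlite3

import pandas as pd

# In-memory database standing in for SQL Server.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (Col1 TEXT, Col2 INTEGER)")
conn.executemany("INSERT INTO t VALUES (?, ?)", [("a", 1), ("b", 2)])

# Bind the value through params instead of formatting it into the SQL.
df = pd.read_sql_query("SELECT Col1, Col2 FROM t WHERE Col2 = ?", conn,
                       params=(2,))
print(df)
```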
+2

The examples below use a plain ODBC connection (no SQLAlchemy) with pandas 0.14.1 and pyodbc 3.0.7, tested against the AdventureWorks2008R2 sample database.

First, calling the stored procedure with pyodbc alone:

import pandas as pd
import pyodbc
connection = pyodbc.connect(driver='{SQL Server Native Client 11.0}', server='ServerInstance', database='AdventureWorks2008R2', trusted_connection='yes')
sql = "{call dbo.uspGetEmployeeManagers(?)}"
params = (3,)
cursor = connection.cursor()
rows = cursor.execute(sql, params).fetchall()
print(rows)

Output:

[(0, 3, 'Roberto', 'Tamburello', '/1/1/', 'Terri', 'Duffy'), (1, 2, 'Terri', 'Duffy',
'/1/', 'Ken', 'Sánchez')]

And the same call through pandas:

df = pd.read_sql(sql=sql, con=connection, params=params)
print(df)

Output:

   RecursionLevel  BusinessEntityID FirstName    LastName OrganizationNode  \
0               0                 3   Roberto  Tamburello            /1/1/
1               1                 2     Terri       Duffy              /1/

  ManagerFirstName ManagerLastName
0            Terri           Duffy
1              Ken         Sánchez

If you cannot upgrade to pandas 0.14.1, build the DataFrame from the pyodbc rows yourself with pandas.DataFrame.from_records:

# get column names from pyodbc results
columns = [column[0] for column in cursor.description]
df = pd.DataFrame.from_records(rows, columns=columns)
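The same pattern works with any DBAPI cursor; a runnable illustration with SQLite (again only as a stand-in for the pyodbc connection, with hypothetical column names):

```python
import sqlite3

import pandas as pd

conn = sqlite3.connect(":memory:")
cursor = conn.cursor()
cursor.execute("CREATE TABLE emp (FirstName TEXT, LastName TEXT)")
cursor.execute("INSERT INTO emp VALUES ('Terri', 'Duffy')")
cursor.execute("SELECT FirstName, LastName FROM emp")
rows = cursor.fetchall()

# cursor.description holds one 7-tuple per result column;
# element 0 of each tuple is the column name.
columns = [column[0] for column in cursor.description]
df = pd.DataFrame.from_records(rows, columns=columns)
print(df.columns.tolist())  # → ['FirstName', 'LastName']
```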
+1

This worked for me after adding SET NOCOUNT ON, thanks to @CRAFTY DBA:

sql_query = """SET NOCOUNT ON; EXEC db_name.dbo.StoreProc '{0}';""".format(input)

df = pandas.read_sql_query(sql_query, conn)
0
