用了好久SQLAlchemy,作為Python ORM里的重量級(jí)module,用得很爽。
真是人生苦短,請(qǐng)用Python的一個(gè)極好例子。
原來一直只用了SQLAlchemy基本功能CRUD:增加(Create)、讀取查詢(Retrieve)、更新(Update)和刪除(Delete)。最近在開發(fā)實(shí)際Flask網(wǎng)站時(shí),遇到復(fù)雜的數(shù)據(jù)庫(kù)處理,感覺要從頭好好再深入學(xué)習(xí)一遍。
注: 這是初稿,會(huì)持續(xù)更新(20171027)
官網(wǎng):http://docs.sqlalchemy.org

兩大核心: SQLAlchemy ORM, SQLAlchemy Core
SQLAlchemy ORM
創(chuàng)建基本框架:
(建議復(fù)制到Jupyter Notebook里實(shí)際運(yùn)行和嘗試,加深理解!)
# -*- coding: utf-8 -*-
from sqlalchemy import Sequence, Column, DateTime, String, Integer, ForeignKey, func
from sqlalchemy.orm import relationship, backref
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy import create_engine
# 在內(nèi)存中創(chuàng)建臨時(shí)表
engine = create_engine('sqlite:///:memory:', echo=True)
Base.metadata.create_all(engine)
from sqlalchemy.orm import sessionmaker
session = sessionmaker(bind=engine)
s = session()
定義第一個(gè)表:
Base = declarative_base()
class User(Base):
__tablename__ = 'users'
id = Column(Integer, Sequence('user_id_seq'), primary_key=True)
name = Column(String)
fullname = Column(String)
password = Column(String)
def __repr__(self):
return "<User(name='%s', fullname='%s', password='%s')>" % (
self.name, self.fullname, self.password)
Session操作:
- add
u = User(name='ed', fullname='edward jack', password='23r23sdfa')
s.add(u)
# add之后,query(),已經(jīng)能查到,因?yàn)闀?huì)觸發(fā)Flush
# 沒有commit之前,可以任意修改 u
s.dirty # 查看修改的
s.commit()
- update
class User(Base):
... __tablename__ = 'users'
... id = Column(Integer, primary_key=True)
... name = Column(String)
... fullname = Column(String)
... password = Column(String)
... addresses = relationship("Address", back_populates='user',
... cascade="all, delete, delete-orphan")
class Address(Base):
... __tablename__ = 'addresses'
... id = Column(Integer, primary_key=True)
... email_address = Column(String, nullable=False)
... user_id = Column(Integer, ForeignKey('users.id'))
... user = relationship("User", back_populates="addresses")
- query
s.query(User).filter_by(name='ed').first()
Query API
http://docs.sqlalchemy.org/en/rel_1_1/orm/query.html
Query是最常用的操作,建議深入學(xué),提高Web App訪問數(shù)據(jù)庫(kù)的效率
filter操作符
query.filter(User.name == 'ed')
query.filter(User.name != 'ed')
query.filter(User.name.like('%ed%'))
ColumnOperators.ilike() 忽略大小寫查找
NOT IN:
query.filter(~User.name.in_(['ed', 'wendy', 'jack']))
User.name == None
User.name != None
AND: 等同于:“, ”
or_
is_
isnot
text(): 直接寫SQL
filter(text("id<224")).
order_by(text("id")).all():
get(Primary_key)
Flask-sqlalchemy:
- get_or_404()
- first_or_404()
提高查詢效率:to reduce the number of queries (dramatically, in many cases)
Eager Loading:
- orm.subqueryload()
- Joined Load
- Explicit Join + Eagerload
from sqlalchemy.orm import subqueryload
jack = session.query(User).\
... options(subqueryload(User.addresses)).\
... filter_by(name='jack').one()
jack
#<User(name='jack', fullname='Jack Bean', password='gjffdd')>
jack.addresses
#[<Address(email_address='jack@google.com')>, <Address(email_address='j25@yahoo.com')>]
One to Many:
- joined eager loading only makes sense when the size of the collections are relatively small
- subquery load makes sense when the collections are larger.
Many to One:
- using the default of lazy=’select’ is by far the most efficient way to go
- For a load of objects where there are many possible target references which may have not been loaded already, joined loading with an INNER JOIN is extremely efficient.
Load Only Cols
query = session.query(Book, Author).join(Book.author)
query = query.options(
Load(Book).load_only("summary", "excerpt"),
Load(Author).defer("bio")
)
Relationship Operators 關(guān)系表操作符
Operators:
contains()
==, !=, has()
Query.with_parent()
Join查詢:
先查關(guān)聯(lián)表,然后Join主表,用主表的column再二次過濾
session.query(RelationTable1).filter(XXX).join(MasterTable1).filter(MT1.XXX)
To access data from other tables, join the other tables and pass the desired columns to the add_columns() function.
Employee.query.join(Person).add_columns(Employee.id, Person.name).paginate(...)
Join From
主表是Address,但用關(guān)聯(lián)表User字段過濾
q = session.query(Address).select_from(User).
join(User.addresses).
filter(User.name == 'ed')
直接讀出ourbits_users表+Ob表:order_by(), paginate()
ss.query(ourbits_users, Ob).select_from(User).join(User.ob_seeding).filter(ourbits_users.c.user_id==User.id, ourbits_users.c.ourbits_id==Ob.id).all()
[(1, 56229, 2, 8, 0, 0.0, 9.71, '', '3天06:31:46', '3天7時(shí)', <Ob-56229>), (3, 56369, 1, 0, 0, 0.0, 0.0, '', '1天00:01:40', '2月2天', <Ob-56369>), (3, 42951, 1, 0, 0, 0.0, 0.0, '', '1天00:01:12', '3月14天', <Ob-42951>)]
Many to Many
先定義兩張表(db.Model),再定義第三張表作為M2M的聯(lián)系表。雖然可以用(db.Model) type,而且本地sqlite沒問題。但部署到Heroku postgressql(psycopg)上就出錯(cuò):
2017-10-03T13:09:08.233301+00:00 app[web.1]: [2017-10-03 13:09:08,222] ERROR in app: Exception on /admin/user/edit/ [GET]
2017-10-03T13:09:08.233312+00:00 app[web.1]: Traceback (most recent call last):
2017-10-03T13:09:08.233314+00:00 app[web.1]: File "/app/.heroku/python/lib/python3.6/site-packages/flask/app.py", line 1982, in wsgi_app
2017-10-03T13:09:08.233315+00:00 app[web.1]: response = self.full_dispatch_request()
...
2017-10-03T13:09:08.233853+00:00 app[web.1]: File "/app/.heroku/python/lib/python3.6/site-packages/flask_admin/contrib/sqla/fields.py", line 169, in
iter_choices
2017-10-03T13:09:08.233854+00:00 app[web.1]: yield (pk, self.get_label(obj), obj in self.data)
2017-10-03T13:09:08.233916+00:00 app[web.1]: TypeError: argument of type 'AppenderBaseQuery' is not iterable
查了很久,后來發(fā)現(xiàn),把聯(lián)系表,db.Model改成db.Table定義就好了!
查詢 http://pythoncentral.io/sqlalchemy-faqs/
#To find all the employees in the IT department, we can write it in ORM:
s.query(Employee).filter(Employee.departments.any(Department.name == 'IT')).one().name
#To find marry, i.e., all the employees who belong to at least two departments, we use group_by and having in an ORM query:
from sqlalchemy import func
s.query(Employee).join(Employee.departments).group_by(Employee.id).having(func.count(Department.id) > 1).one().name
http://docs.sqlalchemy.org/en/latest/orm/query.html#sqlalchemy.orm.query.Query
Multiple criteria may be specified as comma separated; the effect is that they will be joined together using the and_() function:
session.query(MyClass).\ filter(MyClass.name == 'some name', MyClass.id > 5)
session.query(MyClass).\
filter_by(name = 'some name', id = 5)
Model:
String to Model對(duì)象:
sort = eval('Ob.%s.%s()' % (sort_field, sort_order))
Table對(duì)象,使用paginate:
pagination = Ob.query.filter(or_(Ob.tag_gf==True, Ob.tag_gffbz==True)).
filter(and_(Ob.ob_seeding.contains(u),not_(Ob.ourbits_ac.contains(u)), not_(Ob.ourbits_as.contains(u)))).\ order_by(sort).paginate(page=int(paras['page']),per_page=int(paras['per_page']),error_out=False)
查找某User做種的種子Ob,一并返回第三表的某些列:
(這時(shí),可以選擇三張表的任意字段排序order_by)
(注1:這里用了join()+group_by(),只會(huì)返回聯(lián)結(jié)表里有值的Ob行。如果想返回所有的Ob行,則用outerjoin(),這樣即可以用第三表排序,又不會(huì)漏掉聯(lián)結(jié)表里無值的Ob行)
(注2:Postgres比MySQL、SQLite嚴(yán)格,很多你本地調(diào)試成功的SQL語句,部署到postgres會(huì)報(bào)錯(cuò)。比如:Group_by時(shí),會(huì)強(qiáng)制把你query的所有表或字段,都需要加進(jìn)來。)
ss.query(Ob.id).join(ourbits_users, User).filter(User.ob_username=='kevinqqnj').group_by(Ob.id).add_columns(User.ob_username, ourbits_users.c.download_size, ourbits_users.c.download_duration).all()
[(56229, 'kevinqqnj', 9.71, ''), (56335, 'kevinqqnj', 4.98, ''), (56369, 'kevinqqnj', 36.61, ''), (61, 'kevinqqnj', 47.7, '')]
查找User和聯(lián)結(jié)表ourbits_users,所有有種子下載<50的User
ss.query(User).join(ourbits_users).group_by(User.id).having(ourbits_users.c.download_size<50).all()
[<User-1 'admin'>, <User-3 'ob643819671'>]
查找種子,綁定聯(lián)結(jié)表ourbits_users,所有種子大小>5的Ob,一并返回第三表的某些列
ss.query(Ob).join(ourbits_users).group_by(Ob.id).having(ourbits_users.c.download_size>5).add_columns(ourbits_users.c.download_size, ourbits_users.c.download_duration).first()
(<Ob-61>, 47.7, '')
查找種子,哪些有多于一人做種(ourbits_users.c.user_id)的:
ss.query(Ob).join(ourbits_users).group_by(Ob.id).having(func.count(ourbits_users.c.user_id)>1).add_columns(ourbits_users.c.download_size, ourbits_users.c.download_duration).all()
[(<Ob-56229>, 0.0, ''), (<Ob-56369>, 0.0, '')]
求和:某個(gè)User,所有認(rèn)領(lǐng)種子的總大?。?/p>
ss.query(func.sum(Ob.size_f), User).group_by(User.id).filter(ourbits_users_ac.c.ourbits_id==Ob.id, ourbits_users_ac.c.user_id==User.id).all()
[(1364.7000000000005, <User-1 'admin'>), (223.86, <User-2 'ob18025968'>)]
Testing:
注意:測(cè)試時(shí),如果Postgres報(bào)錯(cuò)
sqlalchemy.exc.InternalError: (psycopg2.InternalError) current transaction is aborted, commands ignored until end of transaction block
當(dāng)前Shell環(huán)境下,必須重啟Session,不然,不能執(zhí)行query?。?/p>
ss.close()
ss = db.session()
Table:
Table 操作:insert(), update(), select(), delete()
db.session.execute(ourbits_users.delete()) # ==> delete all
db.session.commit()
db.session.query(ourbits_users_as).paginate()
#<flask_sqlalchemy.Pagination object at 0x0000000006C511D0>
db.session.query(ourbits_users_as).filter_by(ourbits_id=60147).paginate().items
db.session.execute(ourbits_users_as.select().order_by(ourbits_users_as.c.ourbits_id.desc())).fetchall()
# [(1, 60150, None), (1, 60149, None), (1, 60148, None), (1, 60147, None), (1, 60146, None)]