首页 > 代码库 > Python著名的lib和开发框架(均为转载)
Python著名的lib和开发框架(均为转载)
第一,https://github.com/vinta/awesome-python
Awesome Python
A curated list of awesome Python frameworks, libraries, software and resources.
Inspired by awesome-php.
- Awesome Python
- Admin Panels
- Algorithms and Design Patterns
- Anti-spam
- Asset Management
- Audio
- Authentication
- Build Tools
- Caching
- ChatOps Tools
- CMS
- Code Analysis and Linter
- Command-line Tools
- Compatibility
- Computer Vision
- Concurrency and Parallelism
- Configuration
- Cryptography
- Data Analysis
- Data Validation
- Data Visualization
- Database Drivers
- Database
- Date and Time
- Debugging Tools
- Deep Learning
- DevOps Tools
- Distribution
- Documentation
- Downloader
- E-commerce
- Editor Plugins and IDEs
- Environment Management
- Files
- Foreign Function Interface
- Forms
- Functional Programming
- Game Development
- Geolocation
- GUI
- Hardware
- HTML Manipulation
- HTTP
- Imagery
- Implementations
- Interactive Interpreter
- Internationalization
- Job Scheduler
- Logging
- Machine Learning
- MapReduce
- Miscellaneous
- Natural Language Processing
- Network Virtualization
- Networking
- News Feed
- ORM
- Package Management
- Package Repositories
- Permissions
- Processes
- Queue
- RESTful API
- RPC Servers
- Science
- Search
- Serialization
- Serverless Frameworks
- Specific Formats Processing
- Static Site Generator
- Tagging
- Template Engine
- Testing
- Text Processing
- Third-party APIs
- URL Manipulation
- Video
- Web Content Extracting
- Web Crawling
- Web Frameworks
- WebSocket
- WSGI Servers
- Services
- Code Quality
- Continuous Integration
- Resources
- Podcasts
- Websites
- Weekly
- Other Awesome Lists
- Contributing
Admin Panels
Libraries for administrative interfaces.
- Ajenti - The admin panel your servers deserve.
- django-suit - Alternative Django Admin-Interface (free only for Non-commercial use).
- django-xadmin - Drop-in replacement of Django admin comes with lots of goodies.
- flask-admin - Simple and extensible administrative interface framework for Flask.
- flower - Real-time monitor and web admin for Celery.
- Grappelli - A jazzy skin for the Django Admin-Interface.
- Wooey - A Django app which creates automatic web UIs for Python scripts.
Algorithms and Design Patterns
Python implementation of algorithms and design patterns.
- algorithms - A module of algorithms for Python.
- PyPattyrn - A simple yet effective library for implementing common design patterns.
- python-patterns - A collection of design patterns in Python.
- sortedcontainers - Fast, pure-Python implementation of SortedList, SortedDict, and SortedSet types.
Anti-spam
Libraries for fighting spam.
- django-simple-captcha - A simple and highly customizable Django app to add captcha images to any Django form.
- django-simple-spam-blocker - Simple spam blocker for Django.
Asset Management
Tools for managing, compressing and minifying website assets.
- django-compressor - Compresses linked and inline JavaScript or CSS into a single cached file.
- django-pipeline - An asset packaging library for Django.
- django-storages - A collection of custom storage back ends for Django.
- fanstatic - Packages, optimizes, and serves static file dependencies as Python packages.
- fileconveyor - A daemon to detect and sync files to CDNs, S3 and FTP.
- flask-assets - Helps you integrate webassets into your Flask app.
- jinja-assets-compressor - A Jinja extension to compile and compress your assets.
- webassets - Bundles, optimizes, and manages unique cache-busting URLs for static resources.
Audio
Libraries for manipulating audio.
- audiolazy - Expressive Digital Signal Processing (DSP) package for Python.
- audioread - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
- beets - A music library manager and MusicBrainz tagger.
- dejavu - Audio fingerprinting and recognition.
- django-elastic-transcoder - Django + Amazon Elastic Transcoder.
- eyeD3 - A tool for working with audio files, specifically MP3 files containing ID3 metadata.
- id3reader - A Python module for reading MP3 meta data.
- m3u8 - A module for parsing m3u8 file.
- mingus - An advanced music theory and notation package with MIDI file and playback support.
- mutagen - A Python module to handle audio metadata.
- pydub - Manipulate audio with a simple and easy high level interface.
- pyechonest - Python client for the Echo Nest API.
- talkbox - A Python library for speech/signal processing.
- TimeSide - Open web audio processing framework.
- tinytag - A library for reading music meta data of MP3, OGG, FLAC and Wave files.
Authentication
Libraries for implementing authentications schemes.
- OAuth
- Authomatic - Simple but powerful framework agnostic authentication/authorization client.
- django-allauth - Authentication app for Django that "just works."
- django-oauth-toolkit - OAuth 2 goodies for Django.
- Flask-OAuthlib - OAuth 1.0/a, 2.0 implementation of client and provider for Flask.
- OAuthLib - A generic and thorough implementation of the OAuth request-signing logic.
- python-oauth2 - A fully tested, abstract interface to creating OAuth clients and servers.
- python-social-auth - An easy-to-setup social authentication mechanism.
- rauth - A Python library for OAuth 1.0/a, 2.0, and Ofly.
- sanction - A dead simple OAuth2 client implementation.
- Others
- jose - JavaScript Object Signing and Encryption draft implementation.
- PyJWT - Implementation of the JSON Web Token draft 01.
- python-jws - Implementation of JSON Web Signatures draft 02.
- python-jwt - Module for generating and verifying JSON Web Tokens.
Build Tools
Compile software from source code.
- BitBake - A make-like build tool for embedded Linux.
- buildout - A build system for creating, assembling and deploying applications from multiple parts.
- PlatformIO - A console tool to build code with different development platforms.
- PyBuilder - A continuous build tool written in pure Python.
- SCons - A software construction tool.
CMS
Content Management Systems.
- django-cms - An Open source enterprise CMS based on the Django.
- djedi-cms - A lightweight but yet powerful Django CMS with plugins, inline editing and performance in mind.
- FeinCMS - One of the most advanced Content Management Systems built on Django.
- Kotti - A high-level, Pythonic web application framework built on Pyramid.
- Mezzanine - A powerful, consistent, and flexible content management platform.
- Opps - A Django-based CMS for magazines, newspapers websites and portals with high-traffic.
- Plone - A CMS built on top of the open source application server Zope.
- Quokka - Flexible, extensible, small CMS powered by Flask and MongoDB.
- Wagtail - A Django content management system.
- Widgy - Last CMS framework, based on Django.
Caching
Libraries for caching data.
- Beaker - A library for caching and sessions for use with web applications and stand-alone Python scripts and applications.
- DiskCache - SQLite and file backed cache backend with faster lookups than memcached and redis.
- django-cache-machine - Automatic caching and invalidation for Django models.
- django-cacheops - A slick ORM cache with automatic granular event-driven invalidation.
- django-viewlet - Render template parts with extended cache control.
- dogpile.cache - dogpile.cache is next generation replacement for Beaker made by same authors.
- HermesCache - Python caching library with tag-based invalidation and dogpile effect prevention.
- johnny-cache - A caching framework for django applications.
- pylibmc - A Python wrapper around the libmemcached interface.
ChatOps Tools
Libraries for chatbot development.
- Errbot - The easiest and most popular chatbot to implement ChatOps.
Code Analysis and Linter
Libraries and tools for analysing, parsing and manipulation codebases.
- Code Analysis
- coala - Language independent and easily extendable code analysis application.
- code2flow - Turn your Python and JavaScript code into DOT flowcharts.
- pycallgraph - A library that visualises the flow (call graph) of your Python application.
- pysonar2 - A type inferencer and indexer for Python.
- Linter
- Flake8 - The modular source code checker: pep8, pyflakes and co.
- pylama - Code audit tool for Python and JavaScript.
- Pylint - A Fully customizable source code analyzer.
Command-line Tools
Libraries for building command-line application.
- Command-line Application Development
- asciimatics - Cross-platform, full-screen terminal package (i.e. mouse/keyboard input and coloured, positioned text output) complete with high-level API for complex animations and special effects.
- cement - CLI Application Framework for Python.
- click - A package for creating beautiful command line interfaces in a composable way.
- cliff - A framework for creating command-line programs with multi-level commands.
- clint - Python Command-line Application Tools.
- colorama - Cross-platform colored terminal text.
- docopt - Pythonic command line arguments parser.
- Gooey - Turn command line programs into a full GUI application with one line
- python-prompt-toolkit - A Library for building powerful interactive command lines.
- Productivity Tools
- aws-cli - A universal command-line interface for Amazon Web Services.
- bashplotlib - Making basic plots in the terminal.
- caniusepython3 - Determine what projects are blocking you from porting to Python 3.
- cookiecutter - A command-line utility that creates projects from cookiecutters (project templates).
- doitlive - A tool for live presentations in the terminal.
- howdoi - Instant coding answers via the command line.
- httpie - A command line HTTP client, a user-friendly cURL replacement.
- mycli - A Terminal Client for MySQL with AutoCompletion and Syntax Highlighting.
- PathPicker - Select files out of bash output.
- percol - Adds flavor of interactive selection to the traditional pipe concept on UNIX.
- pgcli - Postgres CLI with autocompletion and syntax highlighting.
- SAWS - A Supercharged AWS CLI.
- thefuck - Correcting your previous console command.
- try - A dead simple CLI to try out python packages - It‘s never been easier.
Compatibility
Libraries for migrating from Python 2 to 3.
- Python-Future - The missing compatibility layer between Python 2 and Python 3.
- Python-Modernize - Modernizes Python code for eventual Python 3 migration.
- Six - Python 2 and 3 compatibility utilities.
Computer Vision
Libraries for computer vision.
- OpenCV - Open Source Computer Vision Library.
- pyocr - A wrapper for Tesseract and Cuneiform.
- pytesseract - Another wrapper for Google Tesseract OCR.
- SimpleCV - An open source framework for building computer vision applications.
Concurrency and Parallelism
Libraries for concurrent and parallel execution.
- eventlet - Asynchronous framework with WSGI support.
- gevent - A coroutine-based Python networking library that uses greenlet.
- multiprocessing - (Python standard library) Process-based "threading" interface.
- threading - (Python standard library) Higher-level threading interface.
- Tomorrow - Magic decorator syntax for asynchronous code.
- uvloop - Ultra fast implementation of asyncio event loop on top of libuv.
Configuration
Libraries for storing and parsing configuration options.
- config - Hierarchical config from the author of logging.
- ConfigObj - INI file parser with validation.
- ConfigParser - (Python standard library) INI file parser.
- profig - Config from multiple formats with value conversion.
- python-decouple - Strict separation of settings from code.
Cryptography
- cryptography - A package designed to expose cryptographic primitives and recipes to Python developers.
- hashids - Implementation of hashids in Python.
- Paramiko - A Python (2.6+, 3.3+) implementation of the SSHv2 protocol, providing both client and server functionality.
- Passlib - Secure password storage/hashing library, very high level.
- PyNacl - Python binding to the Networking and Cryptography (NaCl) library.
Data Analysis
Libraries for data analyzing.
- Blaze - NumPy and Pandas interface to Big Data.
- Open Mining - Business Intelligence (BI) in Pandas interface.
- Orange - Data mining, data visualization, analysis and machine learning through visual programming or scripts.
- Pandas - A library providing high-performance, easy-to-use data structures and data analysis tools.
Data Validation
Libraries for validating data. Used for forms in many cases.
- Cerberus - A lightweight and extensible data validation library.
- colander - Validating and deserializing data obtained via XML, JSON, an HTML form post.
- jsonschema - An implementation of JSON Schema for Python.
- schema - A library for validating Python data structures.
- Schematics - Data Structure Validation.
- valideer - Lightweight extensible data validation and adaptation library.
- voluptuous - A Python data validation library.
Data Visualization
Libraries for visualizing data. See: awesome-javascript.
- Altair - Declarative statistical visualization library for Python.
- Bokeh - Interactive Web Plotting for Python.
- ggplot - Same API as ggplot2 for R.
- Matplotlib - A Python 2D plotting library.
- Pygal - A Python SVG Charts Creator.
- PyGraphviz - Python interface to Graphviz.
- PyQtGraph - Interactive and realtime 2D/3D/Image plotting and science/engineering widgets.
- Seaborn - Statistical data visualization using Matplotlib.
- VisPy - High-performance scientific visualization based on OpenGL.
Database
Databases implemented in Python.
- pickleDB - A simple and lightweight key-value store for Python.
- PipelineDB - The Streaming SQL Database.
- TinyDB - A tiny, document-oriented database.
- ZODB - A native object database for Python. A key-value and object graph database.
Database Drivers
Libraries for connecting and operating databases.
- MySQL - awesome-mysql
- mysql-python - The MySQL database connector for Python.
- mysqlclient - mysql-python fork supporting Python 3.
- oursql - A better MySQL connector with support for native prepared statements and BLOBs.
- PyMySQL - Pure Python MySQL driver compatible to mysql-python.
- PostgreSQL
- psycopg2 - The most popular PostgreSQL adapter for Python.
- queries - A wrapper of the psycopg2 library for interacting with PostgreSQL.
- txpostgres - Twisted based asynchronous driver for PostgreSQL.
- Other Relational Databases
- apsw - Another Python SQLite wrapper.
- dataset - Store Python dicts in a database - works with SQLite, MySQL, and PostgreSQL.
- pymssql - A simple database interface to Microsoft SQL Server.
- NoSQL Databases
- cassandra-python-driver - Python driver for Cassandra.
- HappyBase - A developer-friendly library for Apache HBase.
- Plyvel - A fast and feature-rich Python interface to LevelDB.
- py2neo - Python wrapper client for Neo4j‘s restful interface.
- pycassa - Python Thrift driver for Cassandra.
- PyMongo - The official Python client for MongoDB.
- redis-py - The Redis Python Client.
- telephus - Twisted based client for Cassandra.
- txRedis - Twisted based client for Redis.
Date and Time
Libraries for working with dates and times.
- arrow - Better dates & times for Python.
- Chronyk - A Python 3 library for parsing human-written times and dates.
- dateutil - Extensions to the standard Python datetime module.
- delorean - A library for clearing up the inconvenient truths that arise dealing with datetimes.
- moment - A Python library for dealing with dates/times. Inspired by Moment.js.
- Pendulum - Python datetimes made easy.
- PyTime - A easy-use Python module which aims to operate date/time/datetime by string.
- pytz - World timezone definitions, modern and historical. Brings the tz database into Python.
- when.py - Providing user-friendly functions to help perform common date and time actions.
Debugging Tools
Libraries for debugging code.
- pdb-like Debugger
- ipdb - IPython-enabled pdb.
- pdb++ - Another drop-in replacement for pdb.
- pudb - A full-screen, console-based Python debugger.
- remote-pdb - Remote vanilla PDB (over TCP sockets).
- wdb - An improbable web debugger through WebSockets.
- Profiler
- line_profiler - Line-by-line profiling.
- memory_profiler - Monitor Memory usage of Python code.
- profiling - An interactive Python profiler.
- Others
- django-debug-toolbar - Display various debug information for Django.
- django-devserver - A drop-in replacement for Django‘s runserver.
- flask-debugtoolbar - A port of the django-debug-toolbar to flask.
- hunter - Hunter is a flexible code tracing toolkit.
- lptrace - strace for Python programs.
- manhole - Debug service that will accept unix domain socket connections and present the stacktraces for all threads and an interactive prompt.
- pyelftools - Parsing and analyzing ELF files and DWARF debugging information.
- pyringe - Debugger capable of attaching to and injecting code into Python processes.
Deep Learning
Frameworks for Neural Networks and Deep Learning. See: awesome-deep-learning.
- Caffe - A fast open framework for deep learning..
- Keras - A high-level neural networks library and capable of running on top of either TensorFlow or Theano.
- Neupy - Running and testing different Artificial Neural Networks algorithms.
- Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration.
- TensorFlow - The most popular Deep Learning framework created by Google.
- Theano - A library for fast numerical computation.
DevOps Tools
Software and libraries for DevOps.
- Ansible - A radically simple IT automation platform.
- Cloud-Init - A multi-distribution package that handles early initialization of a cloud instance.
- cuisine - Chef-like functionality for Fabric.
- Docker Compose - Fast, isolated development environments using Docker.
- Fabric - A simple, Pythonic tool for remote execution and deployment.
- Fabtools - Tools for writing awesome Fabric files.
- honcho - A Python clone of Foreman, for managing Procfile-based applications.
- OpenStack - Open source software for building private and public clouds.
- pexpect - Controlling interactive programs in a pseudo-terminal like GNU expect.
- psutil - A cross-platform process and system utilities module.
- SaltStack - Infrastructure automation and management system.
- supervisor - Supervisor process control system for UNIX.
Distribution
Libraries to create packaged executables for release distribution.
- dh-virtualenv - Build and distribute a virtualenv as a Debian package.
- Nuitka - Compile scripts, modules, packages to an executable or extension module.
- py2app - Freezes Python scripts (Mac OS X).
- py2exe - Freezes Python scripts (Windows).
- PyInstaller - Converts Python programs into stand-alone executables (cross-platform).
- pynsist - A tool to build Windows installers, installers bundle Python itself.
Documentation
Libraries for generating project documentation.
- Sphinx - Python Documentation generator.
- awesome-sphinxdoc
- MkDocs - Markdown friendly documentation generator.
- pdoc - Epydoc replacement to auto generate API documentation for Python libraries.
- Pycco - The literate-programming-style documentation generator.
Downloader
Libraries for downloading.
- s3cmd - A command line tool for managing Amazon S3 and CloudFront.
- s4cmd - Super S3 command line tool, good for higher performance.
- you-get - A YouTube/Youku/Niconico video downloader written in Python 3.
- youtube-dl - A small command-line program to download videos from YouTube.
E-commerce
Frameworks and libraries for e-commerce and payments.
- alipay - Unofficial Alipay API for Python.
- Cartridge - A shopping cart app built using the Mezzanine.
- django-oscar - An open-source e-commerce framework for Django.
- django-shop - A Django based shop system.
- merchant - A Django app to accept payments from various payment processors.
- money - Money class with optional CLDR-backed locale-aware formatting and an extensible currency exchange solution.
- python-currencies - Display money format and its filthy currencies.
- shoop - An open source E-Commerce platform based on Django.
Editor Plugins and IDEs
- Emacs
- Elpy - Emacs Python Development Environment.
- Sublime Text
- Anaconda - Anaconda turns your Sublime Text 3 in a full featured Python development IDE.
- SublimeJEDI - A Sublime Text plugin to the awesome auto-complete library Jedi.
- Vim
- Jedi-vim - Vim bindings for the Jedi auto-completion library for Python.
- Python-mode - An all in one plugin for turning Vim into a Python IDE.
- YouCompleteMe - Includes Jedi-based completion engine for Python.
- Visual Studio
- PTVS - Python Tools for Visual Studio.
- Visual Studio Code
- Python - An extension with rich support for the Python language, with features including linting, IntelliSense, formatting, refactoring, debugging, unit tests, and jupyter support.
- Magic Python - Cutting edge Python syntax highlighter for Sublime Text, Atom, and Visual Studio Code. Used by GitHub to highlight your Python code!
- IDE
- LiClipse - Free polyglot IDE based on Eclipse. Uses PyDev for Python support.
- PyCharm - Commercial Python IDE by JetBrains. Has free community edition available.
- Spyder - Open Source Python IDE.
Libraries for sending and parsing email.
- envelopes - Mailing for human beings.
- flanker - A email address and Mime parsing library.
- imbox - Python IMAP for Humans.
- inbox.py - Python SMTP Server for Humans.
- lamson - Pythonic SMTP Application Server.
- Marrow Mailer - High-performance extensible mail delivery framework.
- modoboa - A mail hosting and management platform including a modern and simplified Web UI.
- Nylas Sync Engine - Providing a RESTful API on top of a powerful email sync platform.
- yagmail - Yet another Gmail/SMTP client.
Environment Management
Libraries for Python version and environment management.
- p - Dead simple interactive Python version management.
- pyenv - Simple Python version management.
- venv - (Python standard library in Python 3.3+) Creating lightweight virtual environments.
- virtualenv - A tool to create isolated Python environments.
- virtualenvwrapper - A set of extensions to virtualenv.
Files
Libraries for file manipulation and MIME type detection.
- imghdr - (Python standard library) Determine the type of an image.
- mimetypes - (Python standard library) Map filenames to MIME types.
- path.py - A module wrapper for os.path.
- pathlib - (Python standard library in Python 3.4+) An cross-platform, object-oriented path library.
- python-magic - A Python interface to the libmagic file type identification library.
- Unipath - An object-oriented approach to file/directory operations.
- watchdog - API and shell utilities to monitor file system events.
Foreign Function Interface
Libraries for providing foreign function interface.
- cffi - Foreign Function Interface for Python calling C code.
- ctypes - (Python standard library) Foreign Function Interface for Python calling C code.
- PyCUDA - A Python wrapper for Nvidia‘s CUDA API.
- SWIG - Simplified Wrapper and Interface Generator.
Forms
Libraries for working with forms.
- Deform - Python HTML form generation library influenced by the formish form generation library.
- django-bootstrap3 - Bootstrap 3 integration with Django.
- django-crispy-forms - A Django app which lets you create beautiful forms in a very elegant and DRY way.
- django-remote-forms - A platform independent Django form serializer.
- WTForms - A flexible forms validation and rendering library.
Functional Programming
Functional Programming with Python.
- CyToolz - Cython implementation of Toolz: High performance functional utilities.
- fn.py - Functional programming in Python: implementation of missing features to enjoy FP.
- funcy - A fancy and practical functional tools.
- Toolz - A collection of functional utilities for iterators, functions, and dictionaries.
GUI
Libraries for working with graphical user interface applications.
- curses - Built-in wrapper for ncurses used to create terminal GUI applications.
- enaml - Creating beautiful user-interfaces with Declaratic Syntax like QML.
- Flexx - Flexx is a pure Python toolkit for creating GUI‘s, that uses web technology for its rendering.
- kivy - A library for creating NUI applications, running on Windows, Linux, Mac OS X, Android and iOS.
- pyglet - A cross-platform windowing and multimedia library for Python.
- PyGObject - Python Bindings for GLib/GObject/GIO/GTK+ (GTK+3)
- PyQt - Python bindings for the Qt cross-platform application and UI framework, with support for both Qt v4 and Qt v5 frameworks.
- PySide - Python bindings for the Qt cross-platform application and UI framework, supporting the Qt v4 framework.
- Tkinter - Tkinter is Python‘s de-facto standard GUI package.
- Toga - A Python native, OS native GUI toolkit.
- urwid - A library for creating terminal GUI applications with strong support for widgets, events, rich colors, etc.
- wxPython - A blending of the wxWidgets C++ class library with the Python.
Game Development
Awesome game development libraries.
- Cocos2d - cocos2d is a framework for building 2D games, demos, and other graphical/interactive applications. It is based on pyglet.
- Panda3D - 3D game engine developed by Disney and maintained by Carnegie Mellon‘s Entertainment Technology Center. Written in C++, completely wrapped in Python.
- Pygame - Pygame is a set of Python modules designed for writing games.
- PyOgre - Python bindings for the Ogre 3D render engine, can be used for games, simulations, anything 3D.
- PyOpenGL - Python ctypes bindings for OpenGL and it‘s related APIs.
- PySDL2 - A ctypes based wrapper for the SDL2 library.
- RenPy - A Visual Novel engine.
Geolocation
Libraries for geocoding addresses and working with latitudes and longitudes.
- django-countries - A Django app that provides country choices for use with forms, flag icons static files, and a country field for models.
- GeoDjango - A world-class geographic web framework.
- GeoIP - Python API for MaxMind GeoIP Legacy Database.
- geojson - Python bindings and utilities for GeoJSON.
- geopy - Python Geocoding Toolbox.
- pygeoip - Pure Python GeoIP API.
HTML Manipulation
Libraries for working with HTML and XML.
- BeautifulSoup - Providing Pythonic idioms for iterating, searching, and modifying HTML or XML.
- bleach - A whitelist-based HTML sanitization and text linkification library.
- cssutils - A CSS library for Python.
- html5lib - A standards-compliant library for parsing and serializing HTML documents and fragments.
- lxml - A very fast, easy-to-use and versatile library for handling HTML and XML.
- MarkupSafe - Implements a XML/HTML/XHTML Markup safe string for Python.
- pyquery - A jQuery-like library for parsing HTML.
- untangle - Converts XML documents to Python objects for easy access.
- WeasyPrint - A visual rendering engine for HTML and CSS that can export to PDF.
- xmldataset - Simple XML Parsing.
- xmltodict - Working with XML feel like you are working with JSON.
HTTP
Libraries for working with HTTP.
- grequests - requests + gevent for asynchronous HTTP requests.
- httplib2 - Comprehensive HTTP client library.
- requests - HTTP Requests for Humans?.
- treq - Python requests like API built on top of Twisted‘s HTTP client.
- urllib3 - A HTTP library with thread-safe connection pooling, file post support, sanity friendly.
Hardware
Libraries for programming with hardware.
- ino - Command line toolkit for working with Arduino.
- Pingo - Pingo provides a uniform API to program devices like the Raspberry Pi, pcDuino, Intel Galileo, etc.
- Pyro - Python Robotics.
- PyUserInput - A module for cross-platform control of the mouse and keyboard.
- scapy - A brilliant packet manipulation library.
- wifi - A Python library and command line tool for working with WiFi on Linux.
Imagery
Libraries for manipulating images.
- hmap - Image histogram remapping.
- imgSeek - A project for searching a collection of images using visual similarity.
- nude.py - Nudity detection.
- pagan - Retro identicon (Avatar) generation based on input string and hash.
- pillow - Pillow is the friendly PIL fork.
- pyBarcode - Create barcodes in Python without needing PIL.
- pygram - Instagram-like image filters.
- python-qrcode - A pure Python QR Code generator.
- Quads - Computer art based on quadtrees.
- scikit-image - A Python library for (scientific) image processing.
- thumbor - A smart imaging service. It enables on-demand crop, re-sizing and flipping of images.
- wand - Python bindings for MagickWand, C API for ImageMagick.
Implementations
Implementations of Python.
- CLPython - Implementation of the Python programming language written in Common Lisp.
- CPython - Default, most widely used implementation of the Python programming language written in C.Optimizing Static Compiler for Python. Uses type mixins to compile Python into C or C++ modules resulting in large performance gains.
- Grumpy - More compiler than interpreter as more powerful CPython2.7 replacement (alpha).
- IronPython - Implementation of the Python programming language written in C# targeting the .NET Framework and Mono.
- Jython - Implementation of Python programming language written in Java for the Java virtual machine (JVM).
- MicroPython - MicroPython - a lean and efficient Python programming language implementation for microcontrollers and constrained systems
- Numba - Python JIT complier to LLVM aimed at scientific Python.
- PeachPy - x86-64 assembler embedded in Python. Can be used as inline assembler for Python or as a stand-alone assembler for Windows, Linux, OS X, Native Client and Go.
- Pyjion - A JIT for Python based upon CoreCLR.
- PyPy - Implementation of the Python programming language written in RPython and translated into C. PyPy focuses on speed, efficiency and compatibility with the original CPython interpreter. The interpreter uses black magic to make Python very fast without having to add in additional type information.
- PySec - Hardened version of python that makes it easier for security professionals and developers to write applications more resilient to attacks and manipulations.
- Pyston - A Python implementation built using LLVM and modern JIT techniques with the goal of achieving good performance.
- Stackless Python - An enhanced version of the Python programming language which allows programmers to reap the benefits of thread-based programming without the performance and complexity problems associated with conventional threads.
Interactive Interpreter
Interactive Python interpreters (REPL).
- bpython - A fancy interface to the Python interpreter.
- Jupyter Notebook (IPython) - A rich toolkit to help you make the most out of using Python interactively.
- ptpython - Advanced Python REPL built on top of the python-prompt-toolkit.
Internationalization
Libraries for working with i18n.
- Babel - An internationalization library for Python.
- PyICU - A wrapper of International Components for Unicode C++ library (ICU).
Job Scheduler
Libraries for scheduling jobs.
- APScheduler - A light but powerful in-process task scheduler that lets you schedule functions.
- django-schedule - A calendaring app for Django.
- doit - A task runner and build tool.
- gunnery - Multipurpose task execution tool for distributed systems with web-based interface.
- Joblib - A set of tools to provide lightweight pipelining in Python.
- Plan - Writing crontab file in Python like a charm.
- schedule - Python job scheduling for humans.
- Spiff - A powerful workflow engine implemented in pure Python.
- TaskFlow - A Python library that helps to make task execution easy, consistent and reliable.
Logging
Libraries for generating and working with logs.
- Eliot - Logging for complex & distributed systems.
- logbook - Logging replacement for Python.
- logging - (Python standard library) Logging facility for Python.
- Raven - The Python client for Sentry.
- Sentry - A realtime logging and aggregation server.
Machine Learning
Libraries for Machine Learning. See: awesome-machine-learning.
- gensim - Topic Modelling for Humans.
- MLlib - Apache Spark‘s scalable Machine Learning library.
- NuPIC - Numenta Platform for Intelligent Computing.
- Pylearn2 - A Machine Learning library based on Theano.
- scikit-learn - The most popular Python library for Machine Learning.
- surprise - A scikit for building and analyzing recommender systems.
- vowpal_porpoise - A lightweight Python wrapper for Vowpal Wabbit.
MapReduce
Frameworks and libraries for MapReduce.
- dpark - Python clone of Spark, a MapReduce alike framework in Python.
- dumbo - Python module that allows one to easily write and run Hadoop programs.
- luigi - A module that helps you build complex pipelines of batch jobs.
- mrjob - Run MapReduce jobs on Hadoop or Amazon Web Services.
- PySpark - The Spark Python API.
- streamparse - Run Python code against real-time streams of data. Integrates with Apache Storm.
Microsoft Windows
Python programming on Microsoft Windows.
- Python(x,y) - Scientific-applications-oriented Python Distribution based on Qt and Spyder.
- pythonlibs - Unofficial Windows binaries for Python extension packages.
- PythonNet - Python Integration with the .NET Common Language Runtime (CLR).
- PyWin32 - Python Extensions for Windows.
- WinPython - Portable development environment for Windows 7/8.
Miscellaneous
Useful libraries or tools that don‘t fit in the categories above.
- blinker - A fast Python in-process signal/event dispatching system.
- itsdangerous - Various helpers to pass trusted data to untrusted environments.
- pluginbase - A simple but flexible plugin system for Python.
- Pychievements - A framework for creating and tracking achievements.
- Tryton - A general purpose business framework.
Natural Language Processing
Libraries for working with human languages.
- Jieba - Chinese text segmentation.
- langid.py - Stand-alone language identification system.
- NLTK - A leading platform for building Python programs to work with human language data.
- Pattern - A web mining module for the Python.
- SnowNLP - A library for processing Chinese text.
- spaCy - A library for industrial-strength natural language processing in Python and Cython.
- TextBlob - Providing a consistent API for diving into common NLP tasks.
- TextGrocery - A simple, efficient short-text classification tool based on LibLinear and Jieba.
Network Virtualization
Tools and libraries for Virtual Networking and SDN (Software Defined Networking).
- Mininet - A popular network emulator and API written in Python.
- POX - An open source development platform for Python-based Software Defined Networking (SDN) control applications, such as OpenFlow SDN controllers.
- Pyretic - A member of the Frenetic family of SDN programming languages that provides powerful abstractions over network switches or emulators.
- SDX Platform - SDN based IXP implementation that leverages Mininet, POX and Pyretic.
Networking
Libraries for networking programming.
- asyncio - (Python standard library) Asynchronous I/O, event loop, coroutines and tasks.
- diesel - Greenlet-based event I/O Framework for Python.
- pulsar - Event-driven concurrent framework for Python.
- pyzmq - A Python wrapper for the ZeroMQ message library.
- Twisted - An event-driven networking engine.
- txZMQ - Twisted based wrapper for the ZeroMQ message library.
News Feed
Libraries for building user‘s activities.
- django-activity-stream - Generating generic activity streams from the actions on your site.
- Stream-Framework - Building newsfeed and notification systems using Cassandra and Redis.
ORM
Libraries that implement Object-Relational Mapping or data mapping techniques.
- Relational Databases
- Django Models - A part of Django.
- SQLAlchemy - The Python SQL Toolkit and Object Relational Mapper.
- awesome-sqlalchemy
- Orator - The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
- Peewee - A small, expressive ORM.
- PonyORM - ORM that provides a generator-oriented interface to SQL.
- python-sql - Write SQL queries pythonically.
- NoSQL Databases
- django-mongodb-engine - Django MongoDB Backend.
- flywheel - Object mapper for Amazon DynamoDB.
- hot-redis - Rich Python data types for Redis.
- MongoEngine - A Python Object-Document-Mapper for working with MongoDB.
- PynamoDB - A Pythonic interface for Amazon DynamoDB.
- redisco - A Python Library for Simple Models and Containers Persisted in Redis.
- Others
- butterdb - A Python ORM for Google Drive Spreadsheets.
- dataset - A JSON-based database.
Package Management
Libraries for package and dependency management.
- pip - The Python package and dependency manager.
- Python Package Index
- conda - Cross-platform, Python-agnostic binary package manager.
- Curdling - Curdling is a command line tool for managing Python packages.
- pip-tools - A set of tools to keep your pinned Python dependencies fresh.
- wheel - The new standard of Python distribution and are intended to replace eggs.
Package Repositories
Local PyPI repository server and proxies.
- warehouse - Next generation Python Package Repository (PyPI).
- Warehouse
- bandersnatch - PyPI mirroring tool provided by Python Packaging Authority (PyPA).
- devpi - PyPI server and packaging/testing/release tool.
- localshop - Local PyPI server (custom packages and auto-mirroring of pypi).
Permissions
Libraries that allow or deny users access to data or functionality.
- Carteblanche - Module to align code with thoughts of users and designers. Also magically handles navigation and permissions.
- django-guardian - Implementation of per object permissions for Django 1.2+
- django-rules - A tiny but powerful app providing object-level permissions to Django, without requiring a database.
Processes
Libraries for starting and communicating with OS processes.
- envoy - Python subprocess for Humans?.
- sarge - Yet another wrapper for subprocess.
- sh - A full-fledged subprocess replacement for Python.
Queue
Libraries for working with event and task queues.
- celery - An asynchronous task queue/job queue based on distributed message passing.
- huey - Little multi-threaded task queue.
- mrq - Mr. Queue - A distributed worker task queue in Python using Redis & gevent.
- rq - Simple job queues for Python.
- simpleq - A simple, infinitely scalable, Amazon SQS based queue.
RESTful API
Libraries for developing RESTful APIs.
- Django
- django-formapi - Create JSON APIs with Django‘s form validation.
- django-rest-framework - A powerful and flexible toolkit to build web APIs.
- django-tastypie - Creating delicious APIs for Django apps.
- Flask
- eve - REST API framework powered by Flask, MongoDB and good intentions.
- flask-api-utils - Taking care of API representation and authentication for Flask.
- flask-api - Browsable Web APIs for Flask.
- flask-restful - Quickly building REST APIs for Flask.
- flask-restless - Generating RESTful APIs for database models defined with SQLAlchemy.
- Pyramid
- cornice - A RESTful framework for Pyramid.
- Framework agnostic
- falcon - A high-performance framework for building cloud APIs and web app backends.
- hug - A Python3 framework for cleanly exposing APIs over HTTP and the Command Line with automatic documentation and validation.
- restless - Framework agnostic REST framework based on lessons learned from Tastypie.
- ripozo - Quickly creating REST/HATEOAS/Hypermedia APIs.
- sandman - Automated REST APIs for existing database-driven systems.
RPC Servers
RPC-compatible servers.
- SimpleJSONRPCServer - This library is an implementation of the JSON-RPC specification.
- SimpleXMLRPCServer - (Python standard library) Simple XML-RPC server implementation, single-threaded.
- zeroRPC - zerorpc is a flexible RPC implementation based on ZeroMQ and MessagePack.
Science
Libraries for scientific computing.
- astropy - A community Python library for Astronomy.
- bcbio-nextgen - Providing best-practice pipelines for fully automated high throughput sequencing analysis.
- bccb - Collection of useful code related to biological analysis.
- Biopython - Biopython is a set of freely available tools for biological computation.
- cclib - A library for parsing and interpreting the results of computational chemistry packages.
- NetworkX - A high-productivity software for complex networks.
- NIPY - A collection of neuroimaging toolkits.
- NumPy - A fundamental package for scientific computing with Python.
- Open Babel - A chemical toolbox designed to speak the many languages of chemical data.
- PyDy - Short for Python Dynamics, used to assist with workflow in the modeling of dynamic motion.
- PyMC - Markov Chain Monte Carlo sampling toolkit.
- RDKit - Cheminformatics and Machine Learning Software.
- SciPy - A Python-based ecosystem of open-source software for mathematics, science, and engineering.
- statsmodels - Statistical modeling and econometrics in Python.
- SymPy - A Python library for symbolic mathematics.
- Zipline - A Pythonic algorithmic trading library.
Search
Libraries and software for indexing and performing search queries on data.
- django-haystack - Modular search for Django.
- elasticsearch-dsl-py - The official high-level Python client for Elasticsearch.
- elasticsearch-py - The official low-level Python client for Elasticsearch.
- esengine - ElasticSearch ODM (Object Document Mapper) for Python.
- solrpy - A Python client for solr.
- Whoosh - A fast, pure Python search engine library.
Serialization
Libraries for serializing complex data types
- marshmallow - marshmallow is an ORM/ODM/framework-agnostic library for converting complex datatypes, such as objects, to and from native Python datatypes.
Serverless Frameworks
Frameworks for developing serverless Python code.
- apex - Build, deploy, and manage AWS Lambda functions with ease.
- python-lambda - A toolkit for developing and deploying Python code in AWS Lambda.
- Zappa - A tool for deploying WSGI applications on AWS Lambda and API Gateway.
Specific Formats Processing
Libraries for parsing and manipulating specific text formats.
- General
- tablib - A module for Tabular Datasets in XLS, CSV, JSON, YAML.
- Office
- Marmir - Takes Python data structures and turns them into spreadsheets.
- openpyxl - A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
- pyexcel - Providing one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.
- python-docx - Reads, queries and modifies Microsoft Word 2007/2008 docx files.
- relatorio - Templating OpenDocument files.
- unoconv - Convert between any document format supported by LibreOffice/OpenOffice.
- XlsxWriter - A Python module for creating Excel .xlsx files.
- xlwings - A BSD-licensed library that makes it easy to call Python from Excel and vice versa.
- xlwt / xlrd - Writing and reading data and formatting information from Excel files.
- PDF
- PDFMiner - A tool for extracting information from PDF documents.
- PyPDF2 - A library capable of splitting, merging and transforming PDF pages.
- ReportLab - Allowing Rapid creation of rich PDF documents.
- Markdown
- Mistune - Fastest and full featured pure Python parsers of Markdown.
- Python-Markdown - A Python implementation of John Gruber’s Markdown.
- YAML
- PyYAML - YAML implementations for Python.
- CSV
- csvkit - Utilities for converting to and working with CSV.
- Archive
- unp - A command line tool that can unpack archives easily.
Static Site Generator
Static site generator is a software that takes some text + templates as input and produces HTML files on the output.
- Cactus - Static site generator for designers.
- Hyde - Jinja2-based static web site generator.
- Lektor - An easy to use static CMS and blog engine.
- Nikola - A static website and blog generator.
- Pelican - Uses Markdown or ReST for content and Jinja 2 for themes. Supports DVCS, Disqus. AGPL.
- Tinkerer - Tinkerer is a blogging engine/.static website generator powered by Sphinx.
Tagging
Libraries for tagging items.
- django-taggit - Simple tagging for Django.
Template Engine
Libraries and tools for templating and lexing.
- Genshi - Python templating toolkit for generation of web-aware output.
- Jinja2 - A modern and designer friendly templating language.
- Mako - Hyperfast and lightweight templating for the Python platform.
Testing
Libraries for testing codebases and generating test data.
- Testing Frameworks
- hypothesis - Hypothesis is an advanced Quickcheck style property based testing library.
- mamba - The definitive testing tool for Python. Born under the banner of BDD.
- nose - A nicer unittest for Python.
- nose2 - The successor to nose, based on unittest2.
- pytest - A mature full-featured Python testing tool.
- Robot Framework - A generic test automation framework.
- unittest - (Python standard library) Unit testing framework.
- Test Runners
- green - A clean, colorful test runner.
- tox - Auto builds and tests distributions in multiple Python versions
- GUI / Web Testing
- locust - Scalable user load testing tool written in Python.
- PyAutoGUI - PyAutoGUI is a cross-platform GUI automation Python module for human beings.
- Selenium - Python bindings for Selenium WebDriver.
- sixpack - A language-agnostic A/B Testing framework.
- splinter - Open source tool for testing web applications.
- Mock
- doublex - Powerful test doubles framework for Python.
- freezegun - Travel through time by mocking the datetime module.
- httmock - A mocking library for requests for Python 2.6+ and 3.2+.
- httpretty - HTTP request mock tool for Python.
- mock - (Python standard library) A mocking and patching library.
- responses - A utility library for mocking out the requests Python library.
- VCR.py - Record and replay HTTP interactions on your tests.
- Object Factories
- factory_boy - A test fixtures replacement for Python.
- mixer - Another fixtures replacement. Supported Django, Flask, SQLAlchemy, Peewee and etc.
- model_mommy - Creating random fixtures for testing in Django.
- Code Coverage
- coverage - Code coverage measurement.
- Fake Data
- church - is a Python library that help you generate fake data.
- fake2db - Fake database generator.
- faker - A Python package that generates fake data.
- radar - Generate random datetime / time.
- Error Handler
- FuckIt.py - FuckIt.py uses state-of-the-art technology to make sure your Python code runs whether it has any right to or not.
Text Processing
Libraries for parsing and manipulating plain texts.
- General
- chardet - Python 2/3 compatible character encoding detector.
- difflib - (Python standard library) Helpers for computing deltas.
- ftfy - Makes Unicode text less broken and more consistent automagically.
- fuzzywuzzy - Fuzzy String Matching.
- Levenshtein - Fast computation of Levenshtein distance and string similarity.
- pangu.py - Spacing texts for CJK and alphanumerics.
- pyfiglet - An implementation of figlet written in Python.
- shortuuid - A generator library for concise, unambiguous and URL-safe UUIDs.
- unidecode - ASCII transliterations of Unicode text.
- uniout - Print readable chars instead of the escaped string.
- xpinyin - A library to translate Chinese hanzi (漢字) to pinyin (拼音).
- Slugify
- awesome-slugify - A Python slugify library that can preserve unicode.
- python-slugify - A Python slugify library that translates unicode to ASCII.
- unicode-slugify - A slugifier that generates unicode slugs with Django as a dependency.
- Parser
- phonenumbers - Parsing, formatting, storing and validating international phone numbers.
- PLY - Implementation of lex and yacc parsing tools for Python
- Pygments - A generic syntax highlighter.
- pyparsing - A general purpose framework for generating parsers.
- python-nameparser - Parsing human names into their individual components.
- python-user-agents - Browser user agent parser.
- sqlparse - A non-validating SQL parser.
Third-party APIs
Libraries for accessing third party services APIs. See: List of Python API Wrappers and Libraries.
- apache-libcloud - One Python library for all clouds.
- boto3 - Python interface to Amazon Web Services.
- django-wordpress - WordPress models and views for Django.
- facebook-sdk - Facebook Platform Python SDK.
- facepy - Facepy makes it really easy to interact with Facebook‘s Graph API
- gmail - A Pythonic interface for Gmail.
- google-api-python-client - Google APIs Client Library for Python.
- gspread - Google Spreadsheets Python API.
- twython - A Python wrapper for the Twitter API.
URL Manipulation
Libraries for parsing URLs.
- furl - A small Python library that makes manipulating URLs simple.
- purl - A simple, immutable URL class with a clean API for interrogation and manipulation.
- pyshorteners - A pure Python URL shortening lib.
- short_url - Python implementation for generating Tiny URL and bit.ly-like URLs.
- webargs - A friendly library for parsing HTTP request arguments, with built-in support for popular web frameworks, including Flask, Django, Bottle, Tornado, and Pyramid.
Video
Libraries for manipulating video and GIFs.
- moviepy - A module for script-based movie editing with many formats, including animated GIFs.
- scikit-video - Video processing routines for SciPy.
WSGI Servers
WSGI-compatible web servers.
- bjoern - Asynchronous, very fast and written in C.
- fapws3 - Asynchronous (network side only), written in C.
- gunicorn - Pre-forked, partly written in C.
- meinheld - Asynchronous, partly written in C.
- netius - Asynchronous, very fast.
- paste - Multi-threaded, stable, tried and tested.
- rocket - Multi-threaded.
- uWSGI - A project aims at developing a full stack for building hosting services, written in C.
- waitress - Multi-threaded, powers Pyramid.
- Werkzeug - A WSGI utility library for Python that powers Flask and can easily be embedded into your own projects.
Web Content Extracting
Libraries for extracting web contents.
- Haul - An Extensible Image Crawler.
- html2text - Convert HTML to Markdown-formatted text.
- lassie - Web Content Retrieval for Humans.
- micawber - A small library for extracting rich content from URLs.
- newspaper - News extraction, article extraction and content curation in Python.
- opengraph - A Python module to parse the Open Graph Protocol
- python-goose - HTML Content/Article Extractor.
- python-readability - Fast Python port of arc90‘s readability tool.
- sanitize - Bringing sanity to world of messed-up data.
- sumy - A module for automatic summarization of text documents and HTML pages.
- textract - Extract text from any document, Word, PowerPoint, PDFs, etc.
Web Crawling
Libraries for scraping websites.
- cola - A distributed crawling framework.
- Demiurge - PyQuery-based scraping micro-framework.
- feedparser - Universal feed parser.
- Grab - Site scraping framework.
- MechanicalSoup - A Python library for automating interaction with websites.
- portia - Visual scraping for Scrapy.
- pyspider - A powerful spider system.
- RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser.
- Scrapy - A fast high-level screen scraping and web crawling framework.
Web Frameworks
Full stack web frameworks.
- Bottle - A fast, simple and lightweight WSGI micro web-framework.
- CherryPy - A minimalist Python web framework, HTTP/1.1-compliant and WSGI thread-pooled.
- Django - The most popular web framework in Python.
- awesome-django
- Flask - A microframework for Python.
- awesome-flask
- Pyramid - A small, fast, down-to-earth, open source Python web framework.
- awesome-pyramid
- Tornado - A Web framework and asynchronous networking library.
- TurboGears - A microframework that can scale up to a full stack solution.
WebSocket
Libraries for working with WebSocket.
- AutobahnPython - WebSocket & WAMP for Python on Twisted and asyncio.
- Crossbar - Open-source Unified Application Router (Websocket & WAMP for Python on Autobahn).
- django-socketio - WebSockets for Django.
- WebSocket-for-Python - WebSocket client and server library for Python 2 and 3 as well as PyPy.
Services
Online tools and APIs to simplify development.
Continuous Integration
See: awesome-CIandCD.
- CircleCI - A CI service that can run very fast parallel testing. (GitHub only)
- Travis CI - A popular CI service for your open source and private projects. (GitHub only)
- Vexor CI - A continuous integration tool for private apps with pay-per-minute billing model.
- Wercker - A Docker-based platform for building and deploying applications and microservices.
Code Quality
- Codacy - Automated Code Review to ship better code, faster. Free for Open Source.
- Codecov - Code coverage dashboard.
- Landscape - Hosted continuous Python code metrics.
- QuantifiedCode - A data-driven, automated, continuous code review tool.
Resources
Where to discover new Python libraries.
Podcasts
- Podcast.init
- Talk Python To Me
- Python Bytes
- @codetengu
- @getpy
- @importpython
- @planetpython
- @pycoders
- @pypi
- @pythontrending
- @PythonWeekly
- @TalkPython
Websites
- /r/CoolGithubProjects
- /r/Python
- Awesome Python @LibHunt
- Django Packages
- Full Stack Python
- PyPI Ranking
- Python 3 Wall of Superpowers
- Python Hackers
- Python ZEEF
- Python 开发社区
- Trending Python repositories on GitHub today
Weekly
- CodeTengu Weekly
- Import Python Newsletter
- Pycoder‘s Weekly
- Python Weekly
- Python Bytes
Other Awesome Lists
List of lists.
- Monty
- awesome
- lists
- Python
- pycrumbs
- python-github-projects
- python_reference
- pythonidae
Contributing
Your contributions are always welcome! Please take a look at the contribution guidelines first.
I will keep some pull requests open if I‘m not sure whether those libraries are awesome, you could vote for them by adding to them. Pull requests will be merged when their votes reach 20.
第二,https://pypi.python.org/pypi
第三,The Python Standard Library?
While The Python Language Reference describes the exact syntax and semantics of the Python language, this library reference manual describes the standard library that is distributed with Python. It also describes some of the optional components that are commonly included in Python distributions.
Python’s standard library is very extensive, offering a wide range of facilities as indicated by the long table of contents listed below. The library contains built-in modules (written in C) that provide access to system functionality such as file I/O that would otherwise be inaccessible to Python programmers, as well as modules written in Python that provide standardized solutions for many problems that occur in everyday programming. Some of these modules are explicitly designed to encourage and enhance the portability of Python programs by abstracting away platform-specifics into platform-neutral APIs.
The Python installers for the Windows platform usually include the entire standard library and often also include many additional components. For Unix-like operating systems Python is normally provided as a collection of packages, so it may be necessary to use the packaging tools provided with the operating system to obtain some or all of the optional components.
In addition to the standard library, there is a growing collection of several thousand components (from individual programs and modules to packages and entire application development frameworks), available from the Python Package Index.
- 1. Introduction
- 2. Built-in Functions
- 3. Built-in Constants
- 3.1. Constants added by the
site
module
- 3.1. Constants added by the
- 4. Built-in Types
- 4.1. Truth Value Testing
- 4.2. Boolean Operations —
and
,or
,not
- 4.3. Comparisons
- 4.4. Numeric Types —
int
,float
,complex
- 4.5. Iterator Types
- 4.6. Sequence Types —
list
,tuple
,range
- 4.7. Text Sequence Type —
str
- 4.8. Binary Sequence Types —
bytes
,bytearray
,memoryview
- 4.9. Set Types —
set
,frozenset
- 4.10. Mapping Types —
dict
- 4.11. Context Manager Types
- 4.12. Other Built-in Types
- 4.13. Special Attributes
- 5. Built-in Exceptions
- 5.1. Base classes
- 5.2. Concrete exceptions
- 5.3. Warnings
- 5.4. Exception hierarchy
- 6. Text Processing Services
- 6.1.
string
— Common string operations - 6.2.
re
— Regular expression operations - 6.3.
difflib
— Helpers for computing deltas - 6.4.
textwrap
— Text wrapping and filling - 6.5.
unicodedata
— Unicode Database - 6.6.
stringprep
— Internet String Preparation - 6.7.
readline
— GNU readline interface - 6.8.
rlcompleter
— Completion function for GNU readline
- 6.1.
- 7. Binary Data Services
- 7.1.
struct
— Interpret bytes as packed binary data - 7.2.
codecs
— Codec registry and base classes
- 7.1.
- 8. Data Types
- 8.1.
datetime
— Basic date and time types - 8.2.
calendar
— General calendar-related functions - 8.3.
collections
— Container datatypes - 8.4.
collections.abc
— Abstract Base Classes for Containers - 8.5.
heapq
— Heap queue algorithm - 8.6.
bisect
— Array bisection algorithm - 8.7.
array
— Efficient arrays of numeric values - 8.8.
weakref
— Weak references - 8.9.
types
— Dynamic type creation and names for built-in types - 8.10.
copy
— Shallow and deep copy operations - 8.11.
pprint
— Data pretty printer - 8.12.
reprlib
— Alternaterepr()
implementation - 8.13.
enum
— Support for enumerations
- 8.1.
- 9. Numeric and Mathematical Modules
- 9.1.
numbers
— Numeric abstract base classes - 9.2.
math
— Mathematical functions - 9.3.
cmath
— Mathematical functions for complex numbers - 9.4.
decimal
— Decimal fixed point and floating point arithmetic - 9.5.
fractions
— Rational numbers - 9.6.
random
— Generate pseudo-random numbers - 9.7.
statistics
— Mathematical statistics functions
- 9.1.
- 10. Functional Programming Modules
- 10.1.
itertools
— Functions creating iterators for efficient looping - 10.2.
functools
— Higher-order functions and operations on callable objects - 10.3.
operator
— Standard operators as functions
- 10.1.
- 11. File and Directory Access
- 11.1.
pathlib
— Object-oriented filesystem paths - 11.2.
os.path
— Common pathname manipulations - 11.3.
fileinput
— Iterate over lines from multiple input streams - 11.4.
stat
— Interpretingstat()
results - 11.5.
filecmp
— File and Directory Comparisons - 11.6.
tempfile
— Generate temporary files and directories - 11.7.
glob
— Unix style pathname pattern expansion - 11.8.
fnmatch
— Unix filename pattern matching - 11.9.
linecache
— Random access to text lines - 11.10.
shutil
— High-level file operations - 11.11.
macpath
— Mac OS 9 path manipulation functions
- 11.1.
- 12. Data Persistence
- 12.1.
pickle
— Python object serialization - 12.2.
copyreg
— Registerpickle
support functions - 12.3.
shelve
— Python object persistence - 12.4.
marshal
— Internal Python object serialization - 12.5.
dbm
— Interfaces to Unix “databases” - 12.6.
sqlite3
— DB-API 2.0 interface for SQLite databases
- 12.1.
- 13. Data Compression and Archiving
- 13.1.
zlib
— Compression compatible with gzip - 13.2.
gzip
— Support for gzip files - 13.3.
bz2
— Support for bzip2 compression - 13.4.
lzma
— Compression using the LZMA algorithm - 13.5.
zipfile
— Work with ZIP archives - 13.6.
tarfile
— Read and write tar archive files
- 13.1.
- 14. File Formats
- 14.1.
csv
— CSV File Reading and Writing - 14.2.
configparser
— Configuration file parser - 14.3.
netrc
— netrc file processing - 14.4.
xdrlib
— Encode and decode XDR data - 14.5.
plistlib
— Generate and parse Mac OS X.plist
files
- 14.1.
- 15. Cryptographic Services
- 15.1.
hashlib
— Secure hashes and message digests - 15.2.
hmac
— Keyed-Hashing for Message Authentication - 15.3.
secrets
— Generate secure random numbers for managing secrets
- 15.1.
- 16. Generic Operating System Services
- 16.1.
os
— Miscellaneous operating system interfaces - 16.2.
io
— Core tools for working with streams - 16.3.
time
— Time access and conversions - 16.4.
argparse
— Parser for command-line options, arguments and sub-commands - 16.5.
getopt
— C-style parser for command line options - 16.6.
logging
— Logging facility for Python - 16.7.
logging.config
— Logging configuration - 16.8.
logging.handlers
— Logging handlers - 16.9.
getpass
— Portable password input - 16.10.
curses
— Terminal handling for character-cell displays - 16.11.
curses.textpad
— Text input widget for curses programs - 16.12.
curses.ascii
— Utilities for ASCII characters - 16.13.
curses.panel
— A panel stack extension for curses - 16.14.
platform
— Access to underlying platform’s identifying data - 16.15.
errno
— Standard errno system symbols - 16.16.
ctypes
— A foreign function library for Python
- 16.1.
- 17. Concurrent Execution
- 17.1.
threading
— Thread-based parallelism - 17.2.
multiprocessing
— Process-based parallelism - 17.3. The
concurrent
package - 17.4.
concurrent.futures
— Launching parallel tasks - 17.5.
subprocess
— Subprocess management - 17.6.
sched
— Event scheduler - 17.7.
queue
— A synchronized queue class - 17.8.
dummy_threading
— Drop-in replacement for thethreading
module - 17.9.
_thread
— Low-level threading API - 17.10.
_dummy_thread
— Drop-in replacement for the_thread
module
- 17.1.
- 18. Interprocess Communication and Networking
- 18.1.
socket
— Low-level networking interface - 18.2.
ssl
— TLS/SSL wrapper for socket objects - 18.3.
select
— Waiting for I/O completion - 18.4.
selectors
— High-level I/O multiplexing - 18.5.
asyncio
— Asynchronous I/O, event loop, coroutines and tasks - 18.6.
asyncore
— Asynchronous socket handler - 18.7.
asynchat
— Asynchronous socket command/response handler - 18.8.
signal
— Set handlers for asynchronous events - 18.9.
mmap
— Memory-mapped file support
- 18.1.
- 19. Internet Data Handling
- 19.1.
email
— An email and MIME handling package - 19.2.
json
— JSON encoder and decoder - 19.3.
mailcap
— Mailcap file handling - 19.4.
mailbox
— Manipulate mailboxes in various formats - 19.5.
mimetypes
— Map filenames to MIME types - 19.6.
base64
— Base16, Base32, Base64, Base85 Data Encodings - 19.7.
binhex
— Encode and decode binhex4 files - 19.8.
binascii
— Convert between binary and ASCII - 19.9.
quopri
— Encode and decode MIME quoted-printable data - 19.10.
uu
— Encode and decode uuencode files
- 19.1.
- 20. Structured Markup Processing Tools
- 20.1.
html
— HyperText Markup Language support - 20.2.
html.parser
— Simple HTML and XHTML parser - 20.3.
html.entities
— Definitions of HTML general entities - 20.4. XML Processing Modules
- 20.5.
xml.etree.ElementTree
— The ElementTree XML API - 20.6.
xml.dom
— The Document Object Model API - 20.7.
xml.dom.minidom
— Minimal DOM implementation - 20.8.
xml.dom.pulldom
— Support for building partial DOM trees - 20.9.
xml.sax
— Support for SAX2 parsers - 20.10.
xml.sax.handler
— Base classes for SAX handlers - 20.11.
xml.sax.saxutils
— SAX Utilities - 20.12.
xml.sax.xmlreader
— Interface for XML parsers - 20.13.
xml.parsers.expat
— Fast XML parsing using Expat
- 20.1.
- 21. Internet Protocols and Support
- 21.1.
webbrowser
— Convenient Web-browser controller - 21.2.
cgi
— Common Gateway Interface support - 21.3.
cgitb
— Traceback manager for CGI scripts - 21.4.
wsgiref
— WSGI Utilities and Reference Implementation - 21.5.
urllib
— URL handling modules - 21.6.
urllib.request
— Extensible library for opening URLs - 21.7.
urllib.response
— Response classes used by urllib - 21.8.
urllib.parse
— Parse URLs into components - 21.9.
urllib.error
— Exception classes raised by urllib.request - 21.10.
urllib.robotparser
— Parser for robots.txt - 21.11.
http
— HTTP modules - 21.12.
http.client
— HTTP protocol client - 21.13.
ftplib
— FTP protocol client - 21.14.
poplib
— POP3 protocol client - 21.15.
imaplib
— IMAP4 protocol client - 21.16.
nntplib
— NNTP protocol client - 21.17.
smtplib
— SMTP protocol client - 21.18.
smtpd
— SMTP Server - 21.19.
telnetlib
— Telnet client - 21.20.
uuid
— UUID objects according to RFC 4122 - 21.21.
socketserver
— A framework for network servers - 21.22.
http.server
— HTTP servers - 21.23.
http.cookies
— HTTP state management - 21.24.
http.cookiejar
— Cookie handling for HTTP clients - 21.25.
xmlrpc
— XMLRPC server and client modules - 21.26.
xmlrpc.client
— XML-RPC client access - 21.27.
xmlrpc.server
— Basic XML-RPC servers - 21.28.
ipaddress
— IPv4/IPv6 manipulation library
- 21.1.
- 22. Multimedia Services
- 22.1.
audioop
— Manipulate raw audio data - 22.2.
aifc
— Read and write AIFF and AIFC files - 22.3.
sunau
— Read and write Sun AU files - 22.4.
wave
— Read and write WAV files - 22.5.
chunk
— Read IFF chunked data - 22.6.
colorsys
— Conversions between color systems - 22.7.
imghdr
— Determine the type of an image - 22.8.
sndhdr
— Determine type of sound file - 22.9.
ossaudiodev
— Access to OSS-compatible audio devices
- 22.1.
- 23. Internationalization
- 23.1.
gettext
— Multilingual internationalization services - 23.2.
locale
— Internationalization services
- 23.1.
- 24. Program Frameworks
- 24.1.
turtle
— Turtle graphics - 24.2.
cmd
— Support for line-oriented command interpreters - 24.3.
shlex
— Simple lexical analysis
- 24.1.
- 25. Graphical User Interfaces with Tk
- 25.1.
tkinter
— Python interface to Tcl/Tk - 25.2.
tkinter.ttk
— Tk themed widgets - 25.3.
tkinter.tix
— Extension widgets for Tk - 25.4.
tkinter.scrolledtext
— Scrolled Text Widget - 25.5. IDLE
- 25.6. Other Graphical User Interface Packages
- 25.1.
- 26. Development Tools
- 26.1.
typing
— Support for type hints - 26.2.
pydoc
— Documentation generator and online help system - 26.3.
doctest
— Test interactive Python examples - 26.4.
unittest
— Unit testing framework - 26.5.
unittest.mock
— mock object library - 26.6.
unittest.mock
— getting started - 26.7. 2to3 - Automated Python 2 to 3 code translation
- 26.8.
test
— Regression tests package for Python - 26.9.
test.support
— Utilities for the Python test suite
- 26.1.
- 27. Debugging and Profiling
- 27.1.
bdb
— Debugger framework - 27.2.
faulthandler
— Dump the Python traceback - 27.3.
pdb
— The Python Debugger - 27.4. The Python Profilers
- 27.5.
timeit
— Measure execution time of small code snippets - 27.6.
trace
— Trace or track Python statement execution - 27.7.
tracemalloc
— Trace memory allocations
- 27.1.
- 28. Software Packaging and Distribution
- 28.1.
distutils
— Building and installing Python modules - 28.2.
ensurepip
— Bootstrapping thepip
installer - 28.3.
venv
— Creation of virtual environments - 28.4.
zipapp
— Manage executable python zip archives
- 28.1.
- 29. Python Runtime Services
- 29.1.
sys
— System-specific parameters and functions - 29.2.
sysconfig
— Provide access to Python’s configuration information - 29.3.
builtins
— Built-in objects - 29.4.
__main__
— Top-level script environment - 29.5.
warnings
— Warning control - 29.6.
contextlib
— Utilities forwith
-statement contexts - 29.7.
abc
— Abstract Base Classes - 29.8.
atexit
— Exit handlers - 29.9.
traceback
— Print or retrieve a stack traceback - 29.10.
__future__
— Future statement definitions - 29.11.
gc
— Garbage Collector interface - 29.12.
inspect
— Inspect live objects - 29.13.
site
— Site-specific configuration hook - 29.14.
fpectl
— Floating point exception control
- 29.1.
- 30. Custom Python Interpreters
- 30.1.
code
— Interpreter base classes - 30.2.
codeop
— Compile Python code
- 30.1.
- 31. Importing Modules
- 31.1.
zipimport
— Import modules from Zip archives - 31.2.
pkgutil
— Package extension utility - 31.3.
modulefinder
— Find modules used by a script - 31.4.
runpy
— Locating and executing Python modules - 31.5.
importlib
— The implementation ofimport
- 31.1.
- 32. Python Language Services
- 32.1.
parser
— Access Python parse trees - 32.2.
ast
— Abstract Syntax Trees - 32.3.
symtable
— Access to the compiler’s symbol tables - 32.4.
symbol
— Constants used with Python parse trees - 32.5.
token
— Constants used with Python parse trees - 32.6.
keyword
— Testing for Python keywords - 32.7.
tokenize
— Tokenizer for Python source - 32.8.
tabnanny
— Detection of ambiguous indentation - 32.9.
pyclbr
— Python class browser support - 32.10.
py_compile
— Compile Python source files - 32.11.
compileall
— Byte-compile Python libraries - 32.12.
dis
— Disassembler for Python bytecode - 32.13.
pickletools
— Tools for pickle developers
- 32.1.
- 33. Miscellaneous Services
- 33.1.
formatter
— Generic output formatting
- 33.1.
- 34. MS Windows Specific Services
- 34.1.
msilib
— Read and write Microsoft Installer files - 34.2.
msvcrt
— Useful routines from the MS VC++ runtime - 34.3.
winreg
— Windows registry access - 34.4.
winsound
— Sound-playing interface for Windows
- 34.1.
- 35. Unix Specific Services
- 35.1.
posix
— The most common POSIX system calls - 35.2.
pwd
— The password database - 35.3.
spwd
— The shadow password database - 35.4.
grp
— The group database - 35.5.
crypt
— Function to check Unix passwords - 35.6.
termios
— POSIX style tty control - 35.7.
tty
— Terminal control functions - 35.8.
pty
— Pseudo-terminal utilities - 35.9.
fcntl
— Thefcntl
andioctl
system calls - 35.10.
pipes
— Interface to shell pipelines - 35.11.
resource
— Resource usage information - 35.12.
nis
— Interface to Sun’s NIS (Yellow Pages) - 35.13.
syslog
— Unix syslog library routines
- 35.1.
- 36. Superseded Modules
- 36.1.
optparse
— Parser for command line options - 36.2.
imp
— Access theimport
internals
- 36.1.
- 37. Undocumented Modules
- 37.1. Platform specific modules
第四,http://blog.jobbole.com/72306/
本文作者: 伯乐在线 - 伯乐 。未经作者许可,禁止转载!
欢迎加入伯乐在线 专栏作者。
以下是伯乐在线从GitHub中整理出的15个最受欢迎的Python开源框架。这些框架包括事件I/O,OLAP,Web开发,高性能网络通信,测试,爬虫等。
Django: Python Web应用开发框架
Django 应该是最出名的Python框架,GAE甚至Erlang都有框架受它影响。Django是走大而全的方向,它最出名的是其全自动化的管理后台:只需要使用起ORM,做简单的对象定义,它就能自动生成数据库结构、以及全功能的管理后台。
Diesel:基于Greenlet的事件I/O框架
Diesel提供一个整洁的API来编写网络客户端和服务器。支持TCP和UDP。
Flask:一个用Python编写的轻量级Web应用框架
Flask是一个使用Python编写的轻量级Web应用框架。基于Werkzeug WSGI工具箱和Jinja2 模板引擎。Flask也被称为“microframework”,因为它使用简单的核心,用extension增加其他功能。Flask没有默认使用的数据库、窗体验证工具。
Cubes:轻量级Python OLAP框架
Cubes是一个轻量级Python框架,包含OLAP、多维数据分析和浏览聚合数据(aggregated data)等工具。
Kartograph.py:创造矢量地图的轻量级Python框架
Kartograph是一个Python库,用来为ESRI生成SVG地图。Kartograph.py目前仍处于beta阶段,你可以在virtualenv环境下来测试。
Pulsar:Python的事件驱动并发框架
Pulsar是一个事件驱动的并发框架,有了pulsar,你可以写出在不同进程或线程中运行一个或多个活动的异步服务器。
Web2py:全栈式Web框架
Web2py是一个为Python语言提供的全功能Web应用框架,旨在敏捷快速的开发Web应用,具有快速、安全以及可移植的数据库驱动的应用,兼容Google App Engine。
Falcon:构建云API和网络应用后端的高性能Python框架
Falcon是一个构建云API的高性能Python框架,它鼓励使用REST架构风格,尽可能以最少的力气做最多的事情。
Dpark:Python版的Spark
DPark是Spark的Python克隆,是一个Python实现的分布式计算框架,可以非常方便地实现大规模数据处理和迭代计算。DPark由豆瓣实现,目前豆瓣内部的绝大多数数据分析都使用DPark完成,正日趋完善。
Buildbot:基于Python的持续集成测试框架
Buildbot是一个开源框架,可以自动化软件构建、测试和发布等过程。每当代码有改变,服务器要求不同平台上的客户端立即进行代码构建和测试,收集并报告不同平台的构建和测试结果。
Zerorpc:基于ZeroMQ的高性能分布式RPC框架
Zerorpc是一个基于ZeroMQ和MessagePack开发的远程过程调用协议(RPC)实现。和 Zerorpc 一起使用的 Service API 被称为 zeroservice。Zerorpc 可以通过编程或命令行方式调用。
Bottle: 微型Python Web框架
Bottle是一个简单高效的遵循WSGI的微型python Web框架。说微型,是因为它只有一个文件,除Python标准库外,它不依赖于任何第三方模块。
Tornado:异步非阻塞IO的Python Web框架
Tornado的全称是Torado Web Server,从名字上看就可知道它可以用作Web服务器,但同时它也是一个Python Web的开发框架。最初是在FriendFeed公司的网站上使用,FaceBook收购了之后便开源了出来。
webpy: 轻量级的Python Web框架
webpy的设计理念力求精简(Keep it simple and powerful),源码很简短,只提供一个框架所必须的东西,不依赖大量的第三方模块,它没有URL路由、没有模板也没有数据库的访问。
Scrapy:Python的爬虫框架
Scrapy是一个使用Python编写的,轻量级的,简单轻巧,并且使用起来非常的方便。
第五,2015第三方库
下面是我们在 2015 年使用过 Python 库的十大排名,排名不分先后。我们尽量避开已经被广泛接受的库,例如 Django、Flask、Django Rest Framework 等,介绍一些可能不是那么出名的库。系好安全带,老司机要开车了!
1. jupyter
如果画家无法立即看到他作画的内容,这对画家来说可真是太难了。『Jupyter Notebooks』 让我们更容易与代码、绘图和结果进行交互,它正逐渐成为一款数据科学家们所喜爱的工具。这些 Notebook 是结合了实时代码(live code)和说明文档的文档。由于这个原因,它可以快速创建原型或者教程。
虽然我们只使用 Jupyter 来写 Python,不过它最近增加了对不少编程语言的支持,例如 Julia 或者 Haskell。
2. retrying
『Retrying』库可以帮你避免重复劳动:它实现了『重试』行为。它提供了一个通用的 decorator,这个 decorator 可以很容易给方法增添重试的能力,而且还可以设置一系列的属性,如最大重试次数、延时、退避休眠(backoff sleeping)和错误条件(error conditions)等,以此来获得你想要的重试行为。简单轻便。
3. aiohttp
2015年有很多非常重要的库都被移植到 Python 3,所以我们也要开始拥抱它了。在使用协程写并发代码的时候,我们非常喜欢用 asyncio,所以需要一个使用相同并发模式的 HTTP 客户端(比如 requests)和服务器。而『aiohttp』就是这样一个库,它为 asyncio 提供了简洁易用的 HTTP 客户端和服务器。
4. plumbum
为了在 Python 程序中调用其他脚本或者可执行程序,我们尝试过很多『subprocess』的封装,但『plumbum』模式很轻松就击败了它们。它提供了非常易用的语法,可以轻松地以跨平台的方式执行本地或者远程命令,获取输出或者错误代码。如果这还不够,你还可以组合它们(shell 管道的方式),而且它还提供了创建命令行应用的接口。试试吧!
5. phonenumbers
处理和验证电话号码是一件非常痛苦的事情,因为需要考虑一大堆国际前缀和区域码,还可能遇到某个国家的特殊情形。这个叫做 『phonenumbers』的 Python 库,是从 Google 的『libphonenumbers』库移植而来,后者极大的简化了这项工作。它可以用来解析、格式化或者验证电话号码,而且需要写的代码非常少。最重要的是,『phonenumbers』可以判断一个电话号码是否是唯一的(遵照 E.164 格式)。它同时支持 Python 2 和 Python 3。
我们已经在很多项目中广泛使用这个库了,主要是调整 django-phonenumber-field,用它来解决这个经常遇到的枯燥问题。
6. networkx
在很多任务中,图形和网络是非常常用的工具,比如组织数据、显示它们的流动或者展示实体之间的关联。『NetworkX』允许创建、操作图形和网络,它所使用的算法很容易拓展,并且在处理大型图形时非常理想。除此之外,它提供了很多对图形进行着色的选项,这也让它变成了一种非常棒的可视化工具。
7. influxdb
如果你要按照时间顺序储存一批数据,那你一定要考虑一下用 InfluxDB。InfluxDB 是一个时间序列数据库,我们用它来储存不同时间的测量值。通过 RESTFul API(译注:REST,表现层状态转化,是一种软件架构风格,目前主流的 Web 服务实现方案之一,符合 REST 设计风格的 Web API 称为 RESTful API),它变得极其易用而且高效,这在处理海量数据时是非常必要的。另外,由于其内建了聚类功能,因此对数据的检索和分组也变得十分轻松。这个官方的客户端通过 API 调用抽象掉了大部分工作,不过我们也确实希望它能再改进一下,不要直接写 JSON,而是使用查询这种更符合 Python 风格的方式。
8. elasticsearch-dsl
如果你曾经用过 Elasticsearch,那你想必被那些特别长的 JSON 格式的查询摧残过,而且把大把的时间浪费在了检查哪里有解析错误上。『Elasticsearch DSL』是一个建立在 Elasticsearch 官方客户端基础之上的客户端,它让你不必再为 JSON 担心:你只需要用一个 Python 类或者类 queryset 表达式把需要的东西写好。同时它还提供了将文档转换成 Python 对象、映射等的接口。
9. keras
深度学习是一个新的趋势,这正是『keras』的闪光点。它可以在 Theano 之上运行,并且可以利用多种神经网络架构进行快速实验。它极其精简而且高度模块化,可以在 CPU 和 GPU 上无缝运行。类似『keras』这样的库是我们在 2015 年处理 R&D 项目的关键所在。
10. gensim
如果你对 NLP(自然语言处理)很感兴趣却从没有听说过『Gensim』……火星人你好。它提供了一些常用算法,例如 tf-idf、word2vec、doc2vec、LSA 等的快速、可拓展(内存无关)实现,同时还提供了简单易用的借口和完善的文档。
福利:MonkeyLearn Python
最后,还有绝对不能错过的 MonkeyLearn。这是 Tryolabs 公司的产品的一个分支,它通过简单易用的 RESTFul API 提供了云端的文本挖掘功能。通过它你可以对文本有更加深入的理解,比如文字中的情感、最重要的关键字,可以对文本进行话题检测,还可以使用自定义的文本分类器进行其他任何处理。『MonkeyLearn Python』是这个 API 的官方 Python 客户端,同时支持 Python 2 和 Python 3。
第六,常用第三方库
一、Web框架
1.Django:
开源web开发框架,它鼓励快速开发,并遵循MVC设计,比较庞大,开发周期短。Django的文档最完善、市场占有率最高、招聘职位最多。全套的解决方案,Django象Rails一样,提供全套的解决方案(full-stack framework + batteries included),基本要什么有什么(比如:cache、session、feed、orm、geo、auth),而且全部Django自己造,开发网 站应手的工具Django基本都给你做好了,因此开发效率是不用说的,出了问题也算好找,不在你的代码里就在Django的源码里。
2.web.py: 轻量级Web框架,虽然简单但是功能强大。
3.Tornado:
Web服务器框架。Tornado即是一个Web server,同时又是一个类web.py的micro-framework,作为框架,Tornado的思想主要来源于Web.py,没有好的ORM,没有session支持(虽然官方做法是用cookie代替),WSGI支持不完整。但好处就是它用非阻塞的事件驱动开发,性能不错。并且自带WEB服务器,拿来学习一个非阻塞方式WEB服务器工作原理很适合。因为不用再去读nginx源码了。
4.Zope: 开源的Web应用服务器。
5.ActiveGrid: 企业级的Web2.0解决方案。
6.Karrigell: 简单的Web框架,自身包含了Web服务,py脚本引擎和纯python的数据库PyDBLite。
7.CherryPy: 基于Python的Web应用程序开发框架。
8.Pylons: 基于Python的一个极其高效和可靠的Web开发框架。
9.TurboGears: 基于Python的MVC风格的Web应用程序框架。
10.Twisted: 流行的网络编程库,大型Web框架。
11.Quixote:Web开发框架。
二、科学计算
1.Matplotlib: 用Python实现的类matlab的第三方库,用以绘制一些高质量的数学二维图形。
2.Scipy: 基于Python的matlab实现,旨在实现matlab的所有功能。
3.Numpy: 基于Python的科学计算第三方库,提供了许多高级的数值编程工具,如:矩阵数据类型、矢量处理,线性代数,傅立叶变换,以及精密的运算库。专为进行严格的数字处理而产生。
三、网页爬虫框架
scrapy: Python开发的一个快速,高层次的屏幕抓取和web抓取框架,用于抓取web站点并从页面中提取结构化的数据。Scrapy用途广泛,可以用于数据挖掘、监测和自动化测试。Scrapy吸引人的地方在于它是一个框架,任何人都可以根据需求方便的修改。它也提供了多种类型爬虫的基类,如BaseSpider、sitemap爬虫等,最新版本又提供了web2.0爬虫的支持。
是一个为遍历爬行网站、分解获取数据而设计的应用程序框架,它可以应用在广泛领域:数据挖掘、信息处理和或者历史片(历史记录)打包等等。
四、分布式网络框架
Twisted: 使用Python编写,强壮的、面向对象的解释性语言。Python使它的爱好者充满热情。使用Python编程是一种乐趣,易于编写、易于阅读、易于运行。因为Python是跨平台的,所以可以运行Twisted程序在Linux、Windows、Unix和MAC等等系统上。
Twisted包括大量的功能。Email、WEB、news、chat、DNS、SSH、Telnet、RPC、数据库存取或者更多。
五、游戏框架
Pygame: 基于Python的多媒体开发和游戏软件开发模块。跨平台 Python模块,专为电子游戏设计。包含图像、声音。建立在SDL基础上,允许实时电子游戏研发而无需被低级语言(如机器语言和汇编语言)束缚。基于这样一个设想,所有需要的游戏功能和理念都(主要是图像方面)都完全简化为游戏逻辑本身,所有的资源结构都可以由高级语言提供,如Python。
六、GUI
1.Tkinter: Python下标准的界面编程包,因此不算是第三方库了。
2.PyGtk: 基于Python的GUI程序开发GTK+库。
3.PyQt: 用于Python的QT开发库。
4.WxPython: Python下的GUI编程框架,与MFC的架构相似。
七、其他
1.BeautifulSoup: 基于Python的HTML/XML解析器,简单易用。
2.MySQLdb: 用于连接MySQL数据库。
3.Py2exe: 将python脚本转换为windows上可以独立运行的可执行程序。
4.pefile: Windows PE文件解析器。
5.PIL:基于Python的图像处理库,功能强大,对图形文件的格式支持广泛。
6.cElementTree: 高性能XML解析库,Py2.5应该已经包含了该模块,因此不算一个第三方库了。
Python著名的lib和开发框架(均为转载)