您的位置：首页 > 数据库 > Redis

django使用celery定时任务，使用redis和supervisor。

2017-03-07 01:31 836 查看

前言

在django项目中，使用celery异步执行任务，以及创建定时任务。

使用redis作为中间件，用supervisor管理进程。

django项目代码

http://download.csdn.net/detail/win_turn/9772404

Begin

当前路径

创建虚拟环境

有关virtualenv的讲解，可查看

http://blog.csdn.net/win_turn/article/details/60466451

virtualenv win

进入虚拟环境

source win/bin/activate

安装package

建议先配置一下pip的国内镜像

http://blog.csdn.net/win_turn/article/details/59733715

pip install -r requirements.txt

#文件requirements.txt的内容
amqp==2.1.4
beautifulsoup4==4.5.3
billiard==3.5.0.2
celery==4.0.2
Django==1.10.6
kombu==4.0.2
lxml==3.7.3
pytz==2016.10
redis==2.10.5
requests==2.13.0
vine==1.1.3

创建django项目和应用

django-admin startproject pro
cd pro/
django-admin startapp wechat

当前目录结构

配置django

vi pro/settings.py

INSTALLED_APPS中添加新增的app

在文件末尾添加celery配置

CELERYD_LOG_FILE

是celery worker的日志文件

CELERYBEAT_LOG_FILE

是celery beat的日志文件

2017年3月20日更新

今天进行数据迁移时发现个问题，一直没有celery日志文件。

后来才发现，原来

CELERYD_LOG_FILE

和

CELERYBEAT_LOG_FILE

已经在4.0版本中不再支持了。改用参数

-f

指定日志文件。

创建init_celery.py

vi pro/init_celery.py

修改init.py

为了确保在django启动时加载了celery应用，修改init.py文件

vi pro/__init__.py

2017年10月2日增加

将下图中第5行的from .celery import … 改为 from .init_celery import …

当前目录结构

测试celery

在redis服务启动的情况下，测试celery worker已经准备好接收django的任务

关于redis的安装和启动，以及使用supervisor管理redis进程，请参考：

http://blog.csdn.net/win_turn/article/details/60615729

celery worker -A pro -l info

使用Ctrl+c关闭，然后再测试celery beat是否已经准备好。

celery beat -A pro -l info

测试通过，说明django项目已经配置好了celery。

做好django项目的准备工作

接下来，我们在wechat的视图中，简单的写一个业务逻辑，在用户访问时，及时返回给用户。

当前的业务场景是：访问

192.168.139.129:7777/money/

时，显示用户wangteng的账户余额。

描述待实现功能

希望实现的功能是：

1：当访问192.168.139.127:7777/money/后，系统访问http://www.500.com/pages/info/zhongjiang/网站，获取一等奖每注奖金金额。10秒钟后，访问http://www.500.com/pages/info/zhongjiang/dlt.php，获取二等奖每注奖金金额。将这两个数相加，存入数据库，保存为用户wangteng的余额。（该值没有任何实际意义，只是想创建一个很费时的任务。）

2：每两分钟，打印一次当前时间（模拟定时任务）

创建tasks.py

先在wechat下创建文件

tasks.py

，实现该功能。

vi wechat/tasks.py

在视图views.py中调用tasks.py中的任务时，需要使用

task_name.delay(args)

的方式。

参考文档

有关第33行代码中crontab的用法：

http://docs.celeryproject.org/en/master/reference/celery.schedules.html#celery.schedules.crontab

如果第35行代码ignore_result=True表示不存储任务状态，意味着你不能使用AsyncResult来检查任务是否就绪，或者获取它的返回值。

http://docs.celeryproject.org/en/latest/userguide/tasks.html#Task.ignore_result

http://docs.celeryproject.org/en/latest/userguide/configuration.html#std:setting-task_ignore_result

使用supervisor

使用supervisor管理celery worker程序。

进程管理工具的介绍，可以参考：

http://blog.csdn.net/win_turn/article/details/60466562

sudo vi /etc/supervisord.conf

注意：上面截图中，对redis的设置有误。

command=/etc/redis/redis_init_script start

不可以使用上面这种方式启动redis进程，用这种方式，supervisor监控的是脚本redis_init_script，而不是redis

应该使用下面这个命令启动redis

command=/usr/local/bin/redis-server /etc/redis/6379.conf

启动

celery worker

时的参数说明

Options:
-A APP, --app=APP     app instance to use (e.g. module.attr_name)
-b BROKER, --broker=BROKER
url to broker.  default is 'amqp://guest@localhost//'
--loader=LOADER       name of custom loader class to use.
--config=CONFIG       Name of the configuration module
--workdir=WORKING_DIRECTORY
Optional directory to change to after detaching.
-C, --no-color
-q, --quiet
-c CONCURRENCY, --concurrency=CONCURRENCY #worker的进程数，默认是CPU数量
Number of child processes processing the queue. The
default is the number of CPUs available on your
system.
-P POOL_CLS, --pool=POOL_CLS
Pool implementation: prefork (default), eventlet,
gevent, solo or threads.
--purge, --discard    Purges all waiting tasks before the daemon is started.
**WARNING**: This is unrecoverable, and the tasks will
be deleted from the messaging server.
-l LOGLEVEL, --loglevel=LOGLEVEL
Logging level, choose between DEBUG, INFO, WARNING,
ERROR, CRITICAL, or FATAL.
-n HOSTNAME, --hostname=HOSTNAME
Set custom hostname, e.g. 'w1.%h'. Expands: %h
(hostname), %n (name) and %d, (domain).
-B, --beat            Also run the celery beat periodic task scheduler.
Please note that there must only be one instance of
this service.
-s SCHEDULE_FILENAME, --schedule=SCHEDULE_FILENAME
Path to the schedule database if running with the -B
option. Defaults to celerybeat-schedule. The extension
".db" may be appended to the filename. Apply
optimization profile.  Supported: default, fair
--scheduler=SCHEDULER_CLS
Scheduler class to use. Default is
celery.beat.PersistentScheduler
-S STATE_DB, --statedb=STATE_DB
Path to the state database. The extension '.db' may be
appended to the filename. Default: None
-E, --events          Send events that can be captured by monitors like
celery events, celerymon, and others.
--time-limit=TASK_TIME_LIMIT
Enables a hard time limit (in seconds int/float) for
tasks.
--soft-time-limit=TASK_SOFT_TIME_LIMIT
Enables a soft time limit (in seconds int/float) for
tasks.
--maxtasksperchild=MAX_TASKS_PER_CHILD
Maximum number of tasks a pool worker can execute
before it's terminated and replaced by a new worker.
-Q QUEUES, --queues=QUEUES
List of queues to enable for this worker, separated by
comma. By default all configured queues are enabled.
Example: -Q video,image
-X EXCLUDE_QUEUES, --exclude-queues=EXCLUDE_QUEUES
-I INCLUDE, --include=INCLUDE
Comma separated list of additional modules to import.
Example: -I foo.tasks,bar.tasks
--autoscale=AUTOSCALE
Enable autoscaling by providing max_concurrency,
min_concurrency. Example:: --autoscale=10,3 (always
keep 3 processes, but grow to 10 if necessary)
--autoreload          Enable autoreloading.
--no-execv            Don't do execv after multiprocessing child fork.
--without-gossip      Do not subscribe to other workers events.
--without-mingle      Do not synchronize with other workers at startup.
--without-heartbeat   Do not send event heartbeats.
--heartbeat-interval=HEARTBEAT_INTERVAL
Interval in seconds at which to send worker heartbeat
-O OPTIMIZATION
-D, --detach
-f LOGFILE, --logfile=LOGFILE
Path to log file. If no logfile is specified, stderr
is used.
--pidfile=PIDFILE     Optional file used to store the process pid. The
program will not start if this file already exists and
the pid is still alive.
--uid=UID             User id, or user name of the user to run as after
detaching.
--gid=GID             Group id, or group name of the main group to change to
after detaching.
--umask=UMASK         Effective umask (in octal) of the process after
detaching.  Inherits the umask of the parent process
by default.
--executable=EXECUTABLE
Executable to use for the detached process.
--version             show program's version number and exit
-h, --help            show this help message and exit

启动

celerybeat

时的参数说明

在查看这个说明的时候，才发现

celerybeat

已经不建议使用了。应该使用

celery beat

（中间有个空格）

Usage: celery beat [options]

Start the beat periodic task scheduler.

Examples::

celery beat -l info
celery beat -s /var/run/celery/beat-schedule --detach
celery beat -S djcelery.schedulers.DatabaseScheduler

Options:
-A APP, --app=APP     app instance to use (e.g. module.attr_name)
-b BROKER, --broker=BROKER
url to broker.  default is 'amqp://guest@localhost//'
--loader=LOADER       name of custom loader class to use.
--config=CONFIG       Name of the configuration module
--workdir=WORKING_DIRECTORY
Optional directory to change to after detaching.
-C, --no-color
-q, --quiet
--detach
-s SCHEDULE, --schedule=SCHEDULE
--max-interval=MAX_INTERVAL
-S SCHEDULER_CLS, --scheduler=SCHEDULER_CLS
-l LOGLEVEL, --loglevel=LOGLEVEL
-f LOGFILE, --logfile=LOGFILE
Path to log file. If no logfile is specified, stderr
is used.
--pidfile=PIDFILE     Optional file used to store the process pid. The
program will not start if this file already exists and
the pid is still alive.
--uid=UID             User id, or user name of the user to run as after
detaching.
--gid=GID             Group id, or group name of the main group to change to
after detaching.
--umask=UMASK         Effective umask (in octal) of the process after
detaching.  Inherits the umask of the parent process
by default.
--executable=EXECUTABLE
Executable to use for the detached process.
--version             show program's version number and exit
-h, --help            show this help message and exit

创建日志文件

这两个日志文件，是supervisor的在执行命令期间，屏幕输出的日志。并非celery worker和celery beat的日志。

touch /var/log/supervisor/procelery.log /var/log/supervisor/procelerybeat.log

重启supervisor

sudo supervisorctl update

运行django项目

python manage.py runserver 0.0.0.0:7777

在浏览器输入

192.168.139.129:7777/money/

，浏览器能够立马接收到django返回的数据，并不需要等待。

查看celery日志

查看celery beat和celery worker的日志

celery beat的作用，就是按照规定的时间，将“开始运行任务”的指令发送给celery worker。因此，该日志文件中，记录的是派送任务的时间以及接受任务的任务名称。

celery0.log日志文件中，记录的是celery层面的信息，包括任务的执行时间、结束时间、所用时长、返回值等信息。

celery1.log日志文件中，记录的是task任务过程中的信息，包括运行过程中，时间时间、输出的print和info、任务名称等信息。

这个文件名中的数字

表示第几个worker进程。还记得才supervisord.conf中启动celery worker时的参数-c吗？如果启动worker时，参数

-c 3

，那么这里将会有

celery1.log

、

celery2.log

和

celery3.log

三个文件。

这个日志文件名的定义，是在

settings.py

中设置的。

查看redis

在日志中，有一串不规则的字符串，这个是每个任务的id。可以通过这个id，在redis中查询到该任务的信息。

例如查询任务id为

1b68e5cb-21e1-4cdc-9cf6-bdbbfe7785b6

任务的信息。

redis-cli

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签： django redis 定时任务 celery 延迟执行

相关文章推荐

新的分享

章节导航