# 第八课 正则表达式
> 常用正则
```
<pre class="calibre14">```
<span class="token2">(</span><span class="token2">[</span>\s\S<span class="token2">]</span><span class="token">*</span><span class="token">?</span><span class="token2">)</span> 表示任意多个字符<span class="token2">,</span>换行也可以匹配
<span class="token2">(</span><span class="token2">[</span>\s<span class="token">*</span><span class="token2">]</span><span class="token">+</span><span class="token2">)</span> 匹配一个或多个空格
<span class="token2">(</span><span class="token2">[</span>\s<span class="token2">,</span><span class="token2">]</span><span class="token">+</span><span class="token2">)</span> 匹配多个空格或逗号
<span class="token2">(</span><span class="token2">[</span><span class="token2">,</span><span class="token2">]</span><span class="token">+</span><span class="token2">)</span> 匹配多个逗号
<span class="token">/</span>php<span class="token">/</span>i 不区分大小写
<span class="token">^</span> $ 匹配开始结束字符
<span class="token2">.</span> 匹配除换行以外字符串
<span class="token">?</span> <span class="token3">0</span>次 或 <span class="token3">1</span>次 等价<span class="token2">{</span><span class="token3">0</span><span class="token2">,</span><span class="token3">1</span><span class="token2">}</span>
<span class="token">*</span> <span class="token3">0</span>次 或 多次 等价<span class="token2">{</span><span class="token3">0</span><span class="token2">,</span><span class="token2">}</span>
<span class="token">+</span> <span class="token3">1</span>次 或 多次 等价<span class="token2">{</span><span class="token3">1</span><span class="token2">,</span><span class="token2">}</span>
<span class="token">-</span> 表示范围
<span class="token2">[</span><span class="token2">]</span> 开始结束字符类定义
\d 任意<span class="token3">10</span>进制数字 <span class="token2">[</span><span class="token3">0</span><span class="token">-</span><span class="token3">9</span><span class="token2">]</span>
\s 任意空白字符 单个
\S 任意非空白字符
\w 任意单词字符 等价<span class="token2">[</span>a<span class="token">-</span>zA<span class="token">-</span>Z0<span class="token">-</span><span class="token3">9</span><span class="token2">]</span>
<span class="token2">(</span><span class="token">?</span><span class="token2">:</span>中国<span class="token">|</span>美国<span class="token2">)</span><span class="token2">(</span><span class="token2">.</span><span class="token">*</span><span class="token2">)</span> 匹配中国<span class="token2">,</span>美国开头的字符串
<span class="token2">(</span>\d<span class="token">+</span>\<span class="token2">.</span>\d<span class="token">+</span>\<span class="token2">.</span>\d<span class="token">+</span>\<span class="token2">.</span>\d<span class="token">+</span><span class="token2">)</span> IP
<span class="token2">(</span><span class="token2">[</span>a<span class="token">-</span>zA<span class="token">-</span>Z<span class="token2">]</span><span class="token2">[</span>a<span class="token">-</span>zA<span class="token">-</span>Z0<span class="token">-</span><span class="token3">9</span>_<span class="token2">]</span><span class="token2">)</span> 匹配是否合法字母开头
<span class="token2">(</span>\d<span class="token">-</span>\d<span class="token">|</span>\d<span class="token">-</span>\d<span class="token2">)</span> 电话号码
<span class="token2">[</span><span class="token3">1</span><span class="token">-</span><span class="token3">9</span><span class="token2">]</span><span class="token2">[</span><span class="token3">0</span><span class="token">-</span><span class="token3">9</span><span class="token2">]</span> qq
<span class="token">^</span><span class="token2">[</span>\w\<span class="token2">.</span>\<span class="token">-</span><span class="token2">]</span><span class="token">+</span>@\w<span class="token">+</span><span class="token2">(</span><span class="token2">[</span>\<span class="token2">.</span>\<span class="token">-</span><span class="token2">]</span>\w<span class="token">+</span><span class="token2">)</span><span class="token">*</span>\<span class="token2">.</span>\w<span class="token">+</span>$ email
href<span class="token">=</span><span class="token4">"(.*?)"</span> 超链接
<span class="token">/</span><span class="token">^</span>\d<span class="token2">{</span><span class="token3">1</span><span class="token2">,</span><span class="token3">6</span><span class="token2">}</span>$<span class="token">/</span> 匹配<span class="token3">0</span><span class="token">-</span><span class="token3">999999</span>
<span class="token">/</span>\d<span class="token2">{</span><span class="token3">4</span><span class="token2">}</span>年\d<span class="token2">{</span><span class="token3">1</span><span class="token2">,</span><span class="token3">2</span><span class="token2">}</span>月\d<span class="token2">{</span><span class="token3">1</span><span class="token2">,</span><span class="token3">2</span><span class="token2">}</span><span class="token">/</span> 匹配年月日
```
```
> preg\_math 匹配一次,成功返回 true
```
<pre class="calibre14">```
<span class="token1">preg_match</span><span class="token2">(</span><span class="token4">"/\<center>([\s\S]*?)<\/center\>/"</span><span class="token2">,</span>$str<span class="token2">,</span>$rs<span class="token2">)</span><span class="token2">;</span>
```
```
> preg\_match\_all匹配多次,成功返回true
```
<pre class="calibre14">```
<span class="token1">preg_match_all</span><span class="token2">(</span><span class="token4">"/\<center>([\s\S]*?)<\/center\>/"</span><span class="token2">,</span>$str<span class="token2">,</span>$rs<span class="token2">)</span><span class="token2">;</span>
```
```
> preg\_replace 匹配替换,替换成$re
```
<pre class="calibre14">```
$rs <span class="token">=</span><span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/\<center>([\s\S]*?)<\/center\>/"</span><span class="token2">,</span>$re<span class="token2">,</span>$str<span class="token2">)</span><span class="token2">;</span>
```
```
> preg\_split分割成数组
```
<pre class="calibre14">```
$arr <span class="token">=</span> <span class="token1">preg_split</span><span class="token2">(</span><span class="token4">'/([\s*]+)/'</span><span class="token2">,</span><span class="token4">"a b c d ef"</span><span class="token2">)</span><span class="token2">;</span>
```
```
替换
```
<pre class="calibre14">```
$str <span class="token">=</span> <span class="token4">"选项[http://127.0.0.1/weixin/addons/yoby_diyform/weui/fm.jpg]你好"</span><span class="token2">;</span>
$str1 <span class="token">=</span> <span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/(?:\[)(.*?)(?:\])/i"</span><span class="token2">,</span> <span class="token4">"<img src=\"\${1}\" />"</span><span class="token2">,</span> $str<span class="token2">)</span><span class="token2">;</span>
<span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/.*\|(.*?)\|.*/i"</span><span class="token2">,</span> <span class="token4">"\${1}"</span><span class="token2">,</span> $v<span class="token2">)</span><span class="token2">;</span> 字符<span class="token">|</span><span class="token3">120000</span><span class="token">|</span>来了 输出<span class="token3">120000</span>
```
```
\\s+ 多个空白
\[^>\] >左边任意字符
.\*? 任意多个字符
\\d+ 匹配数字
```
<pre class="calibre14">```
<span class="token6">/*获取html并用正则处理*/</span>
<span class="token5">function</span> <span class="token1">get_content</span><span class="token2">(</span>$url<span class="token2">)</span><span class="token2">{</span>
$html <span class="token">=</span> <span class="token1">file_get_contents</span><span class="token2">(</span>$url<span class="token2">)</span><span class="token2">;</span>
$code<span class="token">=</span> <span class="token1">mb_detect_encoding</span><span class="token2">(</span>$html<span class="token2">,</span> <span class="token1">array</span><span class="token2">(</span><span class="token4">"GB2312"</span><span class="token2">,</span><span class="token4">"GBK"</span><span class="token2">,</span><span class="token4">'UTF-8'</span><span class="token2">,</span><span class="token4">'BIG5'</span><span class="token2">)</span><span class="token2">)</span><span class="token2">;</span><span class="token6">//获取编码</span>
<span class="token5">if</span><span class="token2">(</span>$code<span class="token">!=</span><span class="token4">"UTF-8"</span><span class="token2">)</span><span class="token2">{</span>
$htmls <span class="token">=</span> <span class="token1">mb_convert_encoding</span><span class="token2">(</span>$html<span class="token2">,</span> <span class="token4">"UTF-8"</span><span class="token2">,</span> $code<span class="token2">)</span><span class="token2">;</span><span class="token6">//转换内容为UTF-8编码</span>
<span class="token2">}</span><span class="token5">else</span><span class="token2">{</span>
$htmls <span class="token">=</span> $html<span class="token2">;</span>
<span class="token2">}</span>
$htmls <span class="token">=</span> <span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/<script[\s\S]*?<\/script>/i"</span><span class="token2">,</span><span class="token4">""</span><span class="token2">,</span>$htmls<span class="token2">,</span><span class="token">-</span><span class="token3">1</span><span class="token2">)</span><span class="token2">;</span><span class="token6">//去除script</span>
$htmls <span class="token">=</span> <span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/<noscript[\s\S]*?<\/noscript>/i"</span><span class="token2">,</span><span class="token4">""</span><span class="token2">,</span>$htmls<span class="token2">,</span><span class="token">-</span><span class="token3">1</span><span class="token2">)</span><span class="token2">;</span><span class="token6">//去除noscript</span>
$htmls<span class="token">=</span><span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/<(\/?link.*?)>/si"</span><span class="token2">,</span><span class="token4">""</span><span class="token2">,</span>$htmls<span class="token2">)</span><span class="token2">;</span><span class="token6">//去掉link</span>
$htmls<span class="token">=</span><span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/<(style.*?)>(.*?)<(\/style.*?)>/si"</span><span class="token2">,</span><span class="token4">""</span><span class="token2">,</span>$htmls<span class="token2">)</span><span class="token2">;</span><span class="token6">//去掉style</span>
$htmls <span class="token">=</span><span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/style=.+?['|\"]/i"</span><span class="token2">,</span><span class="token4">''</span><span class="token2">,</span>$htmls<span class="token2">,</span><span class="token">-</span><span class="token3">1</span><span class="token2">)</span><span class="token2">;</span><span class="token6">//去除style行内样式</span>
$htmls <span class="token">=</span><span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">'#<!--[^\!\[]*?(?<!\/\/)-->#'</span> <span class="token2">,</span> <span class="token4">''</span> <span class="token2">,</span> $htmls<span class="token2">)</span><span class="token2">;</span><span class="token6">//去掉html注释</span>
$htmls <span class="token">=</span> <span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/<a[^>]*>(.*?)<\/a>/is"</span><span class="token2">,</span> <span class="token4">"$1"</span><span class="token2">,</span> $htmls<span class="token2">)</span><span class="token2">;</span><span class="token6">//去除外站超链接</span>
$htmls <span class="token">=</span> <span class="token1">preg_replace</span><span class="token2">(</span><span class="token4">"/(\n\r)/i"</span><span class="token2">,</span> <span class="token4">''</span><span class="token2">,</span> $htmls<span class="token2">)</span><span class="token2">;</span> <span class="token6">//去掉空行</span>
<span class="token5">return</span> $htmls<span class="token2">;</span>
<span class="token2">}</span>
<span class="token1">preg_match</span><span class="token2">(</span><span class="token4">'/<div class="infoBox-list".*?>.*?<div class="news-page clearfix">/ism'</span><span class="token2">,</span> $htmls<span class="token2">,</span> $rs<span class="token2">)</span><span class="token2">;</span>
$htmls <span class="token">=</span> $rs<span class="token2">[</span><span class="token3">0</span><span class="token2">]</span><span class="token2">;</span><span class="token6">//获取两个class之间内容</span>
$url <span class="token">=</span> <span class="token2">(</span><span class="token1">preg_match</span><span class="token2">(</span><span class="token4">'/^http(s)?:\\/\\/.+/'</span><span class="token2">,</span>$url<span class="token2">)</span><span class="token2">)</span><span class="token">?</span>$url<span class="token2">:</span>"http<span class="token2">:</span><span class="token">/</span><span class="token">/</span>
"<span class="token2">.</span>$url<span class="token2">;</span><span class="token6">//判断是否包含https/http</span>
<span class="token1">preg_match</span><span class="token2">(</span><span class="token4">"/src=\"\/?(.*?)\"/"</span><span class="token2">,</span>$content<span class="token2">,</span>$match<span class="token2">)</span><span class="token2">;</span>
第一张图片
```
```
```
<pre class="calibre16">```
<span class="token2">[</span>\u4e00<span class="token">-</span>\u9fa5<span class="token2">]</span><span class="token2">{</span><span class="token3">0</span><span class="token2">,</span><span class="token2">}</span> 匹配中文
\d<span class="token">+</span> 匹配<span class="token">>=</span><span class="token3">0</span>数字
<span class="token2">[</span>a<span class="token">-</span>zA<span class="token">-</span>Z<span class="token2">]</span><span class="token">+</span> 不区分大小写<span class="token3">26</span>个字母
<span class="token2">[</span>A<span class="token">-</span>Za<span class="token">-</span>z0<span class="token">-</span><span class="token3">9</span><span class="token2">]</span><span class="token">+</span> 英文与数字
\s<span class="token">+</span> 多个空格
<span class="token2">[</span><span class="token3">0</span><span class="token">-</span><span class="token3">9</span><span class="token2">]</span><span class="token">*</span> 匹配一串数字
\d<span class="token2">{</span><span class="token3">4</span><span class="token2">}</span> 匹配四位数字
\d<span class="token2">{</span><span class="token3">5</span><span class="token2">,</span><span class="token2">}</span> 匹配至少<span class="token3">5</span>位数
\d<span class="token2">{</span><span class="token3">4</span><span class="token2">,</span><span class="token3">10</span><span class="token2">}</span> 匹配<span class="token3">4</span><span class="token">-</span><span class="token3">10</span>位数
```
```
- 简介
- 第一章 数据库
- Mysql/mariadb
- 函数
- 基础
- 增删改索引
- 标准查询
- 高级查询
- TIDB集群mysql解决方案
- Redis
- 语言基础
- 5种数据类型
- 其他类型
- Sqlite
- 语言基础
- 常用查询
- 第二章 PHP
- 语言基础
- 第一课 流程控制和运算
- 第二课 数组
- 第三课 日期时间
- 第四课 常用函数
- 第五课 字符串
- 第六课 文件操作
- 第七课 面向对象
- 第八课 正则表达式
- 第九课 图片处理生成
- 第十课 curl/memche
- 第十一课 mysql和pdo
- 第十三课 cookie和session
- 第十四课 xml操作
- 第十五课 php5.3+新特性
- 第十六课 php7+
- 第十七课 密码安全
- 废弃函数
- php命令行
- redis应用
- 算法
- 排序算法
- 基础算法
- 无限级分类
- 自定义函数Fn
- 查找算法
- 自定义函数数据函数fn
- laravel
- 路由
- 常用语句
- 数据库
- dingo/api
- Yii2
- 控制器
- 常用类
- 数据库
- redis
- thinkphp6
- TP6文档
- TP6插件
- dedecms
- 织梦标签大全
- 数据库操作
- 内置函数和定义函数
- 织梦核心改动
- 织梦插件/底层标签开发
- PHP相关工具
- composer
- php开发环境phpenv
- Phpstorm使用
- windows编译php扩展
- PHP开源库
- 开源项目管理禅道
- sns_auth
- php-casbin权限控制
- php-jwt
- 微信SDKeasywechat
- querylist采集库
- workerman
- Box/Spout处理excel和csv
- dll扩展
- redis/memche/xdebug
- redis
- Lua
- php_xlswriter
- event
- swoole
- 常用代码库
- 微擎框架
- 第一课全局变量
- 第二课常用函数
- 第三课自定义微擎独有函数
- 第四课数据库操作
- 第五课微信端回复
- 第六课微擎高级操作
- 第八课global函数列表
- mainfest.xml详解
- js方法
- 人人商城
- 第一课model解读
- 第二课常用语句解读
- 第三课常用js解读
- 第四课附录常见问题
- 第五课附录处理报表|支付
- 常用JSON状态码
- 第三章 JavaScript
- js基础
- 浏览器对象
- 语言基础
- html5接口
- ES6新语法
- vue
- 基础语法
- 京东vueUI组件
- uniapp
- 组件开发规范
- nodejs
- 基础知识
- 安装node
- nvm不同版本node切换
- js常用标准库
- zepto/jquery
- weui
- js图标库
- elementUI
- validator表单验证
- layer弹出层
- requirejs
- wow动画
- 动画animate
- swiper4
- 百度编辑器
- flyio/axios/qs
- jquery.form
- bootstrap3
- clipboard复制
- slideout侧滑
- imagehover.css图片悬停动画
- webpack打包
- Bulma UI框架
- store 客户端存储
- lottie动画创建库
- sweetalert
- js自定义函数
- 常见JSSDK
- 微信公众号JSSDK
- 腾讯地图jssdk
- 微信小程序
- 第四章 编程语言
- markdown语言
- Dart语言
- Dart语言基础
- Flutter框架
- Lua语言
- 字符串,数组,表
- 自定义方法
- go语言
- 第1.1语言基本语法
- 第1.2流程控制
- 第1.3函数
- 第1.4结构体
- 第1.5接口
- 第1.6包
- go语言框架Gin
- CSS3语言
- CSS与CSS3
- 选择符
- 属性
- css3
- loading动画
- HTML5语言
- less
- sass
- C#
- 基础知识
- 函数
- 第五章 开发工具
- git
- nginx/apache服务器
- Linux常用操作
- crontab定时任务
- 注册表与cmd
- 阿里云ECS
- frp穿透和ssl续期
- 宝塔安装
- 树莓派
- 浏览器模拟
- 火狐/chrome常用插件
- WSL安装使用
- mac brew和终端命令
- win10相关