如果我们需要获取一个网页的源码,并从中获取一些自己需要的信息,比如标签,或者
标签当中的所有内容和的时候。我们就必须要截取,然而php截取函数,相当的操蛋,原谅博主是个新手,至少我找了很久,并没有发现有向左截取字符串的函数,像别人的答案substr()配合获取字符串位置的函数,然后截取。我只能说试过,并不好用。然后还有些七七八八的办法,统统不实用,麻烦。接下来我就要说一种方法了,相当的bug,相当的好用。
$url = "http://www.texnet.com.cn/product"; $result = file_get_contents($url); $str = intercept_str($result, '粘上你要开始截取的字符串比如:', '粘上结束时刻的字符串比如:
', 1); //需要先去除所有的换行符、制表符、回车等等,html源文件由于上述符号的存在会造成无法匹配。 $hre = preg_replace("/[\t\n\r]+/","",$str); $reg='/.*?<\/a>/';//这个正则可以自己发挥,博主只是随便放了个匹配标签的 preg_match_all($reg,$hre,$aarray);
//然后就可以打印了呀
dump($aarray);
直接看结果:
array(1) {
[0] => array(96) {
[0] => string(195) " 桐乡市锦翔纺织有限公司 "
[1] => string(115) " 绢丝 "
[2] => string(113) " 绢丝 "
[3] => string(131) " 羊毛混纺纱 "
[4] => string(131) " 绢丝混纺纱 "
[5] => string(119) " 毛腈纱 "
[6] => string(122) "http://jinxiang.cn.texnet.com.cn"
[7] => string(130) "@@##@@"
[8] => string(205) " 桐乡市海通毛绒服饰有限公司 "
[9] => string(111) " 粗纱 "
[10] => string(117) " 粗纺纱 "
[11] => string(135) " 粗纺羊仔纱线 "
[12] => string(117) " 丝羊绒 "
[13] => string(117) " 羊绒纱 "
[14] => string(118) "http://zjhtfs.cn.texnet.com.cn"
[15] => string(129) "@@##@@"
[16] => string(193) " 桐乡市新拓针织有限公司 "
[17] => string(117) " 卫衣布 "
[18] => string(117) " 烂花布 "
[19] => string(117) " 烫金布 "
[20] => string(123) " 彩条汗布 "
[21] => string(111) " 汗布 "
[22] => string(118) "http://xtknit.cn.texnet.com.cn"
[23] => string(129) "@@##@@"
[24] => string(193) " 桐乡市玉溪针织有限公司 "
[25] => string(117) " 印花布 "
[26] => string(135) " 人棉氨纶汗布 "
[27] => string(111) " 汗布 "
[28] => string(141) " 人棉氨纶印花布 "
[29] => string(116) " 罗马布 "
[30] => string(118) "http://zjyuxi.cn.texnet.com.cn"
[31] => string(130) "@@##@@"
[32] => string(192) " 桐乡市洲泉众力丝织厂 "
[33] => string(129) " 亚麻彩条 "
[34] => string(129) " 亚麻素色 "
[35] => string(147) " 亚麻烫金针织布 "
[36] => string(129) " 亚麻印花 "
[37] => string(141) " 人棉竹节汗布 "
[38] => string(128) "http://zhonglisilk.cn.texnet.com.cn"
[39] => string(130) "@@##@@"
[40] => string(207) " 桐乡市清瀛家居用品有限公司 "
[41] => string(112) " 靠垫 "
[42] => string(118) " 三件套 "
[43] => string(113) " 被子 "
[44] => string(113) " 毯子 "
[45] => string(113) " 垫子 "
[46] => string(122) "http://zjqinghe.cn.texnet.com.cn"
[47] => string(130) "@@##@@"
[48] => string(203) " 桐乡市华灵丝绸有限责任公司 "
[49] => string(110) " 印花 "
[50] => string(122) " 长毛植绒 "
[51] => string(122) " 鞋材面料 "
[52] => string(122) " 服装面料 "
[53] => string(116) " 仿羊绒 "
[54] => string(114) "http://hlzr.cn.texnet.com.cn"
[55] => string(130) "@@##@@"
[56] => string(198) " 桐乡市晨晖丝绸有限公司 "
[57] => string(135) " 大提花面料 "
[58] => string(135) " 人丝美丽绸 "
[59] => string(135) " 涤丝斜纹绸 "
[60] => string(129) " 装饰里布 "
[61] => string(122) " 小提花 "
[62] => string(128) "http://chenhuisilk.cn.texnet.com.cn"
[63] => string(130) "@@##@@"
[64] => string(193) " 桐乡市嘉鸿针织有限公司 "
[65] => string(152) " 彩条丝光棉珠地网眼 "
[66] => string(140) " 烫银纯亚麻汗布 "
[67] => string(146) " 双丝光棉彩条罗纹 "
[68] => string(134) " 彩条氨纶汗布 "
[69] => string(116) " 罗马布 "
[70] => string(118) "http://jhknit.cn.texnet.com.cn"
[71] => string(130) "@@##@@"
[72] => string(198) " 桐乡市康泰织造有限公司 "
[73] => string(122) " 丝绵绸 "
[74] => string(134) " 色织双宫绸 "
[75] => string(116) " 真丝 "
[76] => string(134) " 涤丝双宫绸 "
[77] => string(134) " 缎染双宫绸 "
[78] => string(128) "http://kangtaisilk.cn.texnet.com.cn"
[79] => string(131) "@@##@@"
[80] => string(205) " 桐乡市博阳丝绸织造有限公司 "
[81] => string(111) " 丝麻 "
[82] => string(111) " 围巾 "
[83] => string(123) " 羊绒围巾 "
[84] => string(129) " 丝羊绒围巾 "
[85] => string(123) " 真丝围巾 "
[86] => string(118) "http://bysilk.cn.texnet.com.cn"
[87] => string(132) "@@##@@"
[88] => string(203) " 桐乡市华方纺织品有限公司 "
[89] => string(122) " 毛圈布 "
[90] => string(122) " 双面布 "
[91] => string(116) " 汗布 "
[92] => string(128) " 人棉汗布 "
[93] => string(140) " 全棉竹节毛圈 "
[94] => string(126) "http://huafangtex.cn.texnet.com.cn"
[95] => string(132) "@@##@@"
}
}是不是很简单,很暴力,很直接。











