Swift截取HTML中的所有图片url

在Swift中, 要从HTML格式的String中截取出所有的img的所有图片url,要截取的url就要匹配url,需要用到正则表达式。

  • Swift使用正则截取字符串的方法:
extension String {

    /**
     根据 正则表达式 截取字符串

     - parameter regex: 正则表达式

     - returns: 字符串数组
     */
    public func matchesForRegex(regex: String) -> [String]? {

        do {
            let regularExpression = try NSRegularExpression(pattern: regex, options: [])
            let range = NSMakeRange(0, self.characters.count)
            let results = regularExpression.matchesInString(self, options: [], range: range)
            let string = self as NSString
            return results.map { string.substringWithRange($0.range)}
        } catch {
            return nil
        }
    }
}
  • 匹配img标签的正则表达式为:
<img\b(?=\s)(?=(?:[^>=]|='[^']*'|="[^"]*"|=[^'"][^\s>]*)*?\ssrc=['"]([^"]*)['"]?)(?:[^>=]|='[^']*'|="[^"]*"|=[^'"\s]*)*"\s?\/?>
  • 在swift中的正则表达式书写为:
let regex = "<img\\b(?=\\s)(?=(?:[^>=]|='[^']*'|=\"[^\"]*\"|=[^'\"][^\\s>]*)*?\\ssrc=['\"]([^\"]*)['\"]?)(?:[^>=]|='[^']*'|=\"[^\"]*\"|=[^'\"\\s]*)*\"\\s?\\/?>"
  • 使用案例:
let string = "图片1<div><img src=\"http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg\" alt=\"Gravatar\"><br></div><div>图片2</div><div><img src=\"http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg\" alt=\"Gravatar\"><br></div><div><br></div><div>图片3</div><div><img src=\"http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg\" alt=\"Gravatar\"><br></div><div><br></div>"

// 截取img标签
let resultItems = string.matchesForRegex(regex)
  • 截取得到的resultItems为:
 ["<img src=\"http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg\" alt=\"Gravatar\">", 
 "<img src=\"http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg\" alt=\"Gravatar\">",
  "<img src=\"http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg\" alt=\"Gravatar\">"]
  • 如果单单只是截取img的url地址,使用的正则表达式为:
(http[^\s]+(jpg|jpeg|png|tiff)\b)
  • 在Swift中书写为:
let regex = "(http[^\\s]+(jpg|jpeg|png|tiff)\\b)"
  • 使用案例:
// 截取所有img url
let resultItems = string.matchesForRegex(regex)
  • 结果为:

[“http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg“, “http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg“, “http://cms-bucket.nosdn.127.net/catchpic/C/C7/C7643B52B98D04979AA173C6D6219958.jpg“]

猜你喜欢

转载自blog.csdn.net/h1101723183/article/details/52134472
今日推荐