最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

javascript - JS RegExp to remove all HTML tags and their content? - Stack Overflow

programmeradmin0浏览0评论

For example, I need to get from this:

Before Bold <b>In Bold</b> After Bold

To get:

Before Bold After Bold.

I tried:

string.replace(/<.*>.*<\/.*>/,'')

But it don't work as expected.

For example, I need to get from this:

Before Bold <b>In Bold</b> After Bold

To get:

Before Bold After Bold.

I tried:

string.replace(/<.*>.*<\/.*>/,'')

But it don't work as expected.

Share Improve this question asked Feb 28, 2013 at 16:09 ShluchShluch 4534 silver badges13 bronze badges 6
  • You are using greedy regexp – tckmn Commented Feb 28, 2013 at 16:11
  • The result has two spaces in between "Before Bold" and "After Bold". This is because you only removed the html tag. This is exactly what I would expect. – benekastah Commented Feb 28, 2013 at 16:12
  • 2 /(<([^>]+)>)/ig - Fiddle... I'd prefer VisioN's method over regex. – adeneo Commented Feb 28, 2013 at 16:12
  • I need to remove the tags, and the innerHTML of all the tags. What adeneo show, only remove the tags. I am don't search for best solution, only something that will work in most scenarios – Shluch Commented Feb 28, 2013 at 16:17
  • @Teemu Yeah, because it didn't remove the <b> tag with its contents. – VisioN Commented Feb 28, 2013 at 16:17
 |  Show 1 more ment

3 Answers 3

Reset to default 9

Try this:

string.replace(/<([^>]+?)([^>]*?)>(.*?)<\/\1>/ig, "")

It worked for me.

See it working here

var div = document.createElement("div"),
    result = "",
    child;

div.innerHTML = str;
child = div.firstChild;

do {
    if (child.nodeType === 3) {
        result += child.nodeValue;
    }
} while (child = child.nextSibling);

console.log(result);

I’m not sure about regex, but using jQuery you can easily just remove the children and return the HTML using a classic one-liner:

string = $('<div>').html(string).children().remove().end().html();
发布评论

评论列表(0)

  1. 暂无评论