最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

javascript - regex match unique results - Stack Overflow

programmeradmin1浏览0评论

I have the following situation, I'm searching in a HTML string for the attributes.

I have the following regex which works alright, but I want to get just unique results, of course I can apply some filter to the results array but I think this is achievable with pure regex.

So in this situation class is returned twice, but I only want 1 time:

['class', 'data-text'] not ['class', 'data-text', 'class']

const html = `<div class="foo">
    <span data-text="Some string" class="bar"></span>
</div>`

console.log(html.match(/[\w-:]+(?=\s*=\s*".*?")/g))

I have the following situation, I'm searching in a HTML string for the attributes.

I have the following regex which works alright, but I want to get just unique results, of course I can apply some filter to the results array but I think this is achievable with pure regex.

https://regex101./r/UqCuJS/1

So in this situation class is returned twice, but I only want 1 time:

['class', 'data-text'] not ['class', 'data-text', 'class']

const html = `<div class="foo">
    <span data-text="Some string" class="bar"></span>
</div>`

console.log(html.match(/[\w-:]+(?=\s*=\s*".*?")/g))

http://jsbin./bekibanisa/edit?js,console

Share Improve this question edited Dec 31, 2023 at 18:28 Jason Aller 3,65228 gold badges41 silver badges39 bronze badges asked Apr 30, 2017 at 15:12 AndersonAnderson 3411 gold badge5 silver badges18 bronze badges 6
  • Can you include full javascript tried at Question? Are you using .split() or .match() to get array of matches? – guest271314 Commented Apr 30, 2017 at 15:15
  • jsbin./bekibanisa/edit?js,console check this out, .match – Anderson Commented Apr 30, 2017 at 15:16
  • 2 See jsbin./yofozabilu/1/edit?js,console – Wiktor Stribiżew Commented Apr 30, 2017 at 15:19
  • @WiktorStribiżew your answer seems right, can you post it with an explanation of what is happening? – Anderson Commented Apr 30, 2017 at 15:21
  • 1 Yes, the greedy patterns with a construct matching any char, any number of times, will cause much backtracking. Use /[\w-:]+(?=\s*=\s*"[^"]*")/g and use .filter(). – Wiktor Stribiżew Commented Apr 30, 2017 at 15:49
 |  Show 1 more ment

2 Answers 2

Reset to default 10

You can pass result of .match() to Set, which does not allow duplicate values. If necessary convert Set instance back to Array.

const html = `<div class="foo">
	<span data-text="Some string" class="bar"></span>
</div>`
// or use existing `RegExp`
console.log([...new Set(html.match(/([\w-]+)(?=[=]")/g))])

Try removing the '/g' global modifier

console.log(html.match(/[\w-:]+(?=\s*=\s*".*?")/))
发布评论

评论列表(0)

  1. 暂无评论
ok 不同模板 switch ($forum['model']) { /*case '0': include _include(APP_PATH . 'view/htm/read.htm'); break;*/ default: include _include(theme_load('read', $fid)); break; } } break; case '10': // 主题外链 / thread external link http_location(htmlspecialchars_decode(trim($thread['description']))); break; case '11': // 单页 / single page $attachlist = array(); $imagelist = array(); $thread['filelist'] = array(); $threadlist = NULL; $thread['files'] > 0 and list($attachlist, $imagelist, $thread['filelist']) = well_attach_find_by_tid($tid); $data = data_read_cache($tid); empty($data) and message(-1, lang('data_malformation')); $tidlist = $forum['threads'] ? page_find_by_fid($fid, $page, $pagesize) : NULL; if ($tidlist) { $tidarr = arrlist_values($tidlist, 'tid'); $threadlist = well_thread_find($tidarr, $pagesize); // 按之前tidlist排序 $threadlist = array2_sort_key($threadlist, $tidlist, 'tid'); } $allowpost = forum_access_user($fid, $gid, 'allowpost'); $allowupdate = forum_access_mod($fid, $gid, 'allowupdate'); $allowdelete = forum_access_mod($fid, $gid, 'allowdelete'); $access = array('allowpost' => $allowpost, 'allowupdate' => $allowupdate, 'allowdelete' => $allowdelete); $header['title'] = $thread['subject']; $header['mobile_link'] = $thread['url']; $header['keywords'] = $thread['keyword'] ? $thread['keyword'] : $thread['subject']; $header['description'] = $thread['description'] ? $thread['description'] : $thread['brief']; $_SESSION['fid'] = $fid; if ($ajax) { empty($conf['api_on']) and message(0, lang('closed')); $apilist['header'] = $header; $apilist['extra'] = $extra; $apilist['access'] = $access; $apilist['thread'] = well_thread_safe_info($thread); $apilist['thread_data'] = $data; $apilist['forum'] = $forum; $apilist['imagelist'] = $imagelist; $apilist['filelist'] = $thread['filelist']; $apilist['threadlist'] = $threadlist; message(0, $apilist); } else { include _include(theme_load('single_page', $fid)); } break; default: message(-1, lang('data_malformation')); break; } ?>