
regex - Javascript: Splitting a string by comma but ignoring commas in quotes - Stack Overflow


I have a string like the following:

var str="A,B,C,E,'F,G,bb',H,'I9,I8',J,K"

I'd like to split the string on commas. However, commas inside single quotation marks should be ignored, so that the result is the following:

 A
 B
 C
 E
 F,G,bb
 H
 I9,I8
 J
 K

Asked May 16, 2012 at 11:15 by Augustian Joseph; edited Apr 9, 2017 at 19:19 by ROMANIA_engineer.

Comment (stema, May 16, 2012 at 11:25): possible duplicate of "Javascript code to parse CSV data".

3 Answers

> str.match(/('[^']+'|[^,]+)/g)
["A", "B", "C", "E", "'F,G,bb'", "H", "'I9,I8'", "J", "K"]

Though this is what you requested, you may not have accounted for corner cases such as:

  • 'bob\'s' is a string where ' is escaped
  • a,',c
  • a,,b
  • a,b,
  • ,a,b
  • a,b,'
  • ',a,b
  • ',a,b,c,'

Some of the above are handled correctly by this; others are not. I highly recommend that people use a library that has thought this through, to avoid things such as security vulnerabilities or subtle bugs, now or in the future (if you expand your code, or if other people use it).
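
For readers who cannot pull in a library, the corner cases above can at least be made explicit with a small character-by-character scanner. This is only a sketch under stated assumptions, not part of the original answer: the name splitQuoted is hypothetical, a backslash is assumed to escape the next character, and an unterminated quote is assumed to swallow the rest of the input.

```javascript
// Hypothetical helper: split on commas that are outside single quotes.
// Assumption: a backslash escapes the character that follows it.
// Assumption: an unterminated quote extends to the end of the input.
function splitQuoted(input) {
  const fields = [];
  let current = "";
  let inQuotes = false;
  for (let i = 0; i < input.length; i++) {
    const ch = input[i];
    if (ch === "\\" && i + 1 < input.length) {
      current += input[++i];   // keep the escaped character literally
    } else if (ch === "'") {
      inQuotes = !inQuotes;    // toggle quote state and drop the quote itself
    } else if (ch === "," && !inQuotes) {
      fields.push(current);    // a comma outside quotes ends the field
      current = "";
    } else {
      current += ch;
    }
  }
  fields.push(current);        // the last field, which may be empty
  return fields;
}

console.log(splitQuoted("A,B,C,E,'F,G,bb',H,'I9,I8',J,K"));
// → ["A", "B", "C", "E", "F,G,bb", "H", "I9,I8", "J", "K"]
console.log(splitQuoted("a,,b")); // → ["a", "", "b"]
console.log(splitQuoted("a,b,")); // → ["a", "b", ""]
```

Unlike the regular expression, this version keeps empty fields and removes the surrounding quotes, but it is still not a substitute for a tested CSV parser.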


Explanation of the RegEx:

  • ('[^']+'|[^,]+) - means match either '[^']+' or [^,]+
  • '[^']+' means quote...one-or-more non-quotes...quote.
  • [^,]+ means one-or-more non-commas

Note: by consuming the quoted string before the unquoted string, we make the parsing of the unquoted string case easier.
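
The match above keeps the surrounding quotes ('F,G,bb'), while the question asks for F,G,bb. One possible follow-up step, shown here as a sketch (the function name splitAndUnquote is hypothetical), is to strip one pair of enclosing single quotes from each matched field:

```javascript
// Hypothetical helper: apply the regex from the answer, then remove the
// enclosing single quotes from fields that are fully quoted.
function splitAndUnquote(str) {
  // match() returns null when nothing matches, so fall back to an empty array
  const fields = str.match(/('[^']+'|[^,]+)/g) || [];
  return fields.map(function (field) {
    return field.replace(/^'([^']*)'$/, "$1");
  });
}

console.log(splitAndUnquote("A,B,C,E,'F,G,bb',H,'I9,I8',J,K"));
// → ["A", "B", "C", "E", "F,G,bb", "H", "I9,I8", "J", "K"]
```

Because [^,]+ needs at least one character, empty fields such as the one in a,,b are still skipped by this approach.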

Here is my version, which works with single and double quotes and allows multiple quoted strings with embedded commas. It produces empty results, and too many of them, so you have to check for that. Not rigorously tested. Please excuse the overuse of '\'.

var sample='this=that, \
sometext with quoted ",", \
for example, \
another \'with some, quoted text, and more\',\
last,\
but "" "," "asdf,asdf" not "fff\',\'  fff" the least';

var it=sample.match(/([^\"\',]*((\'[^\']*\')*||(\"[^\"]*\")*))+/gm);
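for (var x=0;x<it.length;x++) {
    var txt=it[x].trim(); // built-in String.prototype.trim, so jQuery's $.trim is not needed
    if(txt.length)        // skip the empty matches the pattern produces
        console.log(">"+txt+'<');
}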

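Use this:

            var input="A,B,C,E,'F,G,bb',H,'I9,I8',J,K";
            // The pattern below does not treat a comma (,) between single quotes as a separator, so 'I9,I8' is kept as a single string and is not split on its comma.
            // It has to be a regex literal (a string passed to split() is matched literally), and the group is non-capturing so that split() does not add captured text to the result.
            var pattern = /,(?=(?:[^']*'[^']*')*[^']*$)/;
            // The actual output is returned as an array
            var output = input.split(pattern);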
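
The lookahead idea can be read as: a comma is a separator only when the rest of the string contains an even number of single quotes. The sketch below tries it as a regular expression literal with split(); the names separator and splitOutsideQuotes are hypothetical, and balanced, unescaped quotes are assumed.

```javascript
// A comma counts as a separator only if an even number of single quotes
// follows it, i.e. the comma is not inside a quoted section.
// The inner group is non-capturing so split() returns only the fields.
const separator = /,(?=(?:[^']*'[^']*')*[^']*$)/;

// Hypothetical wrapper around split() with the pattern above.
function splitOutsideQuotes(input) {
  return input.split(separator);
}

console.log(splitOutsideQuotes("A,'F,G,bb',H")); // → ["A", "'F,G,bb'", "H"]
console.log(splitOutsideQuotes("a,,b"));         // → ["a", "", "b"]
```

Compared with a match()-based approach, split() keeps empty fields, but the quotes stay in the output and every comma rescans the rest of the string, which can be slow on long inputs.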