I have a string like following
var str="A,B,C,E,'F,G,bb',H,'I9,I8',J,K"
I'd like to split the string on commas. However, in the case where something is inside single quotation marks, I need it to both ignore commas as following.
A
B
C
E
F,G,bb
H
I9,I8
J
K
I have a string like following
var str="A,B,C,E,'F,G,bb',H,'I9,I8',J,K"
I'd like to split the string on commas. However, in the case where something is inside single quotation marks, I need it to both ignore commas as following.
A
B
C
E
F,G,bb
H
I9,I8
J
K
Share
Improve this question
edited Apr 9, 2017 at 19:19
ROMANIA_engineer
56.7k30 gold badges208 silver badges205 bronze badges
asked May 16, 2012 at 11:15
Augustian JosephAugustian Joseph
1071 gold badge1 silver badge12 bronze badges
1
- 1 possible duplicate of Javascript code to parse CSV data – stema Commented May 16, 2012 at 11:25
3 Answers
Reset to default 12> str.match(/('[^']+'|[^,]+)/g)
["A", "B", "C", "E", "'F,G,bb'", "H", "'I9,I8'", "J", "K"]
Though you requested this, you may not accounted for corner-cases where for example:
'bob\'s'
is a string where'
is escapeda,',c
a,,b
a,b,
,a,b
a,b,'
',a,b
',a,b,c,'
Some of the above are handled correctly by this; others are not. I highly recommend that people use a library that has thought this through, to avoid things such as security vulnerabilities or subtle bugs, now or in the future (if you expand your code, or if other people use it).
Explanation of the RegEx:
('[^']+'|[^,]+)
- means match either'[^']+'
or[^,]+
'[^']+'
means quote...one-or-more non-quotes...quote.[^,]+
means one-or-more non-commas
Note: by consuming the quoted string before the unquoted string, we make the parsing of the unquoted string case easier.
Here is my version that works with single and double quotes and can have multiple quoted strings with commas embedded. It gives empty results and too many of them, so you have to check for that. Not rigorously tested. Please excuse the over use of '\'.
var sample='this=that, \
sometext with quoted ",", \
for example, \
another \'with some, quoted text, and more\',\
last,\
but "" "," "asdf,asdf" not "fff\',\' fff" the least';
var it=sample.match(/([^\"\',]*((\'[^\']*\')*||(\"[^\"]*\")*))+/gm);
for (var x=0;x<it.length;x++) {
var txt=$.trim(it[x]);
if(txt.length)
console.log(">"+txt+'<');
}
Use this
var input="A,B,C,E,'F,G,bb',H,'I9,I8',J,K";
//Below pattern will not consider comma(,) between ''. So 'I9,I8' will be considered as single string and not spitted by comma(,).
var pattern = ",(?=([^\']*\'[^\']*\')*[^\']*$)";
//you will get acctual output in array
var output[] = input.split(pattern);