最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

javascript - Regular expressions with Indian characters - Stack Overflow

programmeradmin1浏览0评论

I wonder is it possible to write a regular expression for indian characters? I want to validate if the given character is an Indian letter or number. I found this two questions:

What are the unicode ranges for Hindi accented characters?

what is the range for Hindu–Arabic (ARABIC-INDIC) numeral utf8 from 0 to 9

so I tried this: \x{0600}-\x{06ff}

But if I search this text (in OpenOffice): with this: \x{0600}-\x{06ff} nothing is found...

I wonder is it possible to write a regular expression for indian characters? I want to validate if the given character is an Indian letter or number. I found this two questions:

What are the unicode ranges for Hindi accented characters?

what is the range for Hindu–Arabic (ARABIC-INDIC) numeral utf8 from 0 to 9

so I tried this: \x{0600}-\x{06ff}

But if I search this text (in OpenOffice): http://pastebin./mDHL69XH with this: \x{0600}-\x{06ff} nothing is found...

Share Improve this question edited May 23, 2017 at 12:03 CommunityBot 11 silver badge asked Feb 13, 2013 at 18:00 user568021user568021 1,4866 gold badges30 silver badges57 bronze badges 3
  • 1 Different regular-expression engines are different. You say that you "want to validate if the given character is an Indian letter or number", which suggests you're using some sort of programming language, but then you say that you "search this text (in OpenOffice)", which suggests that you're trying to test your regex using a different regex engine. That is a bad idea. – ruakh Commented Feb 13, 2013 at 18:03
  • you should specify the language you are working with – Anirudha Commented Feb 13, 2013 at 18:08
  • I never really went deep into regular expressions...so different engines are new to me :) well I'm actually trying to do this in javascript... – user568021 Commented Feb 14, 2013 at 10:40
Add a ment  | 

1 Answer 1

Reset to default 11

Well this should do

[\u0900-\u097F]+// \uFFFF format supported by Java,

or

[\u{0900}-\u{097F}]+// \u{FFFF} format supported by perl,pcre

or

\p{Devanagari}//not widely supported
发布评论

评论列表(0)

  1. 暂无评论