UIMA Ruta - match previous token

UIMA Ruta - match previous token - ruta

Given the example in the RUTA Guide:
DECLARE Sentence;
PERIOD #{-> MARK(Sentence)} PERIOD;
I would like to include the first PERIOD in the sentence annotation, is that possible ?

Do you want the first period "." in the sentence annotation? You can do that with the following script.
DECLARE Sentence;
(PERIOD #){-> MARK(Sentence)} PERIOD;
or
DECLARE Sentence;
(PERIOD #){-> Sentence} PERIOD;
Using input: "my first sentence. And my second sentence."
You will get ". And my second sentence" marked as sentence.
Is that what you try to achieve? I don't think so ;-) I think you want real sentences with the dot at the end of the sentence?
You can do that for example with:
DECLARE Sentence;
((# PERIOD){-> Sentence })*;
You will get:
"my first sentence."
and " And my second sentence." marked as sentence.

Related

Crystal reports remove last carriage return

I have an address block like such;
123 Fake St
Suburb, VIC
4512
I want to only remove the last carriage return so it looks like;
123 Fake St
Suburb VIC 4512
Im using the following code which i feel is on the right track but it doesnt change the text at all. Totext is in there because without it i get the message "a string is expected here"
replace({Header.Address},totext(instr(strreverse({Header.Address}),chrw(13))) ,'')
I feel like i only need a small adjustment on this but i just cant quite figure it out. Any help is appreciated.

It looks like the format for the overload of the Replace() function you are using is this.
Replace (inputString, findString, replaceString)
You have defined the value of the findString argument as this.
totext(instr(strreverse({Header.Address}),chrw(13)))
The findString argument has a datatype of String, so you had to add the ToText() function because the InStr() returns an integer value that represents the index of where the carriage return is found in your reversed string. You don't need an index for where the carriage return is located in the string for the overload of the Replace() function you are using, so you can simplify your findString argument to this.
ChrW(13)
This will change your original formula to this.
Replace({Header.Address}, ChrW(13) ,'')
If this still doesn't work, then you may have Line Feed characters in your address data instead of Carriage Return characters. In this case your formula would be this.
Replace({Header.Address}, ChrW(10) ,'')
In some cases you may need to replace both Line Feed and Carriage Return characters. To do this you want this nested version of the formula.
Replace( Replace({Header.Address}, ChrW(10) ,''), ChrW(13) ,'')

Solved elsewhere by;
StrReverse(Replace(Replace(StrReverse({Header.Address}), chr(10), "", 1, 1), chr(13), "", 1, 1))

Finding a string between two ranges of strings or end of string

I am trying to extract whatever is between two strings. The first string is a known string, the second string could be from a list of strings.
For example,
We have the start string and the end strings. We want to get the text between these.
start = "start"
end = ["then", "stop", "other"]
Criteria
test = "start a task then do something else"
result = "a task"
test = "start a task stop doing something else"
result = "a task"
test = "start a task then stop"
result = "a task"
test = "start a task"
result = "a task"
I have looked at using a regex, and I got one which works for between two strings, I just cannot create one which words with a option of strings:
(?<=start\s).*(?=\sthen)
I have tried using this:
(?<=start\s).*(?=\sthen|\sstop|\sother)
but this will include 'then, stop or other' in the match like so:
"start a task then stop" will return "a task then"
I have also tried to do a 'match any character except the end list" in the capture group like so: (?<=start\s)((?!then|stop|other).*)(?=\sthen|\sstop|\sother) but this has the same effect as the one above.
I am using swift, so I am also wondering whether this can be achieved by finding the substring between two strings.
Thanks for any help!

You may use
(?<=start\s).*?(?=\s+(?:then|stop|other)|$)
See the regex demo. To search for whole words, add \b word boundary in proper places:
(?<=\bstart\s).*?(?=\s+(?:then|stop|other)\b|$)
See another regex demo
Details
(?<=start\s) - a positive lookbehind that matches a location immediately preceded with start string and a whitespace
.*? - any 0+ chars other than line break chars, as few as possible
(?=\s+(?:then|stop|other)|$) - a position in the string that is immediately followed with
\s+ - 1+ whitespaces
(?:then|stop|other) - one of the words
|$ - or end of string.

Using regexp_replace how do t replace a string with an exception

How do I replace all occurrences of ' sub.*' with the exception of ' substation.*'?
regexp_replace("CleanString",' sub.*',' ', 'ig')
I have tried using various combinations of groupings () but still not getting it.
Using postgres regexp_replace()

A regular expression normally matches only things that are there, not things that are not there - you cannot simply put an "if-then-else" in there.
However, Postgres's regex support, the manual page for which is here includes "lookahead" and "lookbehind" expressions.
In your case, you want a *negative lookahead":
(?!re) negative lookahead matches at any point where no substring matching re begins (AREs only)
It's important to note the phrase "at any point" - lookarounds are "zero width", so (?!station) doesn't mean "something other than station", it means "a position in the string where station isn't coming next".
You can therefore construct your query like this:
' sub(?!station).*'
That will match any of "sub", "foo sub", " subbar", or "foo subbar", but not any of "substation", "foo substation", " substationbar", or "foo substationbar". Since the (?!station) is zero-width, and the next token is .*, it's fine for nothing to come after " sub".
If you want there to be something after the "sub", you could instead write:
' sub(?!station).+'
The .+ means "at least one of something", so it will still match " subbar" and "foo subbar", but will no longer match " sub" or "foo sub".

Access, How can I change lowercase letter of first letter in last name to uppercase

I would like to change lowercase letter of first letter in last name to uppercase by using code
my code from form is :
Option Compare Database
Private Sub Text19_Click()
Text19 = UCase(Text19)
End Sub
but there is no change to my table!
Furthermore, how can I find last name with a space, comma or period and make it without a space, comma and period.
such as
Moon,
Moon.
[space] Moon
change them to just
Moon

If there is no change to your table, maybe your field is not bound to the recordset? Maybe you need to 'Refresh' your form.
Also, it looks like you are trying to use this code on a TextBox?
Code would be as follows:
Private Sub Text19_DblClick(Cancel As Integer)
Text19 = Trim(Text19) ' Get rid of leading and trailing spaces.
If right(Text19, 1) = "." Or right(Text19, 1) = "," Then ' Remove comma, period
Text19 = left(Text19, Len(Text19) - 1)
End If
Text19 = UCase(left(Text19, 1)) & Mid(Text19, 2)
End Sub

AutoHotKey Script - String Splitting

I have a string that looks like this:
17/07/2013 TEXTT TEXR 1 Text 1234567 456.78 987654
I need to separate this so I only end up with 2 values (in this example it's 1234567 and 456.78). The rest is unneeded.
I tried using string split with %A_Space% but as the whole middle area between values is filled with spaces, it doesn't really work.
Anyone got an idea?

src:="17/07/2013 TEXTT TEXR 1 Text "
. " 1234567 456.78 987654", pattern:="([\d\.]+)\s+([\d\.]+)"
RegexMatch(src, pattern, match)
MsgBox, 262144, % "result", % match1 "`n"match2

You should look at RegExMatch() and RegexReplace().
So, you will need to build a regex needle (I'm not an expert regexer, but this will work)
First, remove all of the string up to the end of "1 Text" since "1 Text" as you say, is constant. That will leave you with the three number values.
Something like this should find just the numbers you want:
needle:= "iO)1\s+Text"
partialstring := RegexMatch(completestring, needle, results)
lenOfFrontToRemove := results.pos() + results.len()
lastthreenumbers := substr(completestring, lenOfFrontToRemove, strlen(completestring) )
lastthreenumbers := trim(lastthreenumbers)
msgbox % lastthreenumbers
To explain the regex needle:
- the i means case insensitive
- the O stands for options - it lets us use results.pos and results.len
- the \s means to look for whitespace; the + means to look for more than one if present.
Now you have just the last three numbers.
1234567 456.78 987654
But you get the idea, right? You should able to parse it from here.
Some hints: in a regex needle, use \d to find any digit, and the + to make it look for more than one in a row. If you want to find the period, use \.

We Keep Coding

iphone swift flutter scala powershell matlab mongodb postgresql perl eclipse

UIMA Ruta - match previous token - ruta

Given the example in the RUTA Guide: DECLARE Sentence; PERIOD #{-> MARK(Sentence)} PERIOD; I would like to include the first PERIOD in the sentence annotation, is that possible ?

Related

Crystal reports remove last carriage return

Finding a string between two ranges of strings or end of string

Using regexp_replace how do t replace a string with an exception

Access, How can I change lowercase letter of first letter in last name to uppercase

AutoHotKey Script - String Splitting

Categories

Resources