Basic Data Operation
This is a basic data operation. It represents a fundamental action on a basic data type.
You may see other such operations in theBasic Data Operations category, or:
Integer Operations
Arithmetic |Comparison
Boolean Operations
Bitwise |Logical
String Operations
Concatenation |Interpolation |Comparison |Matching
Memory Operations
Pointers & references |Addresses
Display a substring:
If the program uses UTF-8 or UTF-16, it must work on any valid Unicode code point, whether in the Basic Multilingual Plane or above it.
The program must reference logical characters (code points), not 8-bit code units for UTF-8 or 16-bit code units for UTF-16.
Programs for other encodings (such as 8-bit ASCII, or EUC-JP) are not required to handle all Unicode characters.
V s = ‘abcdefgh’V n = 2V m = 3V char = ‘d’V chars = ‘cd’print(s[n - 1 .+ m]) // starting from n=2 characters in and m=3 in lengthprint(s[n - 1 ..]) // starting from n characters in, up to the end of the stringprint(s[0 .< (len)-1]) // whole string minus last characterprint(s[s.index(char) .+ m]) // starting from a known character char="d" within the string and of m lengthprint(s[s.index(chars) .+ m]) // starting from a known substring chars="cd" within the string and of m length
bcdbcdefghabcdefgdefcde
/* ARM assembly AARCH64 Raspberry PI 3B *//* program subString64.s */ /*******************************************//* Constantes file *//*******************************************//* for this file see task include a file in language AArch64 assembly*/.include "../includeConstantesARM64.inc"/*******************************************//* Initialized data *//*******************************************/.dataszMessString: .asciz "Result : " szString1: .asciz "abcdefghijklmnopqrstuvwxyz"szStringStart: .asciz "abcdefg"szCarriageReturn: .asciz "\n"/*******************************************/ /* UnInitialized data *//*******************************************/.bss szSubString: .skip 500 // buffer result /*******************************************//* code section *//*******************************************/.text.global main main: ldr x0,qAdrszString1 // address input string ldr x1,qAdrszSubString // address output string mov x2,22 // location mov x3,4 // length bl subStringNbChar // starting from n characters in and of m length ldr x0,qAdrszMessString // display message bl affichageMess ldr x0,qAdrszSubString // display substring result bl affichageMess ldr x0,qAdrszCarriageReturn // display line return bl affichageMess // ldr x0,qAdrszString1 ldr x1,qAdrszSubString mov x2,15 // location bl subStringEnd //starting from n characters in, up to the end of the string ldr x0,qAdrszMessString // display message bl affichageMess ldr x0,qAdrszSubString bl affichageMess ldr x0,qAdrszCarriageReturn // display line return bl affichageMess // ldr x0,qAdrszString1 ldr x1,qAdrszSubString bl subStringMinus // whole string minus last character ldr x0,qAdrszMessString // display message bl affichageMess ldr x0,qAdrszSubString bl affichageMess ldr x0,qAdrszCarriageReturn // display line return bl affichageMess // ldr x0,qAdrszString1 ldr x1,qAdrszSubString mov x2,'c' // start character mov x3,5 // length bl subStringStChar //starting from a known character within the string and of m length cmp x0,-1 // error ? beq 2f ldr x0,qAdrszMessString // display message bl affichageMess ldr x0,qAdrszSubString bl affichageMess ldr x0,qAdrszCarriageReturn // display line return bl affichageMess //2: ldr x0,qAdrszString1 ldr x1,qAdrszSubString ldr x2,qAdrszStringStart // sub string to start mov x3,10 // length bl subStringStString // starting from a known substring within the string and of m length cmp x0,-1 // error ? beq 3f ldr x0,qAdrszMessString // display message bl affichageMess ldr x0,qAdrszSubString bl affichageMess ldr x0,qAdrszCarriageReturn // display line return bl affichageMess3:100: // standard end of the program mov x0,0 // return code mov x8,EXIT // request to exit program svc 0 // perform system callqAdrszMessString: .quad szMessStringqAdrszString1: .quad szString1qAdrszSubString: .quad szSubStringqAdrszStringStart: .quad szStringStartqAdrszCarriageReturn: .quad szCarriageReturn/******************************************************************//* sub strings index start number of characters */ /******************************************************************//* x0 contains the address of the input string *//* x1 contains the address of the output string *//* x2 contains the start index *//* x3 contains numbers of characters to extract *//* x0 returns number of characters or -1 if error */subStringNbChar: stp x1,lr,[sp,-16]! // save registers mov x14,#0 // counter byte output string 1: ldrb w15,[x0,x2] // load byte string input cbz x15,2f // zero final ? strb w15,[x1,x14] // store byte output string add x2,x2,1 // increment counter add x14,x14,1 cmp x14,x3 // end ? blt 1b // no -> loop2: strb wzr,[x1,x14] // store final zero byte string 2 mov x0,x14100: ldp x1,lr,[sp],16 // restaur 2 registers ret // return to address lr x30/******************************************************************//* sub strings index start at end of string */ /******************************************************************//* x0 contains the address of the input string *//* x1 contains the address of the output string *//* x2 contains the start index *//* x0 returns number of characters or -1 if error */subStringEnd: stp x2,lr,[sp,-16]! // save registers mov x14,0 // counter byte output string 1: ldrb w15,[x0,x2] // load byte string 1 cbz x15,2f // zero final ? strb w15,[x1,x14] add x2,x2,1 add x14,x14,1 b 1b // loop2: strb wzr,[x1,x14] // store final zero byte string 2 mov x0,x14100: ldp x2,lr,[sp],16 // restaur 2 registers ret // return to address lr x30/******************************************************************//* whole string minus last character */ /******************************************************************//* x0 contains the address of the input string *//* x1 contains the address of the output string *//* x0 returns number of characters or -1 if error */subStringMinus: stp x1,lr,[sp,-16]! // save registers mov x12,0 // counter byte input string mov x14,0 // counter byte output string 1: ldrb w15,[x0,x12] // load byte string cbz x15,2f // zero final ? strb w15,[x1,x14] add x12,x12,1 add x14,x14,1 b 1b // loop2: sub x14,x14,1 strb wzr,[x1,x14] // store final zero byte string 2 mov x0,x14100: ldp x1,lr,[sp],16 // restaur 2 registers ret // return to address lr x30/******************************************************************//* starting from a known character within the string and of m length */ /******************************************************************//* x0 contains the address of the input string *//* x1 contains the address of the output string *//* x2 contains the character *//* x3 contains the length/* x0 returns number of characters or -1 if error */subStringStChar: stp x1,lr,[sp,-16]! // save registers mov x16,0 // counter byte input string mov x14,0 // counter byte output string 1: ldrb w15,[x0,x16] // load byte string cbz x15,4f // zero final ? cmp x15,x2 // character find ? beq 2f // yes add x16,x16,1 // no -> increment indice b 1b // loop2: strb w15,[x1,x14] add x16,x16,1 add x14,x14,1 cmp x14,x3 bge 3f ldrb w15,[x0,x16] // load byte string cbnz x15,2b // loop if no zero final3: strb wzr,[x1,x14] // store final zero byte string 2 mov x0,x14 b 100f4: strb w15,[x1,x14] mov x0,#-1100: ldp x1,lr,[sp],16 // restaur 2 registers ret // return to address lr x30/******************************************************************//* starting from a known substring within the string and of m length */ /******************************************************************//* x0 contains the address of the input string *//* x1 contains the address of the output string *//* x2 contains the address of string to start *//* x3 contains the length/* x0 returns number of characters or -1 if error */subStringStString: stp x1,lr,[sp,-16]! // save registers stp x20,x21,[sp,-16]! // save registers mov x20,x0 // save address mov x21,x1 // save address output string mov x1,x2 bl searchSubString cmp x0,-1 // not found ? beq 100f mov x16,x0 // counter byte input string mov x14,01: ldrb w15,[x20,x16] // load byte string strb w15,[x21,x14] cmp x15,#0 // zero final ? csel x0,x14,x0,eq beq 100f add x14,x14,1 cmp x14,x3 add x15,x16,1 csel x16,x15,x16,lt blt 1b // loop strb wzr,[x21,x14] mov x0,x14 // return indice100: ldp x20,x21,[sp],16 // restaur 2 registers ldp x1,lr,[sp],16 // restaur 2 registers ret // return to address lr x30/******************************************************************//* search a substring in the string */ /******************************************************************//* x0 contains the address of the input string *//* x1 contains the address of substring *//* x0 returns index of substring in string or -1 if not found */searchSubString: stp x1,lr,[sp,-16]! // save registers mov x12,0 // counter byte input string mov x13,0 // counter byte string mov x16,-1 // index found ldrb w14,[x1,x13]1: ldrb w15,[x0,x12] // load byte string cbz x15,4f // zero final ? cmp x15,x14 // compare character beq 2f mov x16,-1 // no equals - > raz index mov x13,0 // and raz counter byte add x12,x12,1 // and increment counter byte b 1b // and loop2: // characters equals cmp x16,-1 // first characters equals ? csel x16,x12,x16,eq // moveq x6,x2 // yes -> index begin in x6 add x13,x13,1 // increment counter substring ldrb w14,[x1,x13] // and load next byte cbz x14,3f // zero final ? yes -> end search add x12,x12,1 // else increment counter string b 1b // and loop3: mov x0,x16 // return indice b 100f4: mov x0,#-1 // yes returns error100: ldp x1,lr,[sp],16 // restaur 2 registers ret // return to address lr x30/********************************************************//* File Include fonctions *//********************************************************//* for this file see task include a file in language AArch64 assembly */.include "../includeARM64.inc"
Result : wxyzResult : pqrstuvwxyzResult : abcdefghijklmnopqrstuvwxyResult : cdefgResult : abcdefghij
BYTE FUNC FindC(CHAR ARRAY text CHAR c) BYTE i i=1 WHILE i<=text(0) DO IF text(i)=c THEN RETURN (i) FI i==+1 ODRETURN (0)BYTE FUNC FindS(CHAR ARRAY text,sub) BYTE i,j,found i=1 WHILE i<=text(0)-sub(0)+1 DO found=0 FOR j=1 TO sub(0) DO IF text(i+j-1)#sub(j) THEN found=0 EXIT ELSE found=1 FI OD IF found THEN RETURN (i) FI i==+1 ODRETURN (0)PROC Main() CHAR ARRAY text="qwertyuiop" CHAR ARRAY sub="tyu" CHAR ARRAY res(20) BYTE n,m CHAR c PrintF("Original string:%E ""%S""%E%E",text) n=3 m=5 SCopyS(res,text,n,n+m-1) PrintF("Substring start from %B and length %B:%E ""%S""%E%E",n,m,res) n=4 SCopyS(res,text,n,text(0)) PrintF("Substring start from %B up to the end:%E ""%S""%E%E",n,res) SCopyS(res,text,1,text(0)-1) PrintF("Whole string without the last char:%E ""%S""%E%E",res) c='w m=4 n=FindC(text,c) IF n=0 THEN PrintF("Character '%C' not found in string%E%E",c) ELSE SCopyS(res,text,n,n+m-1) PrintF("Substring start from '%C' and len %B:%E ""%S""%E%E",c,m,res) FI n=FindS(text,sub) m=6 IF n=0 THEN PrintF("String ""%S"" not found in string%E%E",sub) ELSE SCopyS(res,text,n,n+m-1) PrintF("Substring start from '%S' and len %B: ""%S""%E%E",sub,m,res) FIRETURN
Screenshot from Atari 8-bit computer
Original string: "qwertyuiop"Substring start from 3 and length 5: "ertyu"Substring start from 4 up to the end: "rtyuiop"Whole string without the last char: "qwertyuio"Substring start from 'w' and len 4: "wert"Substring start from 'tyu' and len 6: "tyuiop"
String inAda is an array of Character elements indexed by Positive:
typeStringisarray(Positiverange<>)ofCharacter;
Substring is a first-class object inAda, an anonymous subtype of String. The language uses the termslice for it. Slices can be retrieved, assigned and passed as a parameter to subprograms in mutable or immutable mode. A slice is specified as:
A(<first-index>..<last-index>)
A string array inAda can start with any positive index. This is why the implementation below uses Str'First in all slices, which in this concrete case is 1, but intentionally left in the code because the task refers to N as anoffset to the string beginning rather than anindex in the string. InAda it is unusual to deal with slices in such way. One uses plain string index instead.
withAda.Text_IO;useAda.Text_IO;withAda.Strings.Fixed;useAda.Strings.Fixed;procedureTest_SlicesisStr:constantString:="abcdefgh";N:constant:=2;M:constant:=3;beginPut_Line(Str(Str'First+N-1..Str'First+N+M-2));Put_Line(Str(Str'First+N-1..Str'Last));Put_Line(Str(Str'First..Str'Last-1));Put_Line(Head(Tail(Str,Str'Last-Index(Str,"d",1)),M));Put_Line(Head(Tail(Str,Str'Last-Index(Str,"de",1)-1),M));endTest_Slices;
bcdbcdefghabcdefgefgfgh
Aikido uses square brackets for slices. The syntax is[start:end]
. If you want to use length you have to add to the start. Shifting strings left or right removes characters from the ends.
const str = "abcdefg"var n = 2var m = 3println (str[n:n+m-1]) // pos 2 length 3println (str[n:]) // pos 2 to endprintln (str >> 1) // remove last charactervar p = find (str, 'c')println (str[p:p+m-1]) // from pos of p length 3var s = find (str, "bc")println (str[s, s+m-1]) // pos of bc length 3
text s;data b, d;s = "The quick brown fox jumps over the lazy dog.";o_text(cut(s, 4, 15));o_newline();o_text(cut(s, 4, length(s)));o_newline();o_text(delete(s, -1));o_newline();o_text(cut(s, index(s, 'q'), 5));o_newline();b_cast(b, s);b_cast(d, "brown");o_text(cut(s, b_find(b, d), 15));o_newline();
quick brown foxquick brown fox jumps over the lazy dog.The quick brown fox jumps over the lazy dogquickbrown fox jumps
main: ( STRING s = "abcdefgh"; INT n = 2, m = 3; CHAR char = "d"; STRING chars = "cd"; printf(($gl$, s[n:n+m-1])); printf(($gl$, s[n:])); printf(($gl$, s[:UPB s-1])); INT pos; char in string("d", pos, s); printf(($gl$, s[pos:pos+m-1])); string in string("de", pos, s); printf(($gl$, s[pos:pos+m-1])))
bcdbcdefghabcdefgdefdef
In Apex, the substring method returns a new String that begins with the character at the specified zero-based startIndex and extends to the end of the String.
String x = 'testing123';//Test1: testing123System.debug('Test1: ' + x.substring(0,x.length()));//Test2: esting123System.debug('Test2: ' + x.substring(1,x.length()));//Test3: testing123System.debug('Test3: ' + x.substring(0));//Test4: 3System.debug('Test4: ' + x.substring(x.length()-1));//Test5: System.debug('Test5: ' + x.substring(1,1));//Test 6: testing123System.debug('Test6: ' + x.substring(x.indexOf('testing')));//Test7: eSystem.debug('Test7: ' + x.substring(1,2));
Expressed in terms of some familiar functional primitives, so that we can focus more on the task, without too much distraction by the parochial quirks of a particular scripting language.
(Functional primitives version)
-- SUBSTRINGS ------------------------------------------------------------------- take :: Int -> Text -> Textontake(n,s)text1thrunofsendtake-- drop :: Int -> Text -> Textondrop(n,s)text(n+1)thru-1ofsenddrop-- breakOn :: Text -> Text -> (Text, Text)onbreakOn(strPattern,s)set{dlm,mytext item delimiters}to{mytext item delimiters,strPattern}setlstPartstotextitemsofssetmytext item delimiterstodlm{item1oflstParts,strPattern&(item2oflstParts)}endbreakOn-- init :: Text -> Textoninit(s)iflengthofs>0thentext1thru-2ofselsemissing valueendifendinit-- TEST -----------------------------------------------------------------------onrunsetstrto"一二三四五六七八九十"setlegendsto{¬"from n in, of n length",¬"from n in, up to end",¬"all but last",¬"from matching char, of m length",¬"from matching string, of m length"}setpartsto{¬take(3,drop(4,str)),¬drop(3,str),¬init(str),¬take(3,item2ofbreakOn("五",str)),¬take(4,item2ofbreakOn("六七",str))}scripttabulatepropertystrPad:" "on|λ|(l,r)l&drop(lengthofl,strPad)&rend|λ|endscriptlinefeed&intercalate(linefeed,¬zipWith(tabulate,¬legends,parts))&linefeedendrun-- GENERIC FUNCTIONS FOR TEST --------------------------------------------------- intercalate :: Text -> [Text] -> Textonintercalate(strText,lstText)set{dlm,mytext item delimiters}to{mytext item delimiters,strText}setstrJoinedtolstTextastextsetmytext item delimiterstodlmreturnstrJoinedendintercalate-- min :: Ord a => a -> a -> aonmin(x,y)ify<xthenyelsexendifendmin-- Lift 2nd class handler function into 1st class script wrapper-- mReturn :: Handler -> ScriptonmReturn(f)ifclassoffisscriptthenfelsescriptproperty|λ|:fendscriptendifendmReturn-- zipWith :: (a -> b -> c) -> [a] -> [b] -> [c]onzipWith(f,xs,ys)setlngtomin(lengthofxs,lengthofys)setlstto{}tellmReturn(f)repeatwithifrom1tolngsetendoflstto|λ|(itemiofxs,itemiofys)endrepeatreturnlstendtellendzipWith
from n in, of n length 五六七from n in, up to end 四五六七八九十all but last 一二三四五六七八九from matching char, of m length 五六七from matching string, of m length 六七八九
/* ARM assembly Raspberry PI *//* program substring.s *//* Constantes */.equ STDOUT, 1 @ Linux output console.equ EXIT, 1 @ Linux syscall.equ WRITE, 4 @ Linux syscall.equ BUFFERSIZE, 100/* Initialized data */.dataszMessString: .asciz "Result : " szString1: .asciz "abcdefghijklmnopqrstuvwxyz"szStringStart: .asciz "abcdefg"szCarriageReturn: .asciz "\n"/* UnInitialized data */.bss szSubString: .skip 500 @ buffer result/* code section */.text.global main main: ldr r0,iAdrszString1 @ address input string ldr r1,iAdrszSubString @ address output string mov r2,#22 @ location mov r3,#4 @ length bl subStringNbChar @ starting from n characters in and of m length ldr r0,iAdrszMessString @ display message bl affichageMess ldr r0,iAdrszSubString @ display substring result bl affichageMess ldr r0,iAdrszCarriageReturn @ display line return bl affichageMess @ ldr r0,iAdrszString1 ldr r1,iAdrszSubString mov r2,#15 @ location bl subStringEnd @starting from n characters in, up to the end of the string ldr r0,iAdrszMessString @ display message bl affichageMess ldr r0,iAdrszSubString bl affichageMess ldr r0,iAdrszCarriageReturn @ display line return bl affichageMess @ ldr r0,iAdrszString1 ldr r1,iAdrszSubString bl subStringMinus @ whole string minus last character ldr r0,iAdrszMessString @ display message bl affichageMess ldr r0,iAdrszSubString bl affichageMess ldr r0,iAdrszCarriageReturn @ display line return bl affichageMess @ ldr r0,iAdrszString1 ldr r1,iAdrszSubString mov r2,#'c' @ start character mov r3,#5 @ length bl subStringStChar @starting from a known character within the string and of m length cmp r0,#-1 @ error ? beq 2f ldr r0,iAdrszMessString @ display message bl affichageMess ldr r0,iAdrszSubString bl affichageMess ldr r0,iAdrszCarriageReturn @ display line return bl affichageMess @2: ldr r0,iAdrszString1 ldr r1,iAdrszSubString ldr r2,iAdrszStringStart @ sub string to start mov r3,#10 @ length bl subStringStString @ starting from a known substring within the string and of m length cmp r0,#-1 @ error ? beq 3f ldr r0,iAdrszMessString @ display message bl affichageMess ldr r0,iAdrszSubString bl affichageMess ldr r0,iAdrszCarriageReturn @ display line return bl affichageMess3:100: @ standard end of the program mov r0, #0 @ return code mov r7, #EXIT @ request to exit program svc 0 @ perform system calliAdrszMessString: .int szMessStringiAdrszString1: .int szString1iAdrszSubString: .int szSubStringiAdrszStringStart: .int szStringStartiAdrszCarriageReturn: .int szCarriageReturn/******************************************************************//* sub strings index start number of characters */ /******************************************************************//* r0 contains the address of the input string *//* r1 contains the address of the output string *//* r2 contains the start index *//* r3 contains numbers of characters to extract *//* r0 returns number of characters or -1 if error */subStringNbChar: push {r1-r5,lr} @ save registers mov r4,#0 @ counter byte output string 1: ldrb r5,[r0,r2] @ load byte string input cmp r5,#0 @ zero final ? beq 2f strb r5,[r1,r4] @ store byte output string add r2,#1 @ increment counter add r4,#1 cmp r4,r3 @ end ? blt 1b @ no -> loop2: mov r5,#0 strb r5,[r1,r4] @ load byte string 2 mov r0,r4100: pop {r1-r5,lr} @ restaur registers bx lr @ return/******************************************************************//* sub strings index start at end of string */ /******************************************************************//* r0 contains the address of the input string *//* r1 contains the address of the output string *//* r2 contains the start index *//* r0 returns number of characters or -1 if error */subStringEnd: push {r1-r5,lr} @ save registers mov r4,#0 @ counter byte output string 1: ldrb r5,[r0,r2] @ load byte string 1 cmp r5,#0 @ zero final ? beq 2f strb r5,[r1,r4] add r2,#1 add r4,#1 b 1b @ loop2: mov r5,#0 strb r5,[r1,r4] @ load byte string 2 mov r0,r4100: pop {r1-r5,lr} @ restaur registers bx lr /******************************************************************//* whole string minus last character */ /******************************************************************//* r0 contains the address of the input string *//* r1 contains the address of the output string *//* r0 returns number of characters or -1 if error */subStringMinus: push {r1-r5,lr} @ save registers mov r2,#0 @ counter byte input string mov r4,#0 @ counter byte output string 1: ldrb r5,[r0,r2] @ load byte string cmp r5,#0 @ zero final ? beq 2f strb r5,[r1,r4] add r2,#1 add r4,#1 b 1b @ loop2: sub r4,#1 mov r5,#0 strb r5,[r1,r4] @ load byte string 2 mov r0,r4100: pop {r1-r5,lr} @ restaur registers bx lr /******************************************************************//* starting from a known character within the string and of m length */ /******************************************************************//* r0 contains the address of the input string *//* r1 contains the address of the output string *//* r2 contains the character *//* r3 contains the length/* r0 returns number of characters or -1 if error */subStringStChar: push {r1-r5,lr} @ save registers mov r6,#0 @ counter byte input string mov r4,#0 @ counter byte output string 1: ldrb r5,[r0,r6] @ load byte string cmp r5,#0 @ zero final ? streqb r5,[r1,r4] moveq r0,#-1 beq 100f cmp r5,r2 beq 2f add r6,#1 b 1b @ loop2: strb r5,[r1,r4] add r6,#1 add r4,#1 cmp r4,r3 bge 3f ldrb r5,[r0,r6] @ load byte string cmp r5,#0 bne 2b3: mov r5,#0 strb r5,[r1,r4] @ load byte string 2 mov r0,r4100: pop {r1-r5,lr} @ restaur registers bx lr /******************************************************************//* starting from a known substring within the string and of m length */ /******************************************************************//* r0 contains the address of the input string *//* r1 contains the address of the output string *//* r2 contains the address of string to start *//* r3 contains the length/* r0 returns number of characters or -1 if error */subStringStString: push {r1-r8,lr} @ save registers mov r7,r0 @ save address mov r8,r1 @ counter byte string mov r1,r2 bl searchSubString cmp r0,#-1 beq 100f mov r6,r0 @ counter byte input string mov r4,#01: ldrb r5,[r7,r6] @ load byte string strb r5,[r8,r4] cmp r5,#0 @ zero final ? moveq r0,r4 beq 100f add r4,#1 cmp r4,r3 addlt r6,#1 blt 1b @ loop mov r5,#0 strb r5,[r8,r4] mov r0,r4100: pop {r1-r8,lr} @ restaur registers bx lr /******************************************************************//* search a substring in the string */ /******************************************************************//* r0 contains the address of the input string *//* r1 contains the address of substring *//* r0 returns index of substring in string or -1 if not found */searchSubString: push {r1-r6,lr} @ save registers mov r2,#0 @ counter byte input string mov r3,#0 @ counter byte string mov r6,#-1 @ index found ldrb r4,[r1,r3]1: ldrb r5,[r0,r2] @ load byte string cmp r5,#0 @ zero final ? moveq r0,#-1 @ yes returns error beq 100f cmp r5,r4 @ compare character beq 2f mov r6,#-1 @ no equals - > raz index mov r3,#0 @ and raz counter byte add r2,#1 @ and increment counter byte b 1b @ and loop2: @ characters equals cmp r6,#-1 @ first characters equals ? moveq r6,r2 @ yes -> index begin in r6 add r3,#1 @ increment counter substring ldrb r4,[r1,r3] @ and load next byte cmp r4,#0 @ zero final ? beq 3f @ yes -> end search add r2,#1 @ else increment counter string b 1b @ and loop3: mov r0,r6100: pop {r1-r6,lr} @ restaur registers bx lr /******************************************************************//* display text with size calculation */ /******************************************************************//* r0 contains the address of the message */affichageMess: push {r0,r1,r2,r7,lr} @ save registers mov r2,#0 @ counter length */1: @ loop length calculation ldrb r1,[r0,r2] @ read octet start position + index cmp r1,#0 @ if 0 its over addne r2,r2,#1 @ else add 1 in the length bne 1b @ and loop @ so here r2 contains the length of the message mov r1,r0 @ address message in r1 mov r0,#STDOUT @ code to write to the standard output Linux mov r7, #WRITE @ code call system "write" svc #0 @ call system pop {r0,r1,r2,r7,lr} @ restaur registers bx lr @ return
str:"abcdefgh"n:2m:3; starting from n=2 characters in and m=3 in lengthprintslicestrn-1n+m-2; starting from n characters in, up to the end of the stringprintslicestrn-1(size str)-1; whole string minus last characterprintslicestr0(size str)-2; starting from a known character char="d"; within the string and of m lengthprintslicestrindexstr"d"m+(indexstr"d")-1; starting from a known substring chars="cd"; within the string and of m lengthprintslicestrindexstr"cd"m+(indexstr"cd")-1
bcdbcdefghabcdefgdefcde
The code contains some alternatives.
String:="abcdefghijklmnopqrstuvwxyz"; also: String = abcdefghijklmnopqrstuvwxyzn:=12m:=5; starting from n characters in and of m length;subString:=SubStr(String,n,m); alternative: StringMid, subString, String, n, mMsgBox%subString; starting from n characters in, up to the end of the string;subString:=SubStr(String,n); alternative: StringMid, subString, String, nMsgBox%subString; whole string minus last character;StringTrimRight,subString,String,1; alternatives: subString := SubStr(String, 1, StrLen(String) - 1); StringMid, subString, String, 1, StrLen(String) - 1MsgBox%subString; starting from a known character within the string and of m length;findChar:="q"subString:=SubStr(String,InStr(String,findChar),m); alternatives: RegExMatch(String, findChar . ".{" . m - 1 . "}", subString); StringMid, subString, String, InStr(String, findChar), mMsgBox%subString; starting from a known character within the string and of m length;findString:="pq"subString:=SubStr(String,InStr(String,findString),m); alternatives: RegExMatch(String, findString . ".{" . m - StrLen(findString) . "}", subString); StringMid, subString, String, InStr(String, findString), mMsgBox%subString
lmnop lmnopqrstuvwxyz abcdefghijklmnopqrstuvwxy qrstu pqrst
BEGIN{str="abcdefghijklmnopqrstuvwxyz"n=12m=5printsubstr(str,n,m)printsubstr(str,n)printsubstr(str,1,length(str)-1)printsubstr(str,index(str,"q"),m)printsubstr(str,index(str,"pq"),m)}
$ awk -f substring.awk lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyqrstupqrst
This example isin need of improvement:
|
Lbl SUB10→{r₁+r₂+r₃}r₁+r₂ReturnLbl SUB2r₁+r₂ReturnLbl SUB30→{r₁+length(r₁)-1}r₁ReturnLbl SUB4inData(r₂,r₁)-1→I0→{r₁+I+r₃}r₁+IReturn
0READN,M,S$:L=LEN(S$):GOSUB1:END:DATA5,11,THEQUICKBROWNFOXJUMPSOVERTHELAZYDOG,J,FOXREM starting from n characters in and of m length;1PRINTMID$(S$,N,M)REM starting from n characters in, up to the end of the string;2PRINTRIGHT$(S$,L-N+1)REM whole string minus the last character;3PRINTLEFT$(S$,L-1)REM starting from a known character within the string and of m length;4READF$:GOSUB6REM starting from a known substring within the string and of m length.5READF$6FORI=1TOL:IFMID$(S$,I,LEN(F$))=F$THENPRINTMID$(S$,I,M):RETURN7NEXT:RETURN
REM SubstringBase$="abcdefghijklmnopqrstuvwxyz"N=12M=5REM Starting from N characters in and of M length.Sub$=MID$(Base$,N,M)PRINTSub$REM Starting from N characters in, up to the end of the string.L=LEN(Base$)L=L-NL=L+1Sub$=MID$(Base$,N,L)PRINTSub$REM Whole string minus last character.L=LEN(Base$)L=L-1Sub$=LEFT$(Base$,L)PRINTSub$REM Starting from a known character within the string and of M length.B=INSTR(Base$,"b")Sub$=MID$(Base$,B,M)PRINTSub$REM Starting from a known substring within the string and of M length.Find$="pq"B=INSTR(Base$,Find$)Sub$=MID$(Base$,B,M)PRINTSub$END
lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxybcdefpqrst
c$ = "abcdefghijklmnopqrstuvwxyz"n = 12m = 5# starting from n characters in and of m length;print mid(c$, n, m)# starting from n characters in, up to the end of the string;print mid(c$, n, length(c$))# whole string minus last character;print left(c$, length(c$) - 1)# starting from a known character within the string and of m length;print mid(c$, instr(c$, "b"), m)# starting from a known substring within the string and of m length.f$ = "pq"print mid(c$, instr(c$, f$), m)end
lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxybcdefpqrst
basestring$="The five boxing wizards jump quickly"n%=10m%=5REM starting from n characters in and of m length:substring$=MID$(basestring$,n%,m%)PRINTsubstring$REM starting from n characters in, up to the end of the string:substring$=MID$(basestring$,n%)PRINTsubstring$REM whole string minus last character:substring$=LEFT$(basestring$)PRINTsubstring$REM starting from a known character within the string and of m length:char$="w"substring$=MID$(basestring$,INSTR(basestring$,char$),m%)PRINTsubstring$REM starting from a known substring within the string and of m length:find$="iz"substring$=MID$(basestring$,INSTR(basestring$,find$),m%)PRINTsubstring$
boxinboxing wizards jump quicklyThe five boxing wizards jump quicklwizarizard
10REM SUBSTRING ... ROSETTACODE.ORG20A$="THE QUICK BROWN FOX JUMPS OVER THE LAZY DOG"30X$="J":S$="FOX"40N=5:M=1150PRINT"THE STRING:"60PRINTA$70PRINT80PRINT"SUBSTRING STARTING FROM"N"CHARACTERS IN AND OF"M"LENGTH:"90PRINTMID$(A$,N,M)100PRINT110PRINT"STARTING FROM"N"CHARACTERS IN, UP TO THE END OF THE STRING:"120PRINTRIGHT$(A$,LEN(A$)+1-N)130PRINT140PRINT"WHOLE STRING MINUS LAST CHARACTER:"150PRINTLEFT$(A$,LEN(A$)-1)160PRINT170PRINT"STARTING FROM '";X$;"' AND OF"M"LENGTH:"180I=1190IFMID$(A$,I,1)=X$THEN220200I=I+1210GOTO190220PRINTRIGHT$(A$,LEN(A$)+1-I)230PRINT240PRINT"STARTING FROM '";S$;"' AND OF"M"LENGTH:"250I=1260IFMID$(A$,I,LEN(S$))=S$THEN290270I=I+1280GOTO260290PRINTRIGHT$(A$,LEN(A$)+1-I)300END
THE STRING:THE QUICK BROWN FOX JUMPS OVER THE LAZY DOGSUBSTRING STARTING FROM 5 CHARACTERS IN AND OF 11 LENGTH:QUICK BROWNSTARTING FROM 5 CHARACTERS IN, UP TO THE END OF THE STRING:QUICK BROWN FOX JUMPS OVER THE LAZY DOGWHOLE STRING MINUS LAST CHARACTER:THE QUICK BROWN FOX JUMPS OVER THE LAZY DOSTARTING FROM 'J' AND OF 11 LENGTH:JUMPS OVER THE LAZY DOGSTARTING FROM 'FOX' AND OF 11 LENGTH:FOX JUMPS OVER THE LAZY DOG
' FB 1.05.0 Win64DimsAsString="123456789"DimAsIntegern=3,m=4PrintMid(s,n,m)PrintMid(s,n)PrintLeft(s,Len(s)-1)'start from "5" sayPrintMid(s,Instr(s,"5"),m)' start from "12" sayPrintMid(s,Instr(s,"12"),m)Sleep
345634567891234567856781234
Click this link to run this code
PublicSubMain()DimsStringAsString="THE QUICK BROWN FOX JUMPS OVER THE LAZY DOG"PrintMid(sString,11,5)'Starting from n characters in and of m lengthPrintMid(sString,17)'Starting from n characters in, up to the end of the stringPrintLeft(sString,-1)'Whole string minus last characterPrintMid(sString,InStr(sString,"B"),9)'Starting from a known character within the string and of m lengthPrintMid(sString,InStr(sString,"OVER"),8)'Starting from a known substring within the string and of m lengthEnd
Output:
BROWNFOX JUMPS OVER THE LAZY DOGTHE QUICK BROWN FOX JUMPS OVER THE LAZY DOBROWN FOXOVER THE
100 LET A$="abcdefghijklmnopqrstuvwxyz"110 LET N=10:LET M=7120 PRINT A$(N:N+M-1)130 PRINT A$(N:)140 PRINT A$(:LEN(A$)-1)150 LET I=POS(A$,"g")160 PRINT A$(I:I+M-1)170 LET I=POS(A$,"ijk")180 PRINT A$(I:I+M-1)
'These tasks can be completed with various combinations of Liberty Basic's'built in Mid$()/ Instr()/ Left$()/ Right$()/ and Len() functions, but these'examples only use the Mid$()/ Instr()/ and Len() functions.baseString$ = "Thequickbrownfoxjumpsoverthelazydog."n = 12m = 5'starting from n characters in and of m lengthPrint Mid$(baseString$, n, m)'starting from n characters in, up to the end of the stringPrint Mid$(baseString$, n)'whole string minus last characterPrint Mid$(baseString$, 1, (Len(baseString$) - 1))'starting from a known character within the string and of m lengthPrint Mid$(baseString$, Instr(baseString$, "f", 1), m)'starting from a known substring within the string and of m lengthPrint Mid$(baseString$, Instr(baseString$, "jump", 1), m)
10REM Substring20BAS$="abcdefghijklmnopqrstuvwxyz"30N=12:M=540REM Starting from N characters in50REM and of M length60SB$=MID$(BAS$,N,M)70PRINTSB$80REM Starting from N characters in,90REM up to the end of the string100SB$=MID$(BAS$,N,LEN(BAS$)-N+1)110PRINTSB$120REM Whole string minus last character130SB$=LEFT$(BAS$,LEN(BAS$)-1)140PRINTSB$150REM Starting from a known character160REM within the string and of M length170A$=BAS$:B$="b":GOSUB270180SB$=MID$(BAS$,C,M)190PRINTSB$200REM Starting from a known substring210REM within the string and of M length220A$=BAS$:B$="pq":GOSUB270230SB$=MID$(BAS$,C,M)240PRINTSB$250END260REM ** INSTR subroutine270LB=LEN(B$):C=0280FORI=1TOLEN(A$)-LB+1290IFMID$(A$,I,LB)=B$THENC=I:RETURN300NEXTI310RETURN
lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxybcdefpqrst
IfOpenConsole()DefinebaseString.s,m,nbaseString="Thequickbrownfoxjumpsoverthelazydog."n=12m=5;Displaythesubstringstartingfromncharactersinandofmlength.PrintN(Mid(baseString,n,m));Displaythesubstringstartingfromncharactersin,uptotheendofthestring.PrintN(Mid(baseString,n));orPrintN(Right(baseString,Len(baseString)-n));DisplaythesubstringwholestringminuslastcharacterPrintN(Left(baseString,Len(baseString)-1));Displaythesubstringstartingfromaknowncharacterwithinthestringandofmlength.PrintN(Mid(baseString,FindString(baseString,"b",1),m));Displaythesubstringstartingfromaknownsubstringwithinthestringandofmlength.PrintN(Mid(baseString,FindString(baseString,"ju",1),m))Print(#CRLF$+#CRLF$+"Press ENTER to exit")Input()CloseConsole()EndIf
wnfoxwnfoxjumpsoverthelazydog.Thequickbrownfoxjumpsoverthelazydogbrownjumps
DefStr SDefInt Istring1 = "abcdefghijklmnopqrstuvwxyz"substring = "klm"Dim Achar As String * 1Istart = 6Ilength = 10Achar = "c"' starting from n characters in and of m length;Print Mid$(string1, Istart, Ilength)' starting from n characters in, up to the end of the string;Print Mid$(string1, Istart)Print Right$(string1, Len(string1) - Istart + 1)' whole string minus the last character;Print Left$(string1, Len(string1) - 1)Print Mid$(string1, 1, Len(string1) - 1)' starting from a known character within the string and of m length;Print Mid$(string1, InStr(string1, Achar), Ilength)' starting from a known substring within the string and of m length.Print Mid$(string1, InStr(string1, substring), Ilength)End
DIMbaseStringASSTRING,subStringASSTRING,findStringASSTRINGDIMmASINTEGER,nASINTEGERbaseString="abcdefghijklmnopqrstuvwxyz"n=12m=5' starting from n characters in and of m length;subString=MID$(baseString,n,m)PRINTsubString' starting from n characters in, up to the end of the string;subString=MID$(baseString,n)PRINTsubString' whole string minus last character;subString=LEFT$(baseString,LEN(baseString)-1)PRINTsubString' starting from a known character within the string and of m length;subString=MID$(baseString,INSTR(baseString,"b"),m)PRINTsubString' starting from a known substring within the string and of m length.findString="pq"subString=MID$(baseString,INSTR(baseString,findString),m)PRINTsubString
lmnop lmnopqrstuvwxyz abcdefghijklmnopqrstuvwxy bcdef pqrst
n = 2m = 3s$ = "abcd"a$ = mid$(a$,n,m) ' starting from n characters in and of m lengtha$ = mid$(a$,n) ' starting from n characters in, up to the end of the stringa$ = Print mid$(a$,1,(len(a$)-1)) ' whole string minus last charactera$ = mid$(a$,instr(a$,s$,1),m) ' starting from a known character within the string and of m lengtha$ = mid$(a$,instr(a$,s$,1), m) ' starting from a known substring within the string and of m length.
LETbasestring$="abcdefghijklmnopqrstuvwxyz"LETn=12LETm=5!startingfromncharactersinandofmlength;PRINT(basestring$)[n:n+m-1]!startingfromncharactersin,uptotheendofthestring;PRINT(basestring$)[n:MAXNUM]!wholestringminuslastcharacter;PRINT(basestring$)[1:LEN(basestring$)-1]!startingfromaknowncharacterwithinthestringandofmlength;PRINT(basestring$)[POS(basestring$,"b"):POS(basestring$,"b")+m-1]!startingfromaknownsubString$withinthestringandofmlength.LETfindstring$="pq"PRINT(basestring$)[POS(basestring$,findstring$):POS(basestring$,findstring$)+m-1]END
PublicSubsubstring()'(1) starting from n characters in and of m length;'(2) starting from n characters in, up to the end of the string;'(3) whole string minus last character;'(4) starting from a known character within the string and of m length;'(5) starting from a known substring within the string and of m length.sentence="the last thing the man said was the"n=10:m=5'(1)Debug.PrintMid(sentence,n,5)'(2)Debug.PrintRight(sentence,Len(sentence)-n+1)'(3)Debug.PrintLeft(sentence,Len(sentence)-1)'(4)k=InStr(1,sentence,"m")Debug.PrintMid(sentence,k,5)'(5)k=InStr(1,sentence,"aid")Debug.PrintMid(sentence,k,5)EndSub
thingthing the man said was thethe last thing the man said was thman said w
s="rosettacode.org"'starting from n characters in and of m lengthWScript.StdOut.WriteLineMid(s,8,4)'starting from n characters in, up to the end of the stringWScript.StdOut.WriteLineMid(s,8,Len(s)-7)'whole string minus last characterWScript.StdOut.WriteLineMid(s,1,Len(s)-1)'starting from a known character within the string and of m lengthWScript.StdOut.WriteLineMid(s,InStr(1,s,"c"),4)'starting from a known substring within the string and of m lengthWScript.StdOut.WriteLineMid(s,InStr(1,s,"ose"),6)
codecode.orgrosettacode.orcodeosetta
c$ = "abcdefghijklmnopqrstuvwxyz"n = 12m = 5// starting from n characters in and of m length;print mid$(c$, n, m)// starting from n characters in, up to the end of the string;print mid$(c$, n)// whole string minus last character;print left$(c$, len(c$) - 1)// starting from a known character within the string and of m length;print mid$(c$, instr(c$, "b"), m)// starting from a known substring within the string and of m length.f$ = "pq"print mid$(c$, instr(c$, f$), m)end
lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxybcdefpqrst
ZX Spectrum Basic has unfortunately no direct way to find a substring within a string, however a similar effect can be done searching with a for loop:
10LETA$="abcdefghijklmnopqrstuvwxyz"15LETn=10:LETm=720PRINTA$(nTOn+m-1)30PRINTA$(nTO)40PRINTA$(TOLEN(A$)-1)50FORi=1TOLEN(A$)60IFA$(i)="g"THENPRINTA$(iTOi+m-1):LETi=LEN(A$):GOTO7070NEXTi80LETB$="ijk"90FORi=1TOLEN(A$)-LEN(B$)+1100IFA$(iTOi+LEN(B$)-1)=B$THENPRINTA$(iTOi+m-1):LETi=LEN(A$)-LEN(B$)+1:GOTO110110NEXTi120STOP
Without superfluous code:
10LETA$="abcdefghijklmnopqrstuvwxyz":LETla=LENA$20LETn=10:LETm=730PRINTA$(nTOn+m-1)40PRINTA$(nTO)50PRINTA$(TOla-1)60FORi=1TOla70IFA$(i)="g"THENPRINTA$(iTOi+m-1):LETi=la80NEXTi90LETB$="ijk":LETlb=LENb$100FORi=1TOla-lb+1110IFA$(iTOi+lb-1)=B$THENPRINTA$(iTOi+m-1):LETi=la-lb+1120NEXTi
jklmnopjklmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyghijklmijklmno
Similar to:J
↑
(take) and↓
(drop) are the main tools to use here. In CBQN these produce a slice type and thus take constant time regardless of the size of the argument or result.
5↑3↓"Marshmallow""shmal"3↓"Marshmallow""shmallow"¯1↓"Marshmallow""Marshmallo"(⊑∘/'m'⊸=)⊸↓"Marshmallow""mallow"(⊑∘/"sh"⊸⍷)⊸↓"Marshmallow""shmallow"
( (basestring = "The five boxing wizards jump quickly")& (n = 10)& (m = 5) { starting from n characters in and of m length: }& @(!basestring:? [(!n+-1) ?substring [(!n+!m+-1) ?)& out$!substring { starting from n characters in, up to the end of the string: }& @(!basestring:? [(!n+-1) ?substring)& out$!substring { whole string minus last character: }& @(!basestring:?substring [-2 ?)& out$!substring { starting from a known character within the string and of m length: }& (char = "w")& @(!basestring:? ([?p !char ?: ?substring [(!p+!m) ?))& out$!substring { starting from a known substring within the string and of m length: }& (find = "iz")& @(!basestring:? ([?p !find ?: ?substring [(!p+!m) ?))& out$!substring&)
boxinboxing wizards jump quicklyThe five boxing wizards jump quicklwizarizard
blsq ) "RosettaCode"5.+"Roset"blsq ) "RosettaCode"5.+2.-"set"blsq ) "RosettaCode""set"ss2blsq ) "RosettaCode"J"set"ss.-"settaCode"blsq ) "RosettaCode"~]"RosettaCod"blsq ) "RosettaCode"[-"osettaCode"
Selecting/Deleting individual characters
blsq ) "RosettaCode"{0 1 3 5}si"Roet"blsq ) "RosettaCode"{0 1 3 5}di"oetaCde"
/* * RosettaCode: Substring, C89 * * In this task display a substring: starting from n characters in and of m * length; starting from n characters in, up to the end of the string; whole * string minus last character; starting from a known character within the * string and of m length; starting from a known substring within the string * and of m length. * * This example program DOES NOT make substrings. The program simply displays * certain parts of the input string. * */#define _CRT_SECURE_NO_WARNINGS/* MSVS compilers need this */#include<stdio.h>#include<stdlib.h>#include<string.h>/* * Put no more than m characters from string to standard output. * * It is worth noting that printf("%*s",width,string) does not limit the number * of characters to be printed. * * @param string null terminated string * @param m number of characters to display */voidputm(char*string,size_tm){while(*string&&m--)putchar(*string++);}intmain(void){charstring[]="Programs for other encodings (such as 8-bit ASCII, or EUC-JP)."intn=3;intm=4;charknownCharacter='(';charknownSubstring[]="encodings";putm(string+n-1,m);putchar('\n');puts(string+n+1);putchar('\n');putm(string,strlen(string)-1);putchar('\n');putm(strchr(string,knownCharacter),m);putchar('\n');putm(strstr(string,knownSubstring),m);putchar('\n');returnEXIT_SUCCESS;}
/* * RosettaCode: Substring, C89, Unicode * * In this task display a substring: starting from n characters in and of m * length; starting from n characters in, up to the end of the string; whole * string minus last character; starting from a known character within the * string and of m length; starting from a known substring within the string * and of m length. * * This example program DOES NOT make substrings. The program simply displays * certain parts of the input string. * */#include<stdio.h>#include<stdlib.h>#include<string.h>/* * Put all characters from string to standard output AND write newline. * BTW, _putws may not be avaliable. */voidput(wchar_t*string){while(*string)putwchar(*string++);putwchar(L'\n');}/* * Put no more than m characters from string to standard output AND newline. */voidputm(wchar_t*string,size_tm){while(*string&&m--)putwchar(*string++);putwchar(L'\n');}intmain(void){wchar_tstring[]=L"Programs for other encodings (such as 8-bit ASCII).";intn=3;intm=4;wchar_tknownCharacter=L'(';wchar_tknownSubstring[]=L"encodings";putm(string+n-1,m);put(string+n+1);putm(string,wcslen(string)-1);putm(wcschr(string,knownCharacter),m);putm(wcsstr(string,knownSubstring),m);returnEXIT_SUCCESS;}
#include<stddef.h>#include<stdio.h>#include<stdlib.h>#include<string.h>char*substring(constchar*s,size_tn,ptrdiff_tm){char*result;/* check for null s */if(NULL==s)returnNULL;/* negative m to mean 'up to the mth char from right' */if(m<0)m=strlen(s)+m-n+1;/* n < 0 or m < 0 is invalid */if(n<0||m<0)returnNULL;/* make sure string does not end before n * and advance the "s" pointer to beginning of substring */for(;n>0;s++,n--)if(*s=='\0')/* string ends before n: invalid */returnNULL;result=malloc(m+1);if(NULL==result)/* memory allocation failed */returnNULL;result[0]=0;strncat(result,s,m);/* strncat() will automatically add null terminator * if string ends early or after reading m characters */returnresult;}char*str_wholeless1(constchar*s){returnsubstring(s,0,strlen(s)-1);}char*str_fromch(constchar*s,intch,ptrdiff_tm){returnsubstring(s,strchr(s,ch)-s,m);}char*str_fromstr(constchar*s,char*in,ptrdiff_tm){returnsubstring(s,strstr(s,in)-s,m);}#define TEST(A) do {\ char *r = (A);\ if (NULL == r)\ puts("--error--");\ else {\ puts(r);\ free(r);\ }\ } while(0)intmain(){constchar*s="hello world shortest program";TEST(substring(s,12,5));// get "short"TEST(substring(s,6,-1));// get "world shortest program"TEST(str_wholeless1(s));// "... progra"TEST(str_fromch(s,'w',5));// "world"TEST(str_fromstr(s,"ro",3));// "rog"return0;}
#include<iostream>#include<string>intmain(){std::strings="0123456789";intconstn=3;intconstm=4;charconstc='2';std::stringconstsub="456";std::cout<<s.substr(n,m)<<"\n";std::cout<<s.substr(n)<<"\n";std::cout<<s.substr(0,s.size()-1)<<"\n";std::cout<<s.substr(s.find(c),m)<<"\n";std::cout<<s.substr(s.find(sub),m)<<"\n";}
usingSystem;namespaceSubString{classProgram{staticvoidMain(string[]args){strings="0123456789";constintn=3;constintm=2;constcharc='3';conststringz="345";// A: starting from n characters in and of m length;Console.WriteLine(s.Substring(n,m));// B: starting from n characters in, up to the end of the string;Console.WriteLine(s.Substring(n,s.Length-n));// C: whole string minus the last character;Console.WriteLine(s.Substring(0,s.Length-1));// D: starting from a known character within the string and of m length;Console.WriteLine(s.Substring(s.IndexOf(c),m));// E: starting from a known substring within the string and of m length.Console.WriteLine(s.Substring(s.IndexOf(z),m));}}}
As of C# 8, we can use the Range syntax. Cases B and C can be written more succinctly.
// B: starting from n characters in, up to the end of the string;Console.WriteLine(s[n..]);// C: whole string minus the last character;Console.WriteLine(s[..^1]);
(defstring"alphabet")(defn2)(defm4)(deflen(countstring));starting from n characters in and of m length;(println(subsstringn(+nm)));phab;starting from n characters in, up to the end of the string;(println(subsstringn));phabet;whole string minus last character;(println(subsstring0(declen)));alphabe;starting from a known character within the string and of m length;(let[pos(.indexOfstring(int\l))](println(subsstringpos(+posm))));lpha;starting from a known substring within the string and of m length.(let[pos(.indexOfstring"ph")](println(subsstringpos(+posm))));phab
identificationdivision.program-id.substring.environmentdivision.configurationsection.repository.functionallintrinsic.datadivision.working-storagesection.01original.05value"this is a string".01startingpic 99value3.01widthpic 99value8.01pospic 99.01enderpic 99.01lookingpic 99.01indicatorpic x.88foundvaluehigh-valuewhensettofalseislow-value.01look-forpic x(8).proceduredivision.substring-main.display"Original |"original"|, n = "starting" m = "widthdisplayoriginal(starting:width)displayoriginal(starting:)displayoriginal(1:length(original)-1)move"a"tolook-formove1tolookingperformfind-positioniffounddisplayoriginal(pos:width)end-ifmove"is a st"tolook-formovelength(trim(look-for))tolookingperformfind-positioniffounddisplayoriginal(pos:width)end-ifgoback.find-position.setfoundtofalsecomputeender=length(original)-lookingperformvaryingposfrom1by1untilpos>enderiforiginal(pos:looking)equallook-forthensetfoundtotrueexitperformend-ifend-perform.endprogramsubstring.
prompt$ cobc -xj substring.cobOriginal |this is a string|, n = 03 m = 08is is ais is a stringthis is a strina stringis a str
<cfoutput><cfsetstr="abcdefg"><cfsetn=2><cfsetm=3><!--- Note: In CF index starts at 1 rather than 0starting from n characters in and of m length --->#mid(str,n,m)#<!--- starting from n characters in, up to the end of the string ---><cfsetcountFromRight=Len(str)-n+1>#right(str,countFromRight)#<!--- whole string minus last character ---><cfsetallButLast=Len(str)-1>#left(str,allButLast)#<!--- starting from a known character within the string and of m length ---><cfsetstartingIndex=find("b",str)>#mid(str,startingIndex,m)#<!--- starting from a known substring within the string and of m length ---><cfsetstartingIndexSubString=find("bc",str)>#mid(str,startingIndexSubString,m)#</cfoutput>
bcdbcdefgabcdefbcdbcd
<cfscript>str="abcdefg";n=2;m=3;// Note: In CF index starts at 1 rather than 0// starting from n characters in and of m lengthwriteOutput(mid(str,n,m));// starting from n characters in, up to the end of the stringcountFromRight=Len(str)-n+1;writeOutput(right(str,countFromRight));// whole string minus last characterallButLast=Len(str)-1;writeOutput(left(str,allButLast));// starting from a known character within the string and of m lengthstartingIndex=find("b",str);writeOutput(mid(str,startingIndex,m));// starting from a known substring within the string and of m lengthstartingIndexSubString=find("bc",str);writeOutput(mid(str,startingIndexSubString,m));</cfscript>
bcdbcdefgabcdefbcdbcd
(let((string"0123456789")(n2)(m3)(start#\5)(substring"34"))(list(subseqstringn(+nm))(subseqstringn)(subseqstring0(1-(lengthstring)))(let((pos(positionstartstring)))(subseqstringpos(+posm)))(let((pos(searchsubstringstring)))(subseqstringpos(+posm)))))
BlackBox Component Builder
MODULESubstrings;IMPORTStdLog,Strings;PROCEDUREDo*;CONSTaStr="abcdefghijklmnopqrstuvwxyz";VARstr:ARRAY128OFCHAR;pos:INTEGER;BEGINStrings.Extract(aStr,3,10,str);StdLog.String("from 3, 10 characters:> ");StdLog.String(str);StdLog.Ln;Strings.Extract(aStr,3,LEN(aStr)-3,str);StdLog.String("from 3, until the end:> ");StdLog.String(str);StdLog.Ln;Strings.Extract(aStr,0,LEN(aStr)-1,str);StdLog.String("whole string but last:> ");StdLog.String(str);StdLog.Ln;Strings.Find(aStr,'d',0,pos);Strings.Extract(aStr,pos+1,10,str);StdLog.String("from 'd', 10 characters:> ");StdLog.String(str);StdLog.Ln;Strings.Find(aStr,"de",0,pos);Strings.Extract(aStr,pos+LEN("de"),10,str);StdLog.String("from 'de', 10 characters:> ");StdLog.String(str);StdLog.Ln;ENDDo;ENDSubstrings.
Execute: ^Q Substrings.Do
from 3, 10 characters:> defghijklmfrom 3, until the end:> defghijklmnopqrstuvwxyzwhole string but last:> abcdefghijklmnopqrstuvwxyfrom 'd', 10 characters:> efghijklmnfrom 'de', 10 characters:> fghijklmno
defsubstring_demo(string,n,m,known_character,known_substring)n-=1putsstring[n...n+m]putsstring[n...]putsstring.rchopknown_character_index=string.index(known_character).not_nil!putsstring[known_character_index...known_character_index+m]known_substring_index=string.index(known_substring).not_nil!putsstring[known_substring_index...known_substring_index+m]endsubstring_demo("crystalline",3,5,'t',"st")
ystalystallinecrystallintallistall
importstd.stdio,std.string;voidmain(){consts="the quick brown fox jumps over the lazy dog";enumn=5,m=3;writeln(s[n..n+m]);writeln(s[n..$]);writeln(s[0..$-1]);consti=s.indexOf("q");writeln(s[i..i+m]);constj=s.indexOf("qu");writeln(s[j..j+m]);}
uicuick brown fox jumps over the lazy dog.The quick brown fox jumps over the lazy dogquiqui
;starting from n characters in and of m length;SUB_STR = STR(n:m) ;starting from n characters in, up to the end of the string;SUB_STR = STR(n,$LEN(STR)) ;whole string minus last character;SUB_STR = STR(1,%TRIM(STR)-1) ;starting from a known character f within the string and of m length;;starting from a known substring f within the string and of m length.SUB_STR = STR(%INSTR(1,STR,f):m)
programShowSubstring;{$APPTYPE CONSOLE}usesSysUtils;consts='0123456789';n=3;m=4;c='2';sub='456';beginWriteln(Copy(s,n,m));// starting from n characters in and of m length;Writeln(Copy(s,n,Length(s)));// starting from n characters in, up to the end of the string;Writeln(Copy(s,1,Length(s)-1));// whole string minus last character;Writeln(Copy(s,Pos(c,s),m));// starting from a known character within the string and of m length;Writeln(Copy(s,Pos(sub,s),m));// starting from a known substring within the string and of m length.end.
23452345678901234567823454567
DuckDB character strings are UTF-8 strings, and stringindexing, and most string functions, such as length(), are based on Unicode code points.
DuckDB has an index origin of 1 for strings, and in this entry, thephrase 'the character n characters in' is taken to mean a characterwith DuckDB index equal to n.
#Forbrevity,wewilluseatable-valuedfunctionfordefiningthesamplestring(s):createorreplacefunctions()astable(select'一二三四五六七八九十'ass);#startingfromncharactersinandofmlength:s[n:n+m-1]selects[1:2]froms();#startingfromncharactersin,uptotheendofthestring:s[n:]selects[9:]froms();#wholestringminusthelastcharacter:.[:-2]selects[0:-2]froms();#startingfromaknowncharacterwithinthestringandofmlength,say2:selects[ix:ix+(2-1)]from(selects,position('五'ins)asixfroms());#startingfromaknownsubstringwithinthestringandofmlength,say2:selects[ix:ix+(2-1)]from(selects,position('五六'ins)asixfroms());#Forclaritywe'll use DuckDB's'list'outputmode:.modelist
s[1:2] = 一二s[9:] = 九十s[0:-2] = 一二三四五六七八九s[ix:(ix + (2 - 1))] = 五六s[ix:(ix + (2 - 1))] = 五六
let s = "0123456789"let n = 3let m = 2let c = '3'let z = "345" // A: starting from n characters in and of m length;print(s.Substring(n, m))// B: starting from n characters in, up to the end of the string;print(s[n..])// C: whole string minus the last character;print(s[..-1])// D: starting from a known character within the string and of m length;print(s.Substring(s.IndexOf(c),m))// E: starting from a known substring within the string and of m length.print(s.Substring(s.IndexOf(z),m))
def string := "aardvarks"def n := 4def m := 4println(string(n, n + m))println(string(n))println(string(0, string.size() - 1))println({string(def i := string.indexOf1('d'), i + m)})println({string(def i := string.startOf("ard"), i + m)})
varkvarksaardvarkdvarardv
a$ = timestr systimeprint substr a$ 12 5print substr a$ 12 99#a$ = "Hallo Österreich!"print substr a$ 1 (len a$ - 1)#c$ = "Ö"m = 2i = 1while substr a$ i 1 <> c$ i += 1.print substr a$ i m#c$ = "re"m = 5i = 1while substr a$ i len c$ <> c$ i += 1.print substr a$ i m
/* In this task display a substring:1. starting from n characters in and of m length;2. starting from n characters in, up to the end of the string;3. whole string minus last character;4. starting from a known character within the string and of m length;5. starting from a known substring within the string and of m length.*/IMPORTSTD;//imports a standard string libraryTheString:='abcdefghij';CharIn:=3;//nStrLength:=4;//mKnownChar:='f';KnownSub:='def';FindKnownChar:=STD.Str.Find(TheString,KnownChar,1);FindKnownSub:=STD.Str.Find(TheString,KnownSub,1);OUTPUT(TheString[Charin..CharIn+StrLength-1]);//task1OUTPUT(TheString[Charin..]);//task2OUTPUT(TheString[1..LENGTH(TheString)-1]);//task3OUTPUT(TheString[FindKnownChar..FindKnownChar+StrLength-1]);//task4OUTPUT(TheString[FindKnownSub..FindKnownSub+StrLength-1]);//task5/* OUTPUTS: defg cdefghij abcdefghi fghi defg*/
module Substrings { void run(String[] args = []) { @Inject Console console; if (args.size < 4) { console.print( $| |Usage: | | xec Substrings <str> <offset> <count> <substr> | ); return; } String s = args[0]; Int n = new Int(args[1]); Int m = new Int(args[2]); String sub = args[3]; Char c = sub[0]; console.print($| |{s .quoted()=} |{substring(s, n, m ).quoted()=} |{substring(s, n ).quoted()=} |{substring(s ).quoted()=} |{substring(s, c, m ).quoted()=} |{substring(s, sub, m).quoted()=} | ); } // starting from n characters in and of m length static String substring(String s, Int n, Int m) { assert 0 <= n <= n+m; return n < s.size ? s[n..<(n+m).notGreaterThan(s.size)] : ""; } // starting from n characters in, up to the end of the string static String substring(String s, Int n) { assert 0 <= n; return s.substring(n); } // whole string minus the last character static String substring(String s) { return s.size > 1 ? s[0..<s.size-1] : ""; } // starting from a known character within the string and of m length static String substring(String s, Char c, Int m){ assert 0 <= m; return substring(s, s.indexOf(c) ?: 0, m); } // starting from a known substring within the string and of m length static String substring(String s, String sub, Int m){ assert 0 <= m; return substring(s, s.indexOf(sub) ?: 0, m); }}
x$ xec doc/examples/Substrings scaryaardvark 5 4 ards .quoted()="scaryaardvark"substring(s, n, m ).quoted()="aard"substring(s, n ).quoted()="aardvark"substring(s ).quoted()="scaryaardvar"substring(s, c, m ).quoted()="arya"substring(s, sub, m).quoted()="ardv"
# by Artyom BologovHt0t0t0t01s/..\(.\{2\}\).*/\1/2s/..\(.*\)/\1/3s/.$//4s/.*\(c.\{2\}\).*/\1/5s/.*\(cd.\{3\}\).*/\1/,pQ
$ ed -s substring.input < substring.ed Newline appendedcdcdefghabcdefgcdecdefg
#import <Foundation/Foundation.h>intmain()autoreleasepoolstr:='abcdefgh'n:=2m:=3Log('%@',str[0..str.length-1])// abcdefghLog('%@',str[n..m])// cdLog('%@',str[n..str.length-1])// cdefghLog('%@',str.substringFromIndex:n)// cdefghLog('%@',str[(str.rangeOfString:'b').location..m])// bcdreturn0
Substring feature explainer video - 1 min
Each task in the description is coded with one or two lines of setup and then a pair of assertions to ensure that the proper result from the substring call. Because the {STRING} class in Eiffel is high-level, it covers both 8-bit and 32-bit strings (ASC and Unicode). If one wants to specifically code for Unicode, all one really needs to do is use {STRING_32} in the space of {STRING}.
classRC_SUBSTRING_TEST_SETinheritTEST_SET_SUPPORTfeature-- Test routinesrc_substring_test-- New test routinenotetask:"[Display a substring:- starting from n characters in and of m length;- starting from n characters in, up to the end of the string;- whole string minus the last character;- starting from a known character within the string and of m length;- starting from a known substring within the string and of m length.]"testing:"execution/isolated","execution/serial"localstr,str2:STRINGn,m:INTEGERdostr:="abcdefgh"m:=2-- starting from n characters in and of m length;n:=str.index_of('e',1)str2:=str.substring(n,n+m-1)assert_strings_equal("start_n","ef",str2)assert_integers_equal("m_length_1",2,str2.count)-- starting from n characters in, up to the end of the string;str2:=str.substring(n,n+(str.count-n))assert_strings_equal("start_n_to_end","efgh",str2)assert_integers_equal("len_1a",4,str2.count)-- whole string minus the last character;str2:=str.substring(1,str.count-1)assert_strings_equal("one_less_than_whole","abcdefg",str2)assert_integers_equal("len_1b",7,str2.count)-- starting from a known character within the string and of m length;n:=str.index_of('d',1)str2:=str.substring(n,n+m-1)assert_strings_equal("known_char","de",str2)assert_integers_equal("m_length_2",2,str2.count)-- starting from a known substring within the string and of m length.n:=str.substring_index("bc",1)str2:=str.substring(n,n+m-1)assert_strings_equal("known_substr","bc",str2)assert_integers_equal("m_length_3",2,str2.count)endend
ELENA 4.x :
import extensions; public program(){ var s := "0123456789"; var n := 3; var m := 2; var c := $51; var z := "345"; console.writeLine(s.Substring(n, m)); console.writeLine(s.Substring(n, s.Length - n)); console.writeLine(s.Substring(0, s.Length - 1)); console.writeLine(s.Substring(s.indexOf(0, c), m)); console.writeLine(s.Substring(s.indexOf(0, z), m))}
3434567890123456783434
s="abcdefgh"String.slice(s,2,3)#=> "cde"String.slice(s,1..3)#=> "bcd"String.slice(s,-3,2)#=> "fg"String.slice(s,3..-1)#=> "defgh"# UTF-8s="αβγδεζηθ"String.slice(s,2,3)#=> "γδε"String.slice(s,1..3)#=> "βγδ"String.slice(s,-3,2)#=> "ζη"String.slice(s,3..-1)#=> "δεζηθ"
Interactive session in Erlang shell showing built in functions doing the task.
1> N = 3. 2> M = 5.3> string:sub_string( "abcdefghijklm", N )."cdefghijklm"4> string:sub_string( "abcdefghijklm", N, N + M - 1 )."cdefg"6> string:sub_string( "abcdefghijklm", 1, string:len("abcdefghijklm") - 1 )."abcdefghijkl"7> Start_character = string:chr( "abcdefghijklm", $e ).8> string:sub_string( "abcdefghijklm", Start_character, Start_character + M - 1 )."efghi"9> Start_string = string:str( "abcdefghijklm", "efg" ).10> string:sub_string( "abcdefghijklm", Start_string, Start_string + M - 1 )."efghi"
sequence baseString, subString, findStringinteger findCharinteger m, nbaseString = "abcdefghijklmnopqrstuvwxyz"-- starting from n characters in and of m length;n = 12m = 5subString = baseString[n..n+m-1]puts(1, subString )puts(1,'\n')-- starting from n characters in, up to the end of the string;n = 12subString = baseString[n..$]puts(1, subString )puts(1,'\n')-- whole string minus last character;subString = baseString[1..$-1]puts(1, subString )puts(1,'\n')-- starting from a known character within the string and of m length;findChar = 'o'm = 5n = find(findChar,baseString)subString = baseString[n..n+m-1]puts(1, subString )puts(1,'\n')-- starting from a known substring within the string and of m length.findString = "pq"m = 5n = match(findString,baseString)subString = baseString[n..n+m-1]puts(1, subString )puts(1,'\n')
lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyopqrspqrst
[<EntryPoint>]letmainargs=lets="一二三四五六七八九十"letn,m=3,2letc='六'letz="六七八"printfn"%s"(s.Substring(n,m))printfn"%s"(s.Substring(n))printfn"%s"(s.Substring(0,s.Length-1))printfn"%s"(s.Substring(s.IndexOf(c),m))printfn"%s"(s.Substring(s.IndexOf(z),m))0
四五四五六七八九十一二三四五六七八九六七六七
USING:mathsequenceskernel;! starting from n characters in and of m length:subseq*(fromlengthseq--newseq)[over+]dipsubseq;! starting from n characters in, up to the end of the string:dummy(seqn--tailseq)tail;! whole string minus last character:dummy1(seq--headseq)but-last;USING:frysequenceskernel;! helper word:subseq-from-*(subseqlenseqquot--seq)[nip]prepose2keepsubseq*;inline! starting from a known character within the string and of m length;:subseq-from-char(charlenseq--seq)[index]subseq-from-*;! starting from a known substring within the string and of m length.:subseq-from-seq(subseqlenseq--seq)[start]subseq-from-*;
VBA/Python programmer's approach not sure if it's the most falconic way
/* created by Aykayayciti Earl Lamont MontgomeryApril 9th, 2018 */s = "FalconPL is not just a multi-paradign language but also fun"n = 12m = 5> "starting from n characters in and of m length: ", s[n:n+m]> "starting from n characters in, up to the end of the string: ", s[n:]> "whole string minus last character: ", s[0:len(s)-1]new_n = s.find("j", 0)> "starting from a known character within the string and of m length: ", s[new_n:new_n+m]new_n = s.find("mu", 0)> "starting from a known character within the string and of m length: ", s[new_n:new_n+m]
starting from n characters in and of m length: not jstarting from n characters in, up to the end of the string: not just a multi-paradign language but also funwhole string minus last character: FalconPL is not just a multi-paradign language but also fustarting from a known character within the string and of m length: just starting from a known character within the string and of m length: multi[Finished in 2.3s]
/STRING and SEARCH are standard words.SCAN is widely implemented. Substrings represented by address/length pairs require neither mutation nor allocation.
2constantPos3constantLen:Str( -- c-addr u )s"abcdefgh";StrPos/stringdropLentype\ cdeStrPos/stringtype\ cdefghStr1-type\ abcdefgStrchardscandropLentype\ defStrs"de"search2dropLentype\ def
programtest_substringcharacter(*),parameter::string='The quick brown fox jumps over the lazy dog.'character(*),parameter::substring='brown'character,parameter::c='q'integer,parameter::n=5integer,parameter::m=15integer::i! Display the substring starting from n characters in and of length m.write(*,'(a)')string(n:n+m-1)! Display the substring starting from n characters in, up to the end of the string.write(*,'(a)')string(n:)! Display the whole string minus the last character.i=len(string)-1write(*,'(a)')string(:i)! Display the substring starting from a known character and of length m.i=index(string,c)write(*,'(a)')string(i:i+m-1)! Display the substring starting from a known substring and of length m.i=index(string,substring)write(*,'(a)')string(i:i+m-1)end programtest_substring
quick brown foxquick brown fox jumps over the lazy dog.The quick brown fox jumps over the lazy dogquick brown foxbrown fox jumps
Note that in Fortran positions inside character strings are one-based, i. e. the first character is in position one.
s[n..n+m]s[n..high(nativeUInt)]s[1..length(s)-1]s[pos(c,s)..pos(c,s)+m]s[pos(p,s)..pos(p,s)+m]
Although Frink runs on a Java Virtual Machine (JVM), its string operations likesubstr
orindexOf
do not have the broken behavior of Java on high Unicode characters. These return correct values for all Unicode codepoints.
String indices are zero-based.
test = "🐱abcdefg😾"n = 3m = 2println[substrLen[test, n, m]]println[right[test, -m]]println[left[test, -1]]pos = indexOf["c"]if pos != -1 println[substrLen[test, pos, m]]pos = indexOf[test, "cd"]if pos != -1 println[substrLen[test, pos, m]]
cdbcdefg😾🐱abcdefgcdcd
include "NSLog.incl"void local fn DoIt CFStringRef string = @"abcdefghijklmnopqrstuvwxyz" NSLog(@"%@",mid(string,3,6)) NSLog(@"%@",fn StringSubstringFromIndex( string, 10 )) NSLog(@"%@",left(string,len(string)-1)) CFRange range = fn StringRangeOfString( string, @"r" ) NSLog(@"%@",mid(string,range.location,6)) range = fn StringRangeOfString( string, @"pqr" ) NSLog(@"%@",mid(string,range.location,7))end fnfn DoItHandleEvents
defghiklmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyrstuvwpqrstuv
LETTERS;# "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz"LETTERS{[5..10]};# "EFGHIJ"
The task originally had no mention of unicode. This solution works with ASCII data.
packagemainimport("fmt""strings")funcmain(){s:="ABCDEFGH"n,m:=2,3// for referencefmt.Println("Index: ","01234567")fmt.Println("String:",s)// starting from n characters in and of m lengthfmt.Printf("Start %d, length %d: %s\n",n,m,s[n:n+m])// starting from n characters in, up to the end of the stringfmt.Printf("Start %d, to end: %s\n",n,s[n:])// whole string minus last characterfmt.Printf("All but last: %s\n",s[:len(s)-1])// starting from a known character within the string and of m lengthdx:=strings.IndexByte(s,'D')fmt.Printf("Start 'D', length %d: %s\n",m,s[dx:dx+m])// starting from a known substring within the string and of m lengthsx:=strings.Index(s,"DE")fmt.Printf(`Start "DE", length %d: %s`+"\n",m,s[sx:sx+m])}
Index: 01234567String: ABCDEFGHStart 2, length 3: CDEStart 2, to end: CDEFGHAll but last: ABCDEFGStart 'D', length 3: DEFStart "DE", length 3: DEF
Strings are generally handled as UTF-8 in Go.
packagemainimport("fmt""strings")funcmain(){s:="αβγδεζηθ"r:=[]rune(s)n,m:=2,3kc:='δ'// known characterks:="δε"// known string// for referencefmt.Println("Index: ","01234567")fmt.Println("String:",s)// starting from n characters in and of m lengthfmt.Printf("Start %d, length %d: %s\n",n,m,string(r[n:n+m]))// starting from n characters in, up to the end of the stringfmt.Printf("Start %d, to end: %s\n",n,string(r[n:]))// whole string minus last characterfmt.Printf("All but last: %s\n",string(r[:len(r)-1]))// starting from a known character within the string and of m lengthdx:=strings.IndexRune(s,kc)fmt.Printf("Start %q, length %d: %s\n",kc,m,string([]rune(s[dx:])[:m]))// starting from a known substring within the string and of m lengthsx:=strings.Index(s,ks)fmt.Printf("Start %q, length %d: %s\n",ks,m,string([]rune(s[sx:])[:m]))}
Index: 01234567String: αβγδεζηθStart 2, length 3: γδεStart 2, to end: γδεζηθAll but last: αβγδεζηStart 'δ', length 3: δεζStart "δε", length 3: δεζ
'abcdefgh':s;2:a;3:b;s a> b< putss a> putss); putss.'d'?> b< putss.'cd'/0=,> b< puts'':n
cdecdefghabcdefgdefcde
Strings in Groovy are 0-indexed.
defstr='abcdefgh'defn=2defm=3// #1printlnstr[n..n+m-1]/* or */printlnstr[n..<(n+m)]// #2printlnstr[n..-1]// #3printlnstr[0..-2]// #4defindex1=str.indexOf('d')printlnstr[index1..index1+m-1]/* or */printlnstr[index1..<(index1+m)]// #5defindex2=str.indexOf('de')printlnstr[index2..index2+m-1]/* or */printlnstr[index2..<(index2+m)]
A string in Haskell is a list of chars: [Char]
*Main> take 3 $ drop 2 "1234567890""345"*Main> drop 2 "1234567890""34567890"*Main> init "1234567890""123456789"
t45ncs|nullsub=[]|otherwise=taken.head$subwheresub=filter(isPrefixOfc)$tailss
*Main> t45 3 "4" "1234567890""456"*Main> t45 3 "45" "1234567890""456"*Main> t45 3 "31" "1234567890"""
Testing with an extended set of characters, and usingData.Text functions, includingbreakOn:
{-# LANGUAGE OverloadedStrings #-}importqualifiedData.TextasT(Text,take,drop,init,breakOn)importqualifiedData.Text.IOasO(putStrLn)fromMforN::Int->Int->T.Text->T.TextfromMforNnms=T.takem(T.dropns)fromNtoEnd::Int->T.Text->T.TextfromNtoEnd=T.dropallButLast::T.Text->T.TextallButLast=T.initfromCharForN,fromStringForN::Int->T.Text->T.Text->T.TextfromCharForNmneedlehaystack=T.takem$snd$T.breakOnneedlehaystackfromStringForN=fromCharForN-- TEST ---------------------------------------------------main::IO()main=mapM_O.putStrLn([fromMforN910,fromNtoEnd20,allButLast,fromCharForN6"话",fromStringForN6"大势"]<*>["天地不仁仁者人也🐒话说天下大势分久必合🍑合久必分🔥"])
话说天下大势分久必合合久必分🔥天地不仁仁者人也🐒话说天下大势分久必合🍑合久必分话说天下大势大势分久必合
CHARACTER :: string = 'ABCDEFGHIJK', known = 'B', substring = 'CDE'REAL, PARAMETER :: n = 5, m = 8 WRITE(Messagebox) string(n : n + m - 1), "| substring starting from n, length m"WRITE(Messagebox) string(n :), "| substring starting from n, to end of string"WRITE(Messagebox) string(1: LEN(string)-1), "| whole string minus last character"pos_known = INDEX(string, known)WRITE(Messagebox) string(pos_known : pos_known+m-1), "| substring starting from pos_known, length m"pos_substring = INDEX(string, substring)WRITE(Messagebox) string(pos_substring : pos_substring+m-1), "| substring starting from pos_substring, length m"
proceduremain(arglist)write("Usage: substring <string> <first position> <second position> <single character> <substring>")s:=\arglist[1]|"aardvarks"n:=\arglist[2]|5m:=\arglist[3]|4c:=\arglist[4]|"d"ss:=\arglist[5]|"ard"write(s[n+:m])write(s[n:0])write(s[1:-1])write(s[find(c,s)+:m])write(s[find(ss,s)+:m])end
5{.3}.'Marshmallow'shmal3}.'Marshmallow'shmallow}.'Marshmallow'arshmallow}:'Marshmallow'Marshmallo5{.(}.~i.&'m')'Marshmallow'mallo5{.(}.~I.@E.~&'sh')'Marshmallow'shmal
Note that there are other, sometimes better, ways of accomplishing this task.
'Marshmallow'{~(+i.)/35shmal
Or, probably more efficient when the desired substring is large:
(,.35)];.0'Marshmallow'shmal
Thetaketo
/takeafter
anddropto
/dropafter
utilities from thestrings
script further simplify these types of tasks.
require'strings''sh'dropto'Marshmallow'shmallow5{.'sh'dropto'Marshmallow'shmal'sh'takeafter'Marshmallow'mallow
Note also that these operations work the same way on lists of numbers that they do on this example list of characters.
3}.235711131719711131719711dropafter235711131719235711
publicstaticStringSubstring(Stringstr,intn,intm){returnstr.substring(n,n+m);}publicstaticStringSubstring(Stringstr,intn){returnstr.substring(n);}publicstaticStringSubstring(Stringstr){returnstr.substring(0,str.length()-1);}publicstaticStringSubstring(Stringstr,charc,intm){returnstr.substring(str.indexOf(c),str.indexOf(c)+m+1);}publicstaticStringSubstring(Stringstr,Stringsub,intm){returnstr.substring(str.indexOf(sub),str.indexOf(sub)+m+1);}
TheString
object has two similar methods:substr
andsubstring
.
substr(start, [len])
returns a substring beginning at a specified location and having a specified length.substring(start, [end])
returns a string containing the substring fromstart
up to,but not including,end
.varstr="abcdefgh";varn=2;varm=3;// * starting from n characters in and of m length;str.substr(n,m);// => "cde"// * starting from n characters in, up to the end of the string;str.substr(n);// => "cdefgh"str.substring(n);// => "cdefgh"// * whole string minus last character;str.substring(0,str.length-1);// => "abcdefg"// * starting from a known character within the string and of m length;str.substr(str.indexOf('b'),m);// => "bcd"// * starting from a known substring within the string and of m length.str.substr(str.indexOf('bc'),m);// => "bcd"
Or, in terms of some familiar functional primitives, translating broadly from Haskell:
(function(){'usestrict';//take::Int->Text->Textfunctiontake(n,s){returns.substr(0,n);}//drop::Int->Text->Textfunctiondrop(n,s){returns.substr(n);}//init::Text->Textfunctioninit(s){varn=s.length;return(n>0?s.substr(0,n-1):undefined);}//breakOn::Text->Text->(Text,Text)functionbreakOn(strPattern,s){vari=s.indexOf(strPattern);returni===-1?[strPattern,'']:[s.substr(0,i),s.substr(i)];}varstr='一二三四五六七八九十';returnJSON.stringify({'fromnin,ofmlength':(function(n,m){returntake(m,drop(n,str));})(4,3),'fromnin,uptoend':(function(n){returndrop(n,str);})(3),'allbutlast':init(str),'frommatchingchar,ofmlength':(function(pattern,s,n){returntake(n,breakOn(pattern,s)[1]);})('五',str,3),'frommatchingstring,ofmlength':(function(pattern,s,n){returntake(n,breakOn(pattern,s)[1]);})('六七',str,4)},null,2);})();
{"from n in, of m length":"五六七","from n in, up to end":"四五六七八九十","all but last":"一二三四五六七八九","from matching char, of m length":"五六七","from matching string, of m length":"六七八九"}
For this exercise we use the Chinese characters for 1 to 10, the character for "10" being "十":
def s: "一二三四五六七八九十";
jq strings are UTF-8 strings, and array-based string indexing andmost string functions, such as length/0, are based on Unicode codepoints. However, the function index/1 currently uses character counts when its input is a string, and therefore in the following we use ix/1 defined as follows:
def ix(s): explode | index(s|explode);
(Users who have access to the regex function match/1 can use it, as illustrated in the comments below.)
Since jq arrays and strings have an index origin of 0, "n characters in" is interpreted to require an index of (n+1).
# starting from n characters in and of m length: .[n+1: n+m+1]"s[1:2] => \( s[1:2] )", # starting from n characters in, up to the end of the string: .[n+1:]"s[9:] => \( s[9:] )",# whole string minus last character: .[0:length-1]"s|.[0:length-1] => \(s | .[0:length-1] )",# starting from a known character within the string and of m length: # jq 1.4: ix(c) as $i | .[ $i: $i + m] # jq>1.4: match(c).offset as $i | .[ $i: $i + m]"s | ix(\"五\") as $i | .[$i: $i + 1] => \(s | ix("五") as $i | .[$i: $i + 1] )",# starting from a known substring within the string and of m length: # jq 1.4: ix(sub) as $i | .[ $i: $i + m] # jq>1.4: match(sub).offset as $i | .[ $i: $i + m]"s | ix(\"五六\") as $i | .[$i: $i + 2] => " + "\( s | ix("五六") as $i | .[$i: $i + 2] )"
$jq-M-n-r-fSubstring.jqs[1:2]=>二s[9:]=>十s|.[0:length-1]=>一二三四五六七八九s|ix("五")as$i|.[$i:$i+1]=>五s|ix("五六")as$i|.[$i:$i+2]=>五六
#!/usr/local/bin/jsish -u %svarstr="abcdefgh";varn=2;varm=3;// In jsish, semi-colon first character lines are echoed with result;str;;n;;m;// * starting from n characters in and of m length;;str.substr(n,m);// * starting from n characters in, up to the end of the string;;str.substr(n);;str.substring(n);// * whole string minus last character;;str.substring(0,str.length-1);// * starting from a known character within the string and of m length;;str.substr(str.indexOf('b'),m);// * starting from a known substring within the string and of m length.;str.substr(str.indexOf('bc'),m);/* Functional */varres=(function(){'use strict';// take :: Int -> Text -> Textfunctiontake(n,s){returns.substr(0,n);}// drop :: Int -> Text -> Textfunctiondrop(n,s){returns.substr(n);}// init :: Text -> Textfunctioninit(s){varn=s.length;return(n>0?s.substr(0,n-1):undefined);}// breakOn :: Text -> Text -> (Text, Text)functionbreakOn(strPattern,s){vari=s.indexOf(strPattern);returni===-1?[strPattern,'']:[s.substr(0,i),s.substr(i)];}varstr='abcdefgh';returnJSON.stringify({'from 4 in, of 3 length':(function(n,m){returntake(m,drop(n,str));})(4,3),'from 3 in, up to end':(function(n){returndrop(n,str);})(3),'all but last':init(str),'from matching b, of length 3':(function(pattern,s,n){returntake(n,breakOn(pattern,s)[1]);})('b',str,3),'from matching bc, of length 4':(function(pattern,s,n){returntake(n,breakOn(pattern,s)[1]);})('bc',str,4)},true);})();;res;
prompt$ jsish --U substringing.jsistr ==> abcdefghn ==> 2m ==> 3str.substr(n, m) ==> cdestr.substr(n) ==> cdefghstr.substring(n) ==> cdefghstr.substring(0, str.length - 1) ==> abcdefghstr.substr(str.indexOf('b'), m) ==> bcdstr.substr(str.indexOf('bc'), m) ==> bcdres ==> { "all but last":"abcdefg", "from 3 in, up to end":"defgh", "from 4 in, of 3 length":"efg", "from matching b, of length 3":"bcd", "from matching bc, of length 4":"bcde" }prompt$ jsish -u -update true substringing.jsiCreated substringing.jsiprompt$ jsish -u substringing.jsi[PASS] substringing.jsi
The initial --U is a run with echo mode. The-u -update true puts jsish in unit test mode, and will add a comparison block. After the test pass, the code file is changed to
#!/usr/local/bin/jsish -u %svarstr="abcdefgh";varn=2;varm=3;// In jsish, semi-colon first character lines are echoed with result;str;;n;;m;// * starting from n characters in and of m length;;str.substr(n,m);// * starting from n characters in, up to the end of the string;;str.substr(n);;str.substring(n);// * whole string minus last character;;str.substring(0,str.length-1);// * starting from a known character within the string and of m length;;str.substr(str.indexOf('b'),m);// * starting from a known substring within the string and of m length.;str.substr(str.indexOf('bc'),m);/* Functional */varres=(function(){'use strict';// take :: Int -> Text -> Textfunctiontake(n,s){returns.substr(0,n);}// drop :: Int -> Text -> Textfunctiondrop(n,s){returns.substr(n);}// init :: Text -> Textfunctioninit(s){varn=s.length;return(n>0?s.substr(0,n-1):undefined);}// breakOn :: Text -> Text -> (Text, Text)functionbreakOn(strPattern,s){vari=s.indexOf(strPattern);returni===-1?[strPattern,'']:[s.substr(0,i),s.substr(i)];}varstr='abcdefgh';returnJSON.stringify({'from 4 in, of length 3':(function(n,m){returntake(m,drop(n,str));})(4,3),'from 3 in, up to end':(function(n){returndrop(n,str);})(3),'all but last':init(str),'from matching b, of length 3':(function(pattern,s,n){returntake(n,breakOn(pattern,s)[1]);})('b',str,3),'from matching bc, of length 4':(function(pattern,s,n){returntake(n,breakOn(pattern,s)[1]);})('bc',str,4)},true);})();;res;/*=!EXPECTSTART!=str ==> abcdefghn ==> 2m ==> 3str.substr(n, m) ==> cdestr.substr(n) ==> cdefghstr.substring(n) ==> cdefghstr.substring(0, str.length - 1) ==> abcdefghstr.substr(str.indexOf('b'), m) ==> bcdstr.substr(str.indexOf('bc'), m) ==> bcdres ==> { "all but last":"abcdefg", "from 3 in, up to end":"defgh", "from 4 in, of length 3":"efg", "from matching b, of length 3":"bcd", "from matching bc, of length 4":"bcde" }=!EXPECTEND!=*/
By default, the type of the string is infered from its elements. In the example below, the string s is an ASCII string. In order to interpret the string as an UTF8 string with logical access to its argument, one should useCharString("/\ʕ•ᴥ•ʔ/\"...)
. Without the CharString declaration, the string is interpreted as an UTF8 string with access through its byte representation.
julia>s="abcdefg""abcdefg"julia>n=33julia>s[n:end]"cdefg"julia>m=22julia>s[n:n+m]"cde"julia>s[1:end-1]"abcdef"julia>s[search(s,'c')]'c'julia>s[search(s,'c'):search(s,'c')+m]"cde"
Strings in Kotlin are 0-indexed:
// version 1.0.6funmain(args:Array<String>){vals="0123456789"valn=3valm=4valc='5'valz="12"vari:Intprintln(s.substring(n,n+m))println(s.substring(n))println(s.dropLast(1))i=s.indexOf(c)println(s.substring(i,i+m))i=s.indexOf(z)println(s.substring(i,i+m))}
3456345678901234567856781234
#!/bin/ksh# Display a substring:#- starting from n characters in and of m length;#- starting from n characters in, up to the end of the string;#- whole string minus the last character;#- starting from a known character within the string and of m length;#- starting from a known substring within the string and of m length.## Variables:#str='solve this task according to the task description,'integern=6m=14ch='v'substr='acc'## Functions:### Function _length(str, start, length) - return substr from start,## length chars long (length=-1 = end-of-str)#function_length{typeset_str;_str="$1"typeset_st;integer_st=$2typeset_ln;integer_ln=$3((_ln==-1))&&echo"${_str:${_st}}"echo"${_str:${_st}:${_ln}}"}####### main #######print--"--String (Length:${#str} chars):"print"${str}\n"print--"--From char${n} and${m} chars in length:"_length"${str}"${n}${m}echoprint--"--From char${n} to the end:"_length"${str}"${n}-1print--"--Last character removed:"# Strings in ksh are zero based_length"${str}"0$((${#str}-1))echoprint--"-From char:'${ch}' and${m} chars in length:"foo=${str%${ch}*}_length"${str}"${#foo}${m}echoprint--"-From substr:'${substr}' and${m} chars in length:"foo=${str%${substr}*}_length"${str}"${#foo}${m}echo
--String (Length: 50 chars):solve this task according to the task description,
--From char 6 and 14 chars in length:this task acco
--From char 6 to the end:this task according to the task description,
--Last character removed:solve this task according to the task description
-From char:'v' and 14 chars in length:ve this task a
-From substr:'acc' and 14 chars in length:according to t
To enhance readability, this task was split into two separate GUI's. In the second, note that "Known Substring" can be a single character.
1:
2:
{S.slice12hellobravenewworld}->bravenew{W.slice411www.rosetta.org}->rosetta
$txt = The Lang programming language!$n = 9$m = 11$c = p$searchTxt = progfn.println(fn.substring($txt, $n, parser.op($n + $m)))# Output: programmingfn.println(fn.substring($txt, $n))# Output: programming language!fn.println(fn.substring($txt, 0, parser.op(fn.len($txt) - 1)))# Output: The Lang programming languagefn.println(fn.substring($txt, fn.indexOf($txt, $c), parser.op(fn.indexOf($txt, $c) + $m)))# Output: programmingfn.println(fn.substring($txt, fn.indexOf($txt, $searchTxt), parser.op(fn.indexOf($txt, $searchTxt) + $m)))# Output: programming
: cr "\n". ; [] '__A set : dip swap __A swap 1 compress append '__A set execute __A -1 extract nip ; : nip swap drop ; : tuck swap over ; : -rot rot rot ; : 0= 0 == ; : 1+ 1 + ;: 2dip swap 'dip dip ; : 2drop drop drop ; : |a,b> over - iota + ; : bi* 'dip dip execute ; : bi@ dup bi* ;: comb "" split ; : concat "" join ; : empty? length 0= ; : tail over lensize |a,b> subscript ;: lensize length nip ; : while do 'dup dip 'execute 2dip rot if dup 2dip else break then loop 2drop ;: <substr> comb -rot over + |a,b> subscript concat ;: str-tail tail concat ;: str-index : 2streq 2dup over lensize iota subscript eq '* reduce ; swap 'comb bi@ length -rot 0 -rot "2dup 'lensize bi@ <=" "2streq if 0 reshape else '1+ 2dip 0 extract drop then" while empty? if 2drop tuck == if drop -1 then else 4 ndrop -1 then ;'abcdefgh 'str set 2 'n set 3 'm setn m str <substr>str comb n str-tailstr "d" str-index m str <substr>str "de" str-index m str <substr>
local(str='The quick grey rhino jumped over the lazy green fox.')//starting from n characters in and of m length;#str->substring(16,5)//rhino//starting from n characters in, up to the end of the string#str->substring(16)//rhino jumped over the lazy green fox.//whole string minus last character#str->substring(1,#str->size-1)//The quick grey rhino jumped over the lazy green fox//starting from a known character within the string and of m length;#str->substring(#str->find('g'),10)//grey rhino//starting from a known substring within the string and of m length#str->substring(#str->find('rhino'),12)//rhino jumped
From the LFE REPL:
>(setn3)3>(setm5)5>(string:sub_string"abcdefghijklm"n)"cdefghijklm">(string:sub_string"abcdefghijklm"n(+nm-1))"cdefg">(string:sub_string"abcdefghijklm"1(-(length"abcdefghijklm")1))"abcdefghijkl">(setchar-index(string:chr"abcdefghijklm"#\e))5>(string:sub_string"abcdefghijklm"char-index(+char-indexm-1))"efghi">(setstart-str(string:str"abcdefghijklm""efg"))5>(string:sub_string"abcdefghijklm"start-str(+start-strm-1))"efghi"
str = "The quick brown fox jumps over the lazy dog"-- starting from n characters in and of m lengthn = 5m = 11put str.char[n..n+m-1]-- "quick brown"-- starting from n characters in, up to the end of the stringn = 11put str.char[n..str.length]-- "brown fox jumps over the lazy dog"-- whole string minus last characterput str.char[1..str.length-1]-- "The quick brown fox jumps over the lazy do"-- starting from a known character within the string and of m lengthc = "x"m = 7pos = offset(c, str)put str.char[pos..pos+m-1]-- "x jumps"-- starting from a known substring within the string and of m lengthsub = "fox"m = 9pos = offset(sub, str)put str.char[pos..pos+m-1]-- "fox jumps"
put "pple" into xanswer char 2 to char 5 of x // n = 2, m=5answer char 2 to len(x) of x // n = 2, m = len(x), can also use -1answer char 1 to -2 of x // n = 1, m = 1 less than length of stringanswer char offset("p",x) to -1 of x // known char "p" to end of stringanswer char offset("pl",x) to -1 of x // known "pl" to end of string
n.b. Offset also supports a third parameter "charsToSkip" allowing you to loop through subsequent matches of the substring.
The following are defined to behave similarly to the built-in index operator ITEM. As with most Logo list operators, these are designed to work for both words (strings) and lists.
to items :n :thing if :n >= count :thing [output :thing] output items :n butlast :thingendto butitems :n :thing if or :n <= 0 empty? :thing [output :thing] output butitems :n-1 butfirst :thingendto middle :n :m :thing output items :m-(:n-1) butitems :n-1 :thingendto lastitems :n :thing if :n >= count :thing [output :thing] output lastitems :n butfirst :thingendto starts.with :sub :thing if empty? :sub [output "true] if empty? :thing [output "false] if not equal? first :sub first :thing [output "false] output starts.with butfirst :sub butfirst :thingendto members :sub :thing output cascade [starts.with :sub ?] [bf ?] :thingend; note: Logo indices start at onemake "s "abcdefghprint items 3 butitems 2 :s ; cdeprint middle 3 5 :s ; cdeprint butitems 2 :s ; cdefghprint butlast :s ; abcdefgprint items 3 member "d :s ; defprint items 3 members "de :s ; def
Using atoms for representing strings and usng the same sample data as e.g. in the Java solution:
:-object(substring). :-public(test/5). test(String,N,M,Character,Substring):-sub_atom(String,N,M,_,Substring1),write(Substring1),nl,sub_atom(String,N,_,0,Substring2),write(Substring2),nl,sub_atom(String,0,_,1,Substring3),write(Substring3),nl,% there can be multiple occurences of the characteronce(sub_atom(String,Before4,1,_,Character)),sub_atom(String,Before4,M,_,Substring4),write(Substring4),nl,% there can be multiple occurences of the substringonce(sub_atom(String,Before5,_,_,Substring)),sub_atom(String,Before5,M,_,Substring5),write(Substring5),nl.:-end_object.
| ?- ?- substring::test('abcdefgh', 2, 3, 'b', 'bc').cdecdefghabcdefgbcdbcdyes
str="abcdefghijklmnopqrstuvwxyz"n,m=5,15print(string.sub(str,n,m))-- efghijklmnoprint(string.sub(str,n,-1))-- efghijklmnopqrstuvwxyzprint(string.sub(str,1,-2))-- abcdefghijklmnopqrstuvwxypos=string.find(str,"i")ifpos~=nilthenprint(string.sub(str,pos,pos+m))end-- ijklmnopqrstuvwxpos=string.find(str,"ijk")ifpos~=nilthenprint(string.sub(str,pos,pos+m))end-- ijklmnopqrstuvwx-- Alternative (more modern) notationprint(str:sub(n,m))-- efghijklmnoprint(str:sub(n))-- efghijklmnopqrstuvwxyzprint(str:sub(1,-2))-- abcdefghijklmnopqrstuvwxypos=str:find"i"ifposthenprint(str:sub(pos,pos+m))end-- ijklmnopqrstuvwxpos=str:find"ijk"ifposthenprint(str:sub(pos,pos+m))endd-- ijklmnopqrstuvwx
By default a sting can contain anything, and has a maximum length of 2GBytes. Literals are always UTF-16LE. Print/edit done as UTF-16LE. But we can use Str$(a_string) to convert UTF-16LE to Ansi, using Locale id. To display it we can use Chr$(a_String), to convert back to UTF-16LE. Mid$, Right$, Left$, Instr,RInstr works for Ansi using "as byte". For Utf16-le, we get next 16bit value, not exactly next char, but for many languages it is exactly next char.
Function for length always return length as Words (two bytes), so we can get half, if we have an odd number of ansi characters. For Utf16-le there is another Len function,Len.Disp which returns the needed positions for displaying characters. So Print LEN.DISP("aããz")=4 : Print Len("̃ãz")=4
Module CheckAnsi { \\ ANSI STRING Locale 1033 \\ convert UTF16-LE to ANSI 8bit s$ =Str$("ABCDEFG") Print Len(s$)=3.5 ' 3.5 words, means 7 bytes (3.5*2) AnsiLen=Len(s$)*2 ' From 4th byte get 3 bytes n=4 m=3 substring$=Mid$(s$, n, m as byte) substring2End$=Mid$(s$, n , AnsiLen as byte) substringMinusOne$=Left$(s$, AnsiLen-1 as byte) substringFromKnownCharacter$=Mid$(s$, Instr(s$, str$("B") as byte) , m as byte) substringFromKnownSubstring$=Mid$(s$, Instr(s$, str$("BC") as byte) , m as byte) Print Len(substring$)*2=m \\ convert to UTF-16LE Print Chr$(substring$)="DEF" Print Chr$(substring2End$)="DEFG" Print Chr$(substringMinusOne$)="ABCDEF" Print Chr$(substringFromKnownCharacter$)="BCD" Print Chr$(substringFromKnownSubstring$)="BCD"}CheckAnsiModule CheckUTF16LE { s$ ="ABCDEFG" Print Len(s$)=7 Utf16Len=Len(s$) ' From 4th byte get 3 bytes n=4 m=3 substring$=Mid$(s$, n, m) substring2End$=Mid$(s$, n , Utf16Len) substringMinusOne$=Left$(s$, Utf16Len-1) substringFromKnownCharacter$=Mid$(s$, Instr(s$, "B") , m) substringFromKnownSubstring$=Mid$(s$, Instr(s$, "BC") , m) Print Len(substring$)=m \\ convert to UTF-16LE Print substring$="DEF" Print substring2End$="DEFG" Print substringMinusOne$="ABCDEF" Print substringFromKnownCharacter$="BCD" Print substringFromKnownSubstring$="BCD"}CheckUTF16LE
> n, m := 3, 5:> s := "The Higher, The Fewer!":> s[ n .. n + m - 1 ]; "e Hig"
There are a few ways to get everything from the n-th character on.
> s[ n .. -1 ] = s[ n .. ]; "e Higher, The Fewer!" = "e Higher, The Fewer!"> StringTools:-Drop( s, n - 1 ); "e Higher, The Fewer!"
There are a few ways to get all but the last character.
> s[ 1 .. -2 ] = s[ .. -2 ];"The Higher, The Fewer" = "The Higher, The Fewer"> StringTools:-Chop( s ); "The Higher, The Fewer"
Thesearchtext
command returns the position of a matching substring.
> pos := searchtext( ",", s ):> s[ pos .. pos + m - 1 ]; ", The"> pos := searchtext( "Higher", s ):> s[ pos .. pos + m - 1 ]; "Highe"
But, note thatsearchtext
returns 0 when there is no match, and 0 is not a valid index into a string.
TheStringTake
andStringDrop
are relevant for this exercise.
n=2m=3StringTake["Mathematica",{n+1,n+m-1}]StringDrop["Mathematica",n](* StringPosition returns a list of starting and ending character positions for a substring *)pos=StringPosition["Mathematica","e"][[1]][[1]]StringTake["Mathematica",{pos,pos+m-1}](* Similar to above *)pos=StringPosition["Mathematica","the"][[1]]StringTake["Mathematica",{pos,pos+m-1}]
Unicode, UTF-8, UTF-16 is only partially supported. In some cases, a conversion of unicode2native() or native2unicode() is necessary.
% starting from n characters in and of m length;s(n+(1:m))s(n+1:n+m)% starting from n characters in, up to the end of the string;s(n+1:end)% whole string minus last character;s(1:end-1)% starting from a known character within the string and of m length;s(find(s==c,1)+[0:m-1])% starting from a known substring within the string and of m length.s(strfind(s,pattern)+[0:m-1])
s:"the quick brown fox jumps over the lazy dog";substring(s,17);/* "fox jumps over the lazy dog" */substring(s,17,20);/* "fox" */
MUMPS has the first position in a string numbered as 1.
SUBSTR(S,N,M,C,K) ;show substring operations ;S is the string ;N is a position within the string (that is, n<length(string)) ;M is an integer of positions to show ;C is a character within the string S ;K is a substring within the string S ;$Find returns the position after the substring NEW X WRITE !,"The base string is:",!,?5,"'",S,"'" WRITE !,"From position ",N," for ",M," characters:" WRITE !,?5,$EXTRACT(S,N,N+M-1) WRITE !,"From position ",N," to the end of the string:" WRITE !,?5,$EXTRACT(S,N,$LENGTH(S)) WRITE !,"Whole string minus last character:" WRITE !,?5,$EXTRACT(S,1,$LENGTH(S)-1) WRITE !,"Starting from character '",C,"' for ",M," characters:" SET X=$FIND(S,C)-$LENGTH(C) WRITE !,?5,$EXTRACT(S,X,X+M-1) WRITE !,"Starting from string '",K,"' for ",M," characters:" SET X=$FIND(S,K)-$LENGTH(K) W !,?5,$EXTRACT(S,X,X+M-1) QUIT
Usage:
USER>D SUBSTR^ROSETTA("ABCD1234efgh",3,4,"D","23") The base string is: 'ABCD1234efgh'From position 3 for 4 characters: CD12From position 3 to the end of the string: CD1234efghWhole string minus last character: ABCD1234efgStarting from character 'D' for 4 characters: D123Starting from string '23' for 4 characters: 234e
str = "test string"println substr(str, m, m + n)println substr(str, n, len(str))println substr(str, 0, len(str) - 1)println substr(str, str.indexOf("s"), str.indexOf("s") + m)println substr(str, str.indexOf("str"), str.indexOf("str") + m)
usingSystem;usingSystem.Console;moduleSubstrings{Main():void{strings="0123456789";defn=3;defm=2;defc='3';defz="345";WriteLine(s.Substring(n,m));WriteLine(s.Substring(n,s.Length-n));WriteLine(s.Substring(0,s.Length-1));WriteLine(s.Substring(s.IndexOf(c,0,s.Length),m));WriteLine(s.Substring(s.IndexOf(z,0,s.Length),m));}}
/* NetRexx */optionsreplaceformatcommentsjavacrossrefsavelogsymbolss='abcdefghijk'n=4m=3sayssays.substr(n,m)says.substr(n)says.substr(1,s.length-1)says.substr(s.pos('def'),m)says.substr(s.pos('g'),m)return
abcdefghijkdefdefghijkabcdefghijdefghi
>(set'str"alphabet"'n2'm4)4>; starting from n characters in and of m length>(slicestrnm)"phab">; starting from n characters in, up to the end of the string>(slicestrn)"phabet">; whole string minus last character>(chopstr)"alphabe">; starting from a known character within the string and of m length>(slicestr(find"l"str)m)"lpha">; starting from a known substring within the string and of m length>(slicestr(find"ph"str)m)"phab"
Nim allows to work with raw strings, ignoring the encoding, or with UTF-8 strings. The following program shows how to extract substrings in both cases.
importstrformat,strutils,unicodelets1="abcdefgh"# ASCII string.s2="àbĉdéfgĥ"# UTF-8 string.n=2m=3c='d'cs1="de"cs2="dé"varpos:int# ASCII strings.# We can take a substring using "s.substr(first, last)" or "s[first..last]".# The latter form can also be used as value to assign a substring.echo"ASCII string: ",s1echo&"Starting from n = {n} characters in and of m = {m} length: ",s1[(n-1)..(n+m-2)]echo&"Starting from n = {n} characters in, up to the end of the string: ",s1[(n-1)..^1]echo"Whole string minus the last character: ",s1[0..^2]pos=s1.find(c)ifpos>0:echo&"Starting from character '{c}' within the string and of m = {m} length: ",s1[pos..<(pos+m)]else:echo&"Character '{c}' not found."pos=s1.find(cs1)ifpos>0:echo&"Starting from substring “{cs1}” within the string and of m = {m} length: ",s1[pos..<(pos+m)]else:echo&"String “{cs1}” not found."# UTF-8 strings.procfindUtf8(s:string;c:char):int=## Return the codepoint index of the first occurrence of a given character in a string.## Return - 1 if not found.s.toRunes.find(Rune(c))procfindUtf8(s1,s2:string):int=## Return the codepoint index of the first occurrence of a given string in a string.## Return - 1 if not found.lets1=s1.toRuneslets2=s2.toRunesforiin0..(s1.len-s2.len):ifs1[i..(i+s2.len-1)]==s2:returniresult=-1echo()echo"UTF-8 string: ",s2echo&"Starting from n = {n} characters in and of m = {m} length: ",s2.runeSubStr(n-1,m)echo&"Starting from n = {n} characters in, up to the end of the string: ",s2.runeSubstr(n-1)echo"Whole string minus the last character: ",s2.runeSubStr(0,s2.runeLen-1)pos=s2.findUtf8(c)ifpos>0:echo&"Starting from character '{c}' within the string and of m = {m} length: ",s2.runeSubStr(pos,m)else:echo&"String “{cs1}” not found."pos=s2.findUtf8(cs2)ifpos>0:echo&"Starting from substring “{cs2}” within the string and of m = {m} length: ",s2.runeSubStr(pos,m)else:echo&"String “{cs2}” not found."
ASCII string: abcdefghStarting from n = 2 characters in and of m = 3 length: bcdStarting from n = 2 characters in, up to the end of the string: bcdefghWhole string minus the last character: abcdefgStarting from character 'd' within the string and of m = 3 length: defStarting from substring “de” within the string and of m = 3 length: defUTF-8 string: àbĉdéfgĥStarting from n = 2 characters in and of m = 3 length: bĉdStarting from n = 2 characters in, up to the end of the string: bĉdéfgĥWhole string minus the last character: àbĉdéfgStarting from character 'd' within the string and of m = 3 length: défStarting from substring “dé” within the string and of m = 3 length: déf
( based on the JavaScript code )'abcdefgh 's ;s str-len 'len ;2 'n ;3 'm ;( starting from n characters in and of m length )s n n m + substring . ( => cde ) newline( starting from n characters in, up to the end of the string )s n len substring . ( => cdefgh ) newline( whole string minus last character )s 0 len 1 - substring . ( => abcdefg ) newline( starting from a known character within the string and of m length )s s 'b str-find dup m + substring . ( => bcd ) newline( starting from a known substring within the string and of m length )s s 'bc str-find dup m + substring . ( => bcd ) newline
bundle Default { class SubString { function : Main(args : String[]) ~ Nil { s := "0123456789"; n := 3; m := 4; c := '2'; sub := "456"; s->SubString(n, m)->PrintLine(); s->SubString(n)->PrintLine(); s->SubString(0, s->Size())->PrintLine(); s->SubString(s->Find(c), m)->PrintLine(); s->SubString(s->Find(sub), m)->PrintLine(); } }}
From the interactive toplevel:
$ocaml#lets="ABCDEFGH";;vals:string="ABCDEFGH"#letn,m=2,3;;valn:int=2valm:int=3#String.subsnm;;-:string="CDE"#String.subsn(String.lengths-n);;-:string="CDEFGH"#String.subs0(String.lengths-1);;-:string="ABCDEFG"#String.subs(String.indexs'D')m;;-:string="DEF"##load"str.cma";;#letn=Str.search_forward(Str.regexp_string"DE")s0inString.subsnm;;-:string="DEF"
: substrings(s, n, m) s sub(n, m) println s right(s size n - 1 +) println s left(s size 1 - ) println s sub(s indexOf('d'), m) println s sub(s indexOfAll("de"), m) println ;
"abcdefgh" 2 3 substringsbcdbcdefghabcdefgdefdef
macro utf8-char is (["%16r{00}" to "%16r{7F}"] | ["%16r{C0}" to "%16r{DF}"] ["%16r{80}" to "%16r{BF}"] | ["%16r{E0}" to "%16r{EF}"] ["%16r{80}" to "%16r{BF}"] {2} | ["%16r{F0}" to "%16r{F7}"] ["%16r{80}" to "%16r{BF}"] {3}) macro-endprocess local stream s initial {'This 𝓲𝓼 the 𝓞𝓶𝓷𝓲𝓜𝓪𝓻𝓴 solution.'} using group StartingFrom_n_CharactersInAndOf_m_Length submit s using group StartingFrom_n_charactersInUpToTheEndOfTheString submit s using group WholeStringMinusTheLastCharacter submit s using group StartingFromKnownCharacterAndOf_m_Length submit s using group StartingFromKnownSubstringAndOf_m_Length submit sgroup StartingFrom_n_CharactersInAndOf_m_Length find value-start utf8-char{12} utf8-char{8} => p output p || '%n' ; outputs characters 13 to 20: 𝓞𝓶𝓷𝓲𝓜𝓪𝓻𝓴group StartingFrom_n_charactersInUpToTheEndOfTheString find value-start utf8-char{12} utf8-char+ => p output p || '%n' ; outputs characters 13 to last: 𝓞𝓶𝓷𝓲𝓜𝓪𝓻𝓴 solution.group WholeStringMinusTheLastCharacter find value-start ((lookahead not (utf8-char value-end)) utf8-char)+ => p output p || '%n' ; outputs characters 1 to (last - 1), so without the .group StartingFromKnownCharacterAndOf_m_Length find 'T' utf8-char{3} => p output p || '%n' ; outputs his following Tgroup StartingFromKnownSubstringAndOf_m_Length find '𝓞𝓶𝓷𝓲𝓜' utf8-char{3} => p output p || '%n' ; outputs 𝓪𝓻𝓴 following 𝓞𝓶𝓷𝓲𝓜group #implied find utf8-char ; ensures no other characters go to the output
𝓞𝓶𝓷𝓲𝓜𝓪𝓻𝓴𝓞𝓶𝓷𝓲𝓜𝓪𝓻𝓴 solution.This 𝓲𝓼 the 𝓞𝓶𝓷𝓲𝓜𝓪𝓻𝓴 solutionhis𝓪𝓻𝓴
declare fun {DropUntil Xs Prefix} case Xs of nil then nil [] _|Xr then if {List.isPrefix Prefix Xs} then Xs else {DropUntil Xr Prefix} end end end Digits = "1234567890"in {ForAll [{List.take {List.drop Digits 2} 3} = "345" {List.drop Digits 2} = "34567890" {List.take Digits {Length Digits}-1} = "123456789" {List.take {DropUntil Digits "4"} 3} = "456" {List.take {DropUntil Digits "56"} 3} = "567" {List.take {DropUntil Digits "31"} 3} = "" ] System.showInfo}
\\ Returns the substring of string str specified by the start position s and length n.\\ If n=0 then to the end of str.\\ ssubstr() 3/5/16 aevssubstr(str,s=1,n=0)={my(vt=Vecsmall(str),ve,vr,vtn=#str,n1);if(vtn==0,return(""));if(s<1||s>vtn,return(str));n1=vtn-s+1; if(n==0,n=n1); if(n>n1,n=n1);ve=vector(n,z,z-1+s); vr=vecextract(vt,ve); return(Strchr(vr));}{\\ TESTmy(s="ABCDEFG",ns=#s);print(" *** Testing ssubstr():");print("1.",ssubstr(s,2,3));print("2.",ssubstr(s));print("3.",ssubstr(s,,ns-1));print("4.",ssubstr(s,2));print("5.",ssubstr(s,,4));print("6.",ssubstr(s,0,4));print("7.",ssubstr(s,3,7));print("8.|",ssubstr("",1,4),"|");}
*** Testing ssubstr():1.BCD2.ABCDEFG3.ABCDEF4.BCDEFG5.ABCD6.ABCDEFG7.CDEFG8.||
See alsoDelphi andFree Pascal
Remember, in Extended Pascal (ISO standard 10206),string(…) variables’ indices are1‑based.Pay attention to the constraints below.
programsubstring(output);varsample:string(20)value'Foobar';n,m:integervalue1;begin{ starting from n characters in and of m length - - - - - - - - - - - - - - - }writeLn(subStr(sample,n,m));writeLn(subStr(sample,n):m);writeLn(sample[n..n+m-1]);{ starting from n characters in, up to the end of the string - - - - - - - - }writeLn(subStr(sample,n));writeLn(sample[n..length(sample)]);{ whole string minus the last character - - - - - - - - - - - - - - - - - - - }writeLn(subStr(sample,1,length(sample)-1));writeLn(sample[1..pred(length(sample))]);writeLn(sample:length(sample)-1);{ To make this a permanent change you can use writeStr(sample, sample:pred(length(sample)); }{ starting from a known character within the string and of m length - - - - - }writeLn(subStr(sample,index(sample,'b'),m));writeLn(subStr(sample,index(sample,'b')):m);writeLn(sample[index(sample,'b')..index(sample,'b')+m-1]);{ starting from a known substring within the string and of m length - - - - - }writeLn(subStr(sample,index(sample,'bar'),m));writeLn(subStr(sample,index(sample,'bar')):m);writeLn(sample[index(sample,'bar')..index(sample,'bar')+m-1]);end.
varmyBindableStringVariable:bindablestring(20);
myBindableStringVariable[firstCharacterIndex..lastCharacterIndex]
write(sample:pred(length(sample),ord(length(sample)>0)));{ all but last char }
{$zerobasedstrings}constn=3;m=2;beginvars:='0123456789';Writeln(s.Substring(n,m));Writeln(s[n:]);Writeln(s[:^1]);Writeln(s.Substring(s.IndexOf('3'),m));Writeln(s.Substring(s.IndexOf('456'),m));end.
3434567890123456783445
my$str='abcdefgh';printsubstr($str,2,3),"\n";# Returns 'cde'printsubstr($str,2),"\n";# Returns 'cdefgh'printsubstr($str,0,-1),"\n";#Returns 'abcdefg'printsubstr($str,index($str,'d'),3),"\n";# Returns 'def'printsubstr($str,index($str,'de'),3),"\n";# Returns 'def'
--(1) starting from n characters in and of m length;--(2) starting from n characters in, up to the end of the string;--(3) whole string minus last character;--(4) starting from a known character within the string and of m length;--(5) starting from a known substring within the string and of m length.constantsentence="the last thing the man said was the",n=10,m=5integerk,ll=n+m-1ifl<=length(sentence)then?sentence[n..l]-- (1)endififn<=length(sentence)then?sentence[n..-1]-- (2) or [n..$]endififlength(sentence)>0then?sentence[1..-2]-- (3) or [1..$-1]endifk=find('m',sentence)l=k+m-1ifl<=length(sentence)then?sentence[k..l]-- (4)endifk=match("aid",sentence)l=k+m-1ifl<=length(sentence)then?sentence[k..l]-- (5)endif
"thing""thing the man said was the""the last thing the man said was th""man s""aid w"
Alternative version with no error handling, for those in a hurry (same ouput):
?sentence[n..n+m-1]?sentence[n..-1]?sentence[1..-2]?(sentence[find('m',sentence)..$])[1..m]?(sentence[match("aid",sentence)..$])[1..m]
If sentence is UTF-8 or UTF-16, you should explicitly use sequence utf32 = utf8_to_utf32(string utf8) or sequence utf32 = utf16_to_utf32(sequence utf16) before any slicing or find()/match(), and string utf8 = utf32_to_utf8(sequence utf32) or sequence utf16 = utf32_to_utf16(sequence utf32) before display. Note that unicode does not normally display correctly on a standard Windows console, but is fine in a GUI or Linux console or a web browser.
include ..\Utilitys.pmt/#--(1) starting from n characters in and of m length; --(2) starting from n characters in, up to the end of the string; --(3) whole string minus last character; --(4) starting from a known character within the string and of m length; --(5) starting from a known substring within the string and of m length. #/def myslice rot len var _|long rot rot over _|long swap - 1 + min sliceenddef"the last thing the man said was the"10 var n 5 var mn m myslice ? /# (1) #/len n swap myslice ? /# (2) #/dup -1 del ? /# (3) #/'m' find m myslice ? /# (4) #/"aid" find m myslice ? /# (5) #/
<?php$str='abcdefgh';$n=2;$m=3;echosubstr($str,$n,$m),"\n";//cdeechosubstr($str,$n),"\n";//cdefghechosubstr($str,0,-1),"\n";//abcdefgechosubstr($str,strpos($str,'d'),$m),"\n";//defechosubstr($str,strpos($str,'de'),$m),"\n";//def?>
go => S = "Picat is fun", N = 3, M = 4, C = 'i', % must be a char SS = "is", test(S,N,M,C,SS).test(S,N,M,C,SS) => println($test(S,N,M,C,SS)), % - starting from n characters in and of m length; println(1=slice(S,N,N+M)), println(1=S[N..N+M]), % - starting from n characters in, up to the end of the string; println(2=S.slice(N)), % - whole string minus last character; println(3=but_last(S)), println(3=S[1..S.len-1]), % - starting from a known character within the string and of m length; println(4=substring4(S,C)), % - starting from a known substring within the string and of m length. println(5=substring5(S,SS,M)), nl. but_last(S) = slice(S,1,S.length-1).substring4(S,C) = slice(S,S.find_first_of(C)).% find is non-deterministic, hence the once/1substring5(S,SS,M) = slice(S,Start,Start+M) => once(find(S,SS,Start,_End)).
test(Picat is fun,3,4,i,is)1 = cat i1 = cat i2 = cat is fun3 = Picat is fu3 = Picat is fu4 = icat is fun5 = is fu
(let Str (chop "This is a string") (prinl (head 4 (nth Str 6))) # From 6 of 4 length (prinl (nth Str 6)) # From 6 up to the end (prinl (head -1 Str)) # Minus last character (prinl (head 8 (member "s" Str))) # From character "s" of length 8 (prinl # From "isa" of length 8 (head 8 (seek '((S) (pre? "is a" S)) Str) ) ) )
is ais a stringThis is a strins is a sis a str
s='abcdefghijk';n=4; m=3;u=substr(s,n,m);u=substr(s,n);u=substr(s,1,length(s)-1);u=left(s,length(s)-1);u=substr(s,1,length(s)-1);u=substr(s,index(s,'g'),m);
Since .NET and PowerShell use zero-based indexing, all character indexes have to be reduced by one.
# test string$s="abcdefgh"# test parameters$n,$m,$c,$s2=2,3,[char]'d',$s2='cd'# starting from n characters in and of m length# n = 2, m = 3$s.Substring($n-1,$m)# returns 'bcd'# starting from n characters in, up to the end of the string# n = 2$s.Substring($n-1)# returns 'bcdefgh'# whole string minus last character$s.Substring(0,$s.Length-1)# returns 'abcdefg'# starting from a known character within the string and of m length# c = 'd', m =3$s.Substring($s.IndexOf($c),$m)# returns 'def'# starting from a known substring within the string and of m length# s2 = 'cd', m = 3$s.Substring($s.IndexOf($s2),$m)# returns 'cde'
substring_task(Str,N,M,Char,SubStr):-sub_string(Str,N,M,_,Span),sub_string(Str,N,_,0,ToEnd),sub_string(Str,0,_,1,MinusLast),string_from_substring_to_m(Str,Char,M,FromCharToMth),string_from_substring_to_m(Str,SubStr,M,FromSubToM),maplist(writeln,['from n to m ':Span,'from n to end ':ToEnd,'string minus last char ':MinusLast,'form known char to m ':FromCharToMth,'from known substring to m ':FromSubToM]).string_from_substring_to_m(String,Sub,M,FromSubToM):-sub_string(String,Before,_,_,Sub),sub_string(String,Before,M,_,FromSubToM).
Running it:
?-substring_task("abcdefghijk",2,4,"d","ef").fromntom:cdeffromntoend:cdefghijkstringminuslastchar:abcdefghijformknownchartom:defgfromknownsubstringtom:efghtrue
Python uses zero-based indexing, so the n'th character is at index n-1.
>>>s='abcdefgh'>>>n,m,char,chars=2,3,'d','cd'>>># starting from n=2 characters in and m=3 in length;>>>s[n-1:n+m-1]'bcd'>>># starting from n characters in, up to the end of the string;>>>s[n-1:]'bcdefgh'>>># whole string minus last character;>>>s[:-1]'abcdefg'>>># starting from a known character char="d" within the string and of m length;>>>indx=s.index(char)>>>s[indx:indx+m]'def'>>># starting from a known substring chars="cd" within the string and of m length.>>>indx=s.index(chars)>>>s[indx:indx+m]'cde'>>>
find$
is defined atCount occurrences of a substring#Quackery.
[ $ "abcdefgh" ] is s ( --> $ ) [ 2 ] is n ( --> n ) [ 3 ] is m ( --> n ) [ char d ] is ch ( --> c ) [ $ "cd" ] is ss ( --> $ ) s n split nip m split drop echo$ cr s n split nip echo$ cr s -1 split drop echo$ cr ch s tuck find split nip m split drop echo$ cr ss s tuck find$ split nip m split drop echo$ cr
cdecdefghabcdefgdefcde
s<-"abcdefgh"n<-2;m<-2;char<-'d';chars<-'cd'substring(s,n,n+m)substring(s,n)substring(s,1,nchar(s)-1)indx<-which(strsplit(s,'')[[1]]%in%strsplit(char,'')[[1]])substring(s,indx,indx+m)indx<-which(strsplit(s,'')[[1]]%in%strsplit(chars,'')[[1]])[1]substring(s,indx,indx+m)
#lang racket(define str "abcdefghijklmnopqrstuvwxyz")(define n 10)(define m 2)(define start-char #\x)(define start-str "xy");; starting from n characters in and of m length;(substring str n (+ n m)) ; -> "kl";; starting from n characters in, up to the end of the string;(substring str m) ; -> "klmnopqrstuvwxyz";; whole string minus last character;(substring str 0 (sub1 (string-length str))) ; -> "abcdefghijklmnopqrstuvwxy";; starting from a known character within the string and of m length;(substring str (caar (regexp-match-positions (regexp-quote (string start-char)) str))) ; -> "xyz";; starting from a known substring within the string and of m length.(substring str (caar (regexp-match-positions (regexp-quote start-str) str))) ; -> "xyz"
(formerly Perl 6)
my $str = 'abcdefgh';my $n = 2;my $m = 3;say $str.substr($n, $m);say $str.substr($n);say $str.substr(0, *-1);say $str.substr($str.index('d'), $m);say $str.substr($str.index('de'), $m);
define println use $s $s print "\n" print"0123456789" as $str$str 3 2 extract println # at 4th pos get 2 chars$str 8 4 extract println # at 9th pos get 4 chars (when only 1 char available)$str 3 $str length extract println # at 4th pos get all chars to end of str$str 3 0x7FFFFFFF extract println # at 4th pos get all chars to end of str$str 3 -1 extract println # at 4th pos get rest of chars but last one$str 0 -1 extract println # all chars but last one"3" as $matchChr # starting chr for extraction4 as $subLen # Nr chars after found starting char$str $matchChr split as $l "" $l 0 set $l $matchChr join 0 $subLen extract println"345" as $matchChrs # starting chrs for extraction6 as $subLen # Nr chars after found starting chars$str $matchChrs split as $l "" $l 0 set $l $matchChrs join 0 $subLen extract println
3489345678934567893456780123456783456345678
REBOL [Title: "Retrieve Substring"URL: http://rosettacode.org/wiki/Substring#REBOL]s: "abcdefgh" n: 2 m: 3 char: #"d" chars: "cd"; Note that REBOL uses base-1 indexing. Strings are series values,; just like blocks or lists so I can use the same words to manipulate; them. All these examples use the 'copy' function against the 's'; string with a particular offset as needed. ; For the fragment "copy/part skip s n - 1 m", read from right to; left. First you have 'm', which we ignore for now. Then evaluate; 'n - 1' (makes 1), to adjust the offset. Then 'skip' jumps from the; start of the string by that offset. 'copy' starts copying from the; new start position and the '/part' refinement limits the copy by 'm'; characters. print ["Starting from n, length m:" copy/part skip s n - 1 m]; It may be helpful to see the expression with optional parenthesis:print ["Starting from n, length m (parens):" (copy/part (skip s (n - 1)) m)]; This example is much simpler, so hopefully it's easier to see how; the string start is position for the copy:print ["Starting from n to end of string:" copy skip s n - 1]print ["Whole string minus last character:" copy/part s (length? s) - 1]print ["Starting from known character, length m:"copy/part find s char m]print ["Starting from substring, length m:"copy/part find s chars m]
Script: "Retrieve Substring" (6-Dec-2009)Starting from n, length m: bcdStarting from n, length m (parens): bcdStarting from n to end of string: bcdefghWhole string minus last character: abcdefgStarting from known character, length m: defStarting from substring, length m: cde
let s = "ABCDEFGH"let from = 2let length = 3Js.log2("Original string: ", s)Js.log(Js.String.substrAtMost(~from, ~length, s))Js.log(Js.String.substr(~from, s))Js.log(Js.String.substrAtMost(~from=0, ~length=(Js.String2.length(s) - 1), s))Js.log(Js.String.substrAtMost(~from=(Js.String.indexOf("B", s)), ~length, s))Js.log(Js.String.substrAtMost(~from=(Js.String.indexOf("BC", s)), ~length, s))
$ bsc substr.res > substr.js$ node substr.jsOriginal string: ABCDEFGHCDECDEFGHABCDEFGBCDBCD
Note: in REXX, the 1st character index of a string is 1, not 0.
/*REXX program demonstrates various ways to extract substrings from a string of characters.*/$='abcdefghijk'; n=4; m=3 /*define some constants: string, index, length of string. */say 'original string='$ /* [↑] M can be zero (which indicates a null string).*/L=length($) /*the length of the $ string (in bytes or characters).*/ say center(1,30,'═') /*show a centered title for the 1st task requirement. */u=substr($, n, m) /*start from N characters in and of M length. */say uparse var $ =(n) a +(m) /*an alternate method by using the PARSE instruction. */say a say center(2,30,'═') /*show a centered title for the 2nd task requirement. */u=substr($,n) /*start from N characters in, up to the end-of-string. */say uparse var $ =(n) a /*an alternate method by using the PARSE instruction. */say a say center(3,30,'═') /*show a centered title for the 3rd task requirement. */u=substr($, 1, L-1) /*OK: the entire string except the last character. */say uv=substr($, 1, max(0, L-1) ) /*better: this version handles the case of a null string. */say vlm=L-1parse var $ a +(lm) /*an alternate method by using the PARSE instruction. */say a say center(4,30,'═') /*show a centered title for the 4th task requirement. */u=substr($,pos('g',$), m) /*start from a known char within the string of length M. */say uparse var $ 'g' a +(m) /*an alternate method by using the PARSE instruction. */say a say center(5,30,'═') /*show a centered title for the 5th task requirement. */u=substr($,pos('def',$),m) /*start from a known substr within the string of length M.*/say uparse var $ 'def' a +(m) /*an alternate method by using the PARSE instruction. */say a /*stick a fork in it, we're all done and Bob's your uncle.*/
output when using the (internal) default strings:
original string=abcdefghijk══════════════1═══════════════defdef══════════════2═══════════════defghijkdefghijk══════════════3═══════════════abcdefghijabcdefghijabcdefghij══════════════4═══════════════ghighi══════════════5═══════════════defdef
Programming note: generally, the REXX parse statement is faster than using an assignment statementand using a BIF (built-infunction), but the use of parse is more obtuse to novice programmers.
cStr = "a":"h" # 'abcdefgh'n = 3 m = 3# starting from n characters in and of m lengthSee substr(cStr,n, m) + nl #=> cde# starting from n characters in, up to the end of the stringSee substr(cStr,n) + nl #=> cdefgh# whole string minus last characterSee substr(cstr,1,len(cStr)-1) + nl #=> abcdefg# starting from a known character within the string and of m lengthSee substr(cStr,substr(cStr,"e"),m) +nl #=> efg# starting from a known substring within the string and of m lengthSee substr(cStr,substr(cStr,"de"),m) +nl #=> def
* 1...5....1....5....2....5.. D myString S 30 inz('Liebe bewegt das Universum!') D output S 30 inz('') D n S 2 0 inz(1) D m S 2 0 inz(5) D length S 2 0 inz(0) D find S 2 0 inz(0) /free *inlr = *on; dsply %subst(myString:n:m); dsply %subst(myString:7:20); length = %len(%trim(myString)); dsply %subst(myString:1:length-1); find = %scan('U':myString); dsply %subst(myString:find:9); find = %scan('bewegt':myString); dsply %subst(myString:find:%len('bewegt')); output = ' *** end *** '; dsply ' ' ' ' output; /end-free
DSPLY Liebe DSPLY bewegt das Universum DSPLY Liebe bewegt das UniversumDSPLY Universum DSPLY bewegt
Code | Comments |
---|---|
≪ → string n m char sub ≪ string n DUP m + 1 - SUB string n OVER SIZE SUB string 1 OVER SIZE 1 - SUB string DUP char POS DUP m + 1 - SUB string DUP sub POS DUP m + 1 - SUB≫ ≫SHOWC STO | ( string start length char sub -- sub1 .. sub5 ) from n characters in and of m length from n characters in, up to the end of the string whole string minus the last character from a character within the string and of m length from a substring within the string and of m length |
The following piece of code will deliver what is required:
"abcdefgh" 2 3 "d" "cd"SHOWC
5: bcd4: bcdefgh3: abcdefg2: def1: cde
str = 'abcdefgh'n = 2m = 3puts str[n, m] #=> cdeputs str[n..m] #=> cdputs str[n..-1] #=> cdefghputs str[0..-2] #=> abcdefgputs str[str.index('d'), m] #=> defputs str[str.index('de'), m] #=> defputs str[/a.*d/] #=> abcd
let s = "abc文字化けdef";let n = 2;let m = 3; // Print 3 characters starting at index 2 (c文字)println!("{}", s.chars().skip(n).take(m).collect::<String>()); // Print all characters starting at index 2 (c文字化けdef)println!("{}", s.chars().skip(n).collect::<String>()); // Print all characters except the last (abc文字化けde)println!("{}", s.chars().rev().skip(1).collect::<String>()); // Print 3 characters starting with 'b' (bc文)let cpos = s.find('b').unwrap();println!("{}", s[cpos..].chars().take(m).collect::<String>()); // Print 3 characters starting with "けd" (けde)let spos = s.find("けd").unwrap();println!("{}", s[spos..].chars().take(m).collect::<String>());
data _null_; a="abracadabra"; b=substr(a,2,3); /* first number is position, starting at 1, second number is length */ put _all_;run;
class MAIN is main is s ::= "hello world shortest program"; #OUT + s.substring(12, 5) + "\n"; #OUT + s.substring(6) + "\n"; #OUT + s.head( s.size - 1) + "\n"; #OUT + s.substring(s.search('w'), 5) + "\n"; #OUT + s.substring(s.search("ro"), 3) + "\n"; end;end;
object Substring { // Ruler 1 2 3 4 5 6 // 012345678901234567890123456789012345678901234567890123456789012 val str = "The good life is one inspired by love and guided by knowledge." val (n, m) = (21, 16) // An one-liner to set n = 21, m = 16 // Starting from n characters in and of m length assert("inspired by love" == str.slice(n, n + m)) // Starting from n characters in, up to the end of the string assert("inspired by love and guided by knowledge." == str.drop(n)) // Whole string minus last character assert("The good life is one inspired by love and guided by knowledge" == str.init) // Starting from a known character within the string and of m length assert("life is one insp" == str.dropWhile(_ != 'l').take(m) ) // Starting from a known substring within the string and of m length assert("good life is one" == { val i = str.indexOf("good"); str.slice(i, i + m) }) // Alternatively assert("good life is one" == str.drop(str.indexOf("good")).take(m))}
(define s "Hello, world!")(define n 5)(define m (+ n 6))(display (substring s n m))(newline)(display (substring s n))(newline)(display (substring s 0 (- (string-length s) 1)))(newline)(display (substring s (string-index s #\o) m))(newline)(display (substring s (string-contains s "lo") m))(newline)
# 2 chars starting from 3rd$ echo string | sed -r 's/.{3}(.{2}).*/\1/'in# remove first 3 charsecho string | sed -r 's/^.{3}//'# delete last char$ echo string | sed -r 's/.$//'strin# `r' with two following chars$ echo string | sed -r 's/.*(r.{2}).*/\1/'rin
$ include "seed7_05.s7i";const proc: main is func local const string: stri is "abcdefgh"; const integer: N is 2; const integer: M is 3; begin writeln(stri[N len M]); writeln(stri[N ..]); writeln(stri[.. pred(length(stri))]); writeln(stri[pos(stri, 'c') len M]); writeln(stri[pos(stri, "de") len M]); end func;
bcdbcdefghabcdefgcdedef
Note: SenseTalk indexes from 1 and ranges are inclusive
set mainString to "87654321"set n to 3set m to 4set c to "5"set sub to "654"put characters n + 1 to n + m of mainStringput characters n + 1 to end of mainStringput characters first to penultimate of mainStringset characterOffset to offset of c in mainStringput characters characterOffset to characterOffset + m - 1 of mainStringset subOffset to offset of sub in mainStringput characters subOffset to subOffset + m - 1 of mainString
var str = 'abcdefgh';var n = 2;var m = 3;say str.substr(n, m); #=> cdesay str.substr(n); #=> cdefghsay str.substr(0, -1); #=> abcdefgsay str.substr(str.index('d'), m); #=> defsay str.substr(str.index('de'), m); #=> def
#s := 'hello world shortest program'.#n := 13.#m := 4.inform: (s copyFrom: n to: n + m).inform: (s copyFrom: n).inform: s allButLast.inform: (s copyFrom: (s indexOf: $w) to: (s indexOf: $w) + m).inform: (s copyFrom: (s indexOfSubSeq: 'ro') to: (s indexOfSubSeq: 'ro') + m).
The distinction between searching a single character or a string into another string is rather blurred. In the following code, instead of using'w' (a string) we could use$w (a character), but it makes no difference.
|s|s := 'hello world shortest program'.(s copyFrom: 13 to: (13+4)) displayNl."4 is the length (5) - 1, since we need the index of the last char we want, which is included" (s copyFrom: 7) displayNl.(s allButLast) displayNl.(s copyFrom: ((s indexOfRegex: 'w') first) to: ( ((s indexOfRegex: 'w') first) + 4) ) displayNl.(s copyFrom: ((s indexOfRegex: 'ro') first) to: ( ((s indexOfRegex: 'ro') first) + 2) ) displayNl.
These last two examples in particular seem rather complex, sowe can extend the string class.
String extend [ copyFrom: index length: nChar [ ^ self copyFrom: index to: ( index + nChar - 1 ) ] copyFromRegex: regEx length: nChar [ |i| i := self indexOfRegex: regEx. ^ self copyFrom: (i first) length: nChar ]]."and show it simpler..."(s copyFrom: 13 length: 5) displayNl.(s copyFromRegex: 'w' length: 5) displayNl.(s copyFromRegex: 'ro' length: 3) displayNl.
string = "abcdefghijklmnopqrstuvwxyz"n = 12m = 5known_char = "q"known_str = "pq"* starting from n characters in and of m length;string len(n - 1) len(m) . output* starting from n characters in, up to the end of the string;string len(n - 1) rem . output* whole string minus last character;string rtab(1) . output* starting from a known character within the string and of m length;string break(known_char) len(m) . output* starting from a known substring <= m within the string and of m length. string (known_str len(m - size(known_str))) . outputend
lmnop lmnopqrstuvwxyz abcdefghijklmnopqrstuvwxy qrstu pqrst
In Db2, there are different ways to find the position of a character or substring. For this reason, several examples are shown. Please take a look at the documentation for more details.
select 'the quick brown fox jumps over the lazy dog' from sysibm.sysdummy1;select substr('the quick brown fox jumps over the lazy dog', 5, 15) from sysibm.sysdummy1;select substr('the quick brown fox jumps over the lazy dog', 32) from sysibm.sysdummy1;select substr('the quick brown fox jumps over the lazy dog', 1, length ('the quick brown fox jumps over the lazy dog') - 1) from sysibm.sysdummy1;select locate('j', 'the quick brown fox jumps over the lazy dog') from sysibm.sysdummy1;select locate_in_string('the quick brown fox jumps over the lazy dog', 'j') from sysibm.sysdummy1;select posstr('the quick brown fox jumps over the lazy dog', 'j') from sysibm.sysdummy1;select position('j', 'the quick brown fox jumps over the lazy dog', OCTETS) from sysibm.sysdummy1;select substr('the quick brown fox jumps over the lazy dog', locate('j', 'the quick brown fox jumps over the lazy dog')) from sysibm.sysdummy1;select locate('fox', 'the quick brown fox jumps over the lazy dog') from sysibm.sysdummy1;select locate_in_string('the quick brown fox jumps over the lazy dog', 'fox') from sysibm.sysdummy1;select posstr('the quick brown fox jumps over the lazy dog', 'fox') from sysibm.sysdummy1;select position('fox', 'the quick brown fox jumps over the lazy dog', OCTETS) from sysibm.sysdummy1;select substr('the quick brown fox jumps over the lazy dog', locate('fox', 'the quick brown fox jumps over the lazy dog')) from sysibm.sysdummy1;
Output:
db2 => select 'the quick brown fox jumps over the lazy dog' from sysibm.sysdummy1;1 -------------------------------------------the quick brown fox jumps over the lazy dog 1 record(s) selected.db2 => select substr('the quick brown fox jumps over the lazy dog', 5, 15) from sysibm.sysdummy1;1 ---------------quick brown fox 1 record(s) selected.db2 => select substr('the quick brown fox jumps over the lazy dog', 32) from sysibm.sysdummy1;1 -------------------------------------------the lazy dog 1 record(s) selected.db2 => select substr('the quick brown fox jumps over the lazy dog', 1, length ('the quick brown fox jumps over the lazy dog') - 1) from sysibm.sysdummy1;1 -------------------------------------------the quick brown fox jumps over the lazy do 1 record(s) selected.db2 => select substr('the quick brown fox jumps over the lazy dog', locate('j', 'the quick brown fox jumps over the lazy dog')) from sysibm.sysdummy1;1 -------------------------------------------jumps over the lazy dog 1 record(s) selected.db2 => select substr('the quick brown fox jumps over the lazy dog', locate('fox', 'the quick brown fox jumps over the lazy dog')) from sysibm.sysdummy1;1 -------------------------------------------fox jumps over the lazy dog 1 record(s) selected.
s = "Ἐν ἀρχῇ ἐποίησεν ὁ θεὸς τὸν οὐρανὸν καὶ τὴν γῆν"usubstr(s, 25, 11) τὸν οὐρανὸνusubstr(s, 25, .) τὸν οὐρανὸν καὶ τὴν γῆνusubstr(s, 1, ustrlen(s)-1) Ἐν ἀρχῇ ἐποίησεν ὁ θεὸς τὸν οὐρανὸν καὶ τὴν γῆusubstr(s, -3, .) γῆν
let string = "Hello, Swift language"let (n, m) = (5, 4)// Starting from `n` characters in and of `m` length.do { let start = string.startIndex.advancedBy(n) let end = start.advancedBy(m) // Pure-Swift (standard library only): _ = string[start..<end] // With Apple's Foundation framework extensions: string.substringWithRange(start..<end)}// Starting from `n` characters in, up to the end of the string.do { // Pure-Swift (standard library only): _ = String( string.characters.suffix(string.characters.count - n) ) // With Apple's Foundation framework extensions: _ = string.substringFromIndex(string.startIndex.advancedBy(n))}// Whole string minus last character.do { // Pure-Swift (standard library only): _ = String( string.characters.prefix( string.characters.count.predecessor() ) ) // With Apple's Foundation framework extensions: _ = string.substringToIndex(string.endIndex.predecessor())}// Starting from a known character within the string and of `m` length.do { // Pure-Swift (standard library only): let character = Character("l") guard let characterIndex = string.characters.indexOf(character) else { fatalError("Index of '\(character)' character not found.") } let endIndex = characterIndex.advancedBy(m) _ = string[characterIndex..<endIndex]}// Starting from a known substring within the string and of `m` length.do { // With Apple's Foundation framework extensions: let substring = "Swift" guard let range = string.rangeOfString(substring) else { fatalError("Range of substring \(substring) not found") } let start = range.startIndex let end = start.advancedBy(m) string[start..<end]}
Tailspin doesn't really let you manipulate parts of strings. You can get a list of characters (or, really, clumps of combining characters) by
[$s...]
and manipulate parts of that list, and recombine as
'$a...;'
if you really want to. The better Tailspin way to handle parts of strings is to use a "composer" to compose meaningful structure from the string (and discard unwanted parts).
composer substr&{from:, length:} (<'.{$:$from-1;}'>) <'.{$length;}'> (<'.*'>)end substr'abcdef' -> substr&{from:3, length:2} -> !OUT::write'' -> !OUT::writecomposer substrFrom&{from:} (<'.{$:$from-1;}'>) <'.*'>end substrFrom'abcdef' -> substrFrom&{from:4} -> !OUT::write'' -> !OUT::writecomposer chopLast <'(.(?=.))*'> (<'.'>)end chopLast'abcdef' -> chopLast -> !OUT::write'' -> !OUT::writecomposer substrStarting&{prefix:, length:} (<~='$prefix;'>) <'.{$length;}'> (<'.*'>)end substrStarting'abcdef' -> substrStarting&{prefix: 'b', length: 2} -> !OUT::write'' -> !OUT::write'abcdef' -> substrStarting&{prefix: 'cd', length: 3} -> !OUT::write'' -> !OUT::write
cddefabcdebccde
set str "abcdefgh"set n 2set m 3puts [string range $str $n [expr {$n+$m-1}]]puts [string range $str $n end]puts [string range $str 0 end-1]# Because Tcl does substrings with a pair of indices, it is easier to express# the last two parts of the task as a chained pair of [string range] operations.# A maximally efficient solution would calculate the indices in full first.puts [string range [string range $str [string first "d" $str] end] [expr {$m-1}]]puts [string range [string range $str [string first "de" $str] end] [expr {$m-1}]]# From Tcl 8.5 onwards, these can be contracted somewhat.puts [string range [string range $str [string first "d" $str] end] $m-1]puts [string range [string range $str [string first "de" $str] end] $m-1]
Of course, if you were doing 'position-plus-length' a lot, it would be easier to add another subcommand tostring
, like this:
# Define the substring operation, efficientlyproc ::substring {string start length} { string range $string $start [expr {$start + $length - 1}]}# Plumb it into the languageset ops [namespace ensemble configure string -map]dict set ops substr ::substringnamespace ensemble configure string -map $ops# Now show off by repeating the challenge!set str "abcdefgh"set n 2set m 3puts [string substr $str $n $m]puts [string range $str $n end]puts [string range $str 0 end-1]puts [string substr $str [string first "d" $str] $m]puts [string substr $str [string first "de" $str] $m]
$$ MODE TUSCRIPTstring="abcdefgh", n=4,m=n+2substring=EXTRACT (string,#n,#m) PRINT substringsubstring=Extract (string,#n,0) PRINT substringsubstring=EXTRACT (string,0,-1) PRINT substringn=SEARCH (string,":d:"),m=n+2substring=EXTRACT (string,#n,#m) PRINT substringsubstring=EXTRACT (string,":{substring}:"|,0) PRINT substring
dedefghabcdefgdefgh
str="abc qrdef qrghi"n=6m=3expr "x$str" : "x.\{$n\}\(.\{1,$m\}\)"expr "x$str" : "x.\{$n\}\(.*\)"printf '%s\n' "${str%?}"expr "r${str#*r}" : "\(.\{1,$m\}\)"expr "qr${str#*qr}" : "\(.\{1,$m\}\)"
defdef qrghiabc qrdef qrghrdeqrd
This program usesexpr(1) to capture a substring.
str="abc qrdef qrghi"n=6m=3expr "x$str" : "x.\{$n\}\(.\{1,$m\}\)"expr "x$str" : "x.\{$n\}\(.*\)"expr "x$str" : "x\(.*\)."index() {i=0 s=$1until test "x$s" = x || expr "x$s" : "x$2" >/dev/null; doi=`expr $i + 1` s=`expr "x$s" : "x.\(.*\)"`doneecho $i}expr "x$str" : "x.\{`index "$str" r`\}\(.\{1,$m\}\)"expr "x$str" : "x.\{`index "$str" qr`\}\(.\{1,$m\}\)"
defdef qrghiabc qrdef qrghrdeqrd
Note that the last two constructs won't work with bash as only zsh supports nested string manipulation.
#!/bin/zshstring='abcdefghijk'echo ${string:2:3} # Display 3 chars starting 2 chars in ie: 'cde'echo ${string:2} # Starting 2 chars in, display to end of stringecho ${string:0:${#string}-1} # Whole string minus last characterecho ${string%?} # Shorter variant of the aboveecho ${${string/*c/c}:0:3} # Display 3 chars starting with 'c'echo ${${string/*cde/cde}:0:3} # Display 3 chars starting with 'cde'
This example shows how tocut(1) a substring from a string.
#!/bin/shstr=abcdefghijklmnopqrstuvwxyzn=12m=5printf %s "$str" | cut -c $n-`expr $n + $m - 1`printf %s "$str" | cut -c $n-printf '%s\n' "${str%?}"printf q%s "${str#*q}" | cut -c 1-$mprintf pq%s "${str#*pq}" | cut -c 1-$m
$ sh substring.sh lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyqrstupqrst
string s = "Hello, world!";int n = 1;int m = 3;// start at n and go m lettersstring s_n_to_m = s[n:n+m];// start at n and go to endstring s_n_to_end = s[n:s.length];// start at beginning and show all but laststring s_notlast = s[0:s.length - 1];// start from known letter and then go m lettersint index_of_l = s.index_of("l");string s_froml_for_m = s[index_of_l:index_of_l + m];// start from known substring then go m lettersint index_of_lo = s.index_of("lo");string s_fromlo_for_m = s[index_of_lo:index_of_lo + m];
1) Substring function argument in V (Vlang) uses end position versus length in AHK.2) V (Vlang) arrays are 0 index based.
fn main() { str := "abcdefghijklmnopqrstuvwxyz" find_char := "q" find_string := "pq" n := 12 m := 5// starting from n characters in and of m length println(str.substr(n - 1, (n - 1) + m))// starting from n characters in, up to the end of the string println(str.substr(n - 1, str.len))// whole string minus last character println(str.substr(0, str.len - 1))// starting from a known character within the string and of m length // returns nothing if not found println(str.substr(str.index(find_char) or {return}, (str.index(find_char) or {return}) + m))// starting from a known character within the string and of m length // returns nothing if not found println(str.substr(str.index(find_string) or {return}, (str.index(find_string) or {return}) + m))}
lmnoplmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyqrstupqrst
s <- "abcdefgh"s.0=> "a"# starting from n characters in and of m length;def (substr s start len) (s start start+len)(substr s 3 2)=> "de"# starting from n characters in, up to the end of the string(s 3 nil)=> "defgh"# whole string minus last character;(s 3 -1)=> "defg"# starting from a known character within the string and of <tt>m</tt> length;# starting from a known substring within the string and of <tt>m</tt> length.let start (pos s pat) (s start start+m)
import "./str" for Str var s = "αβγδεζηθ"var n = 2var m = 3var kc = "δ" // known charactervar ks = "δε" // known string// for referenceSystem.print("Index of characters: 01234567")System.print("Complete string: %(s)")// starting from n characters in and of m lengthSystem.print("Start %(n), length %(m): %(Str.sub(s, n...n+m))")// starting from n characters in, up to the end of the stringSystem.print("Start %(n), to end: %(Str.sub(s, n..-1))")// whole string minus last characterSystem.print("All but last: %(Str.sub(s, 0..-2))")// starting from a known character within the string and of m lengthvar dx = s.indexOf(kc)if (dx >= 0) { System.print("Start '%(kc)', length %(m): %(Str.sub(s[dx..-1], 0...m))")}// starting from a known substring within the string and of m lengthvar sx = s.indexOf(ks)if (sx >= 0) { System.print("Start '%(ks)', length %(m): %(Str.sub(s[sx..-1], 0...m))")}
Index of character: 01234567Complete string: αβγδεζηθStart 2, length 3: γδεStart 2, to end: γδεζηθAll but last: αβγδεζηStart 'δ', length 3: δεζStart 'δε', length 3: δεζ
include xpllib; \provides StrLen and StrFindproc PMid(S, N, M); \Print string at Nth character M chars longchar S, N, M, I;[for I:= 1 to M do ChOut(0, S(N-2+I));CrLf(0);];char S;def N=2, M=3;[S:= "abcdefgh";PMid(S, N, M); \starting from N chars in and of M lengthPMid(S, N, StrLen(S)-N+1); \starting from N chars in, up to end of stringPMid(S, 1, StrLen(S)-1); \whole string minus last characterPMid(StrFind(S, "d" ), 1, M); \starting from known char and of M lengthPMid(StrFind(S, "cd"), 1, M); \starting from known substring and of M length]
bcdbcdefghabcdefgdefcde
str = "abcdefgh";n = 2;m = 3;// starting from n character in and of m lengthwrite, strpart(str, n:n+m-1);// starting from n character in, up to the end of the stringwrite, strpart(str, n:);// whole string minus last characterwrite, strpart(str, :-1);// starting from a known character within the string and of m lengthmatch = strfind("d", str);write, strpart(str, [match(1), match(1)+m]);// starting from a known substring within the string and of m lengthmatch = strfind("cd", str);write, strpart(str, [match(1), match(1)+m]);
8 bit ASCII
var str = "abcdefgh", n = 2, m = 3;str[n,m] //-->"cde"str[n,*] //-->"cdefgh"str[0,-1] //-->"abcdefg"str[str.find("d"),m] //-->"def"str[str.find("de"),m] //-->"def"