The L SEP character

Post Reply
User avatar
rbytes
Posts: 1338
Joined: Sun May 31, 2015 12:11 am
My devices: iPhone 11 Pro Max
iPad Pro 11
MacBook
Dell Inspiron laptop
CHUWI Plus 10 convertible Windows/Android tablet
Location: Calgary, Canada
Flag: Canada
Contact:

The L SEP character

Post by rbytes »

L SEP is a mysterious character that I have recently encountered on the web. Apparently it is used in HTML as a line feed. It has its place in the ASCII table as character 8232.

I found that it also acts as a line feed in Smart Basic, with one major difference from the carriage return CHR$(13) or line feed CHR$(10). It does not cause a syntax error when L SEPs are included in a text string!

Here is an example of a perfectly legal string definition:

test2$="Today I am your chariot horse
Tomorrow I'm your albatross
Suspended by the finest thread
No one could ever see
And when there's breathing in your ear
You put your faith in all you hear
But just how deep those feelings go
I have no way to know
I'll never know" It contains 8 L SEPs!

Notice that on the Forum, these L SEP characters do not cause line feeds, and are invisible. But when viewed in Smart Basic's Editor, that same code looks like this, yet gives no syntax error.

test2$="Today I am your chariot horse

Tomorrow I'm your albatross

Suspended by the finest thread

No one could ever see

And when there's breathing in your ear

You put your faith in all you hear

But just how deep those feelings go

I have no way to know

I'll never know"

More weirdness: if you obtain the length of this test string using the SB LEN command, the result with be 8 higher than the count of visible characters. Fortunately SB string commands can identify, count and generally process the L SEP character.

I had problems when I tried sending strings that contained CHR$(8232) characters to a PC print server. The PC does NOT recognize LSEP characters as line feeds. It prints them instead as actual printable characters, such as shown here:
E585777D-385B-4F49-B7A0-C1256DB35362.jpeg
E585777D-385B-4F49-B7A0-C1256DB35362.jpeg (98.94 KiB) Viewed 1418 times
This character appears blank in an ASCII printout done in Smart Basic. That surprises me since iOS and Windows are compatible with the same fonts. Why don't we see the same L SEP font character in iOS that appears in Windows?

I had to come up with a method to identify ASCII 8232 and substitute standard CR and LF characters before I could print properly using a Windows print server. I will post that tomorrow.
The only thing that gets me down is gravity...

User avatar
GeorgeMcGinn
Posts: 435
Joined: Sat Sep 10, 2016 6:37 am
My devices: IPad Pro 10.5in
IMac
Linux i386
Windows 7 & 10
Location: Venice, FL
Flag: United States of America
Contact:

Re: The L SEP character

Post by GeorgeMcGinn »

The other thing that is weird about them is while your length is increased by eight, but in hex it's only three characters (E2 80 A8) See attached Image.

I thought I recognized this as a Unicode and did some research and found this Wikipedia page about the LSEP character. (https://en.m.wikipedia.org/wiki/Unicode ... rs#/search)

Also there is another page, Fileinfo (https://www.fileformat.info/info/unicod ... /index.htm).

The LSEP seems like some type of carryover, possibly when DOS first came out, but I'm not sure. In the second website it shows all the variations (in UTF) which includes 16- and 32-bit ASCII.


IMG_0533.JPG
IMG_0533.JPG (1.49 MiB) Viewed 1393 times
George McGinn
Computer Scientist/Cosmologist/Writer/Photographer
Member: IEEE, IEEE Computer Society
IEEE Sensors Council & IoT Technical Community
American Association for the Advancement of Science (AAAS)

Post Reply