[rt-users] FullTextSearch the parts of word

Václav Ovsík vaclav.ovsik at i.cz
Fri Apr 11 04:51:07 EDT 2014


Hi Arkady,

On Thu, Apr 10, 2014 at 03:33:16PM +0400, Arkady Glazov wrote:
> Hi Václav,
> I will be in wait.
> 
> I look databases. All content saved as 'quoted-printable'. I can send
> example If it help you.
...

On Fri, Apr 11, 2014 at 09:38:26AM +0400, Arkady Glazov wrote:
> Hi Václav,
> 
> I send example of letter after this email.
> 
> In my RT Database body of email show as:
...


I can confirm this. I try your message and my own message with latin2
chars and both are qp encoded :(.

  -[ RECORD 1 ]---+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  id              | 313295
  transactionid   | 411992
  parent          | 313294
  messageid       | E1WYUC2-0005Yb-35 at skat-rt.seagroup.inc
  subject         | Quoted-printable test for RT
  filename        | 
  contenttype     | text/plain
  contentencoding | quoted-printable
  content         | =D0=9F=D1=80=D0=B8=D0=BC=D0=B5=D1=80 =D1=82=D0=B5=D0=BA=D1=81=D1=82=D0=B0 =
                  | =D1=81=D0=BE=D0=B4=D0=B5=D1=80=D0=B6=D0=B0=D1=89=D0=B5=D0=B3=D0=BE =D0=BA=
                  | =D0=B8=D1=80=D0=B8=D0=BB=D0=B8=D1=86=D1=83 =D0=B8 =D0=BB=D0=B0=D1=82=D0=B8=
                  | =D0=BD=D0=B8=D1=86=D1=83.
                  | This is example of cyrillic and latin text in th body.
                  | Encode as quoted-printable.
                  | 
                  | --
                  | Arkady Glazov
                  | 
  ...

  nis=# \x
  Expanded display is on.
  nis=# select * from attachments where transactionid =411999;
  -[ RECORD 1 ]---+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  id              | 313298
  transactionid   | 411999
  parent          | 0
  messageid       | 20140411083151.GF8681 at bobek.localdomain
  subject         | test latin2
  filename        | 
  contenttype     | text/plain
  contentencoding | quoted-printable
  content         | This is latin2 test:
                  | Diakritika v =C4=8Desk=C3=BDch znac=C3=ADch...
                  | =C5=BDlu=C5=A5ou=C4=8Dk=C3=BD k=C5=AF=C5=88 =C3=BAp=C4=9Bl =C4=8F=C3=A1bels=
                  | k=C3=A9 =C3=B3dy.
                  | --=20
                  | V=C3=A1clav Ovs=C3=ADk
                  | IIT-UNIX
                  | ICZ a.s.
                  | Pobo=C4=8Dka Plze=C5=88
                  | N=C3=A1m=C4=9Bst=C3=AD M=C3=ADru 10, 301 00 Plze=C5=88, CZ
                  | Tel. +420 222 275 511
                  | vaclav.ovsik at i.cz
                  | http://www.i.cz
                  | 
                  | 
  ...

Than this is maybe ready for bugreport. I will try to debug this a little...
I think the previous versions of RT did decoding MIME encodings as possible
into UTF-8 raw shape, so fulltext can work.

My current RT 3.8.16 has distribution of encoding:

nis=# select distinct contentencoding, count(contentencoding) from attachments group by contentencoding;
 contentencoding  | count  
------------------+--------
 none             | 283405
 quoted-printable |    547
 base64           |   1711
                  |      0
(4 rows)

Maybe this is a regression or some ugly feature of RT 4.2.x.

-- 
Zito



More information about the rt-users mailing list