max files parameter question for Tika Server

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|

max files parameter question for Tika Server

Olivier Tavard
Hello

I have a question about starting Tika Server 1.19 and the spawnChild mode (which seems very promising !). We successfully activated it and we can add options like setting the child memory with -JXmx4G parameter for example or set the -taskTimeoutMillis parameter. But it is impossible to set the maxFiles parameter. We try different settings as -maxFiles100 or -maxFiles 1000 or -maxFiles100L for example but there is still an error :

java.lang.NumberFormatException: null

                  at java.lang.Long.parseLong(Long.java:552)

                  at java.lang.Long.parseLong(Long.java:631)

                  at org.apache.tika.server.TikaServerCli.executeLegacy(TikaServerCli.java:261)

                  at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:148)

                  at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)

java.io.EOFException

                  at java.io.DataInputStream.readByte(DataInputStream.java:267)

                  at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:153)

                  at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:139)

                  at org.apache.tika.server.TikaServerWatchDog.execute(TikaServerWatchDog.java:87)

                  at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:136)

                  at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)

 

Am I wrong with the configuration ? Could you tell me how to pass the maxFiles parameter correctly to the CLI please ?

Thanks,
Best regards,

Olivier

 

Reply | Threaded
Open this post in threaded view
|

Re: max files parameter question for Tika Server

Tim Allison
Probably a bug. I'll take a look.  We'll cut 1.19.1 in a few days,
likely, so that fix should make it in.  Let me know what else you
find, and thank you for reporting this!
On Wed, Oct 3, 2018 at 11:03 AM Olivier Tavard
<[hidden email]> wrote:

>
> Hello
>
> I have a question about starting Tika Server 1.19 and the spawnChild mode (which seems very promising !). We successfully activated it and we can add options like setting the child memory with -JXmx4G parameter for example or set the -taskTimeoutMillis parameter. But it is impossible to set the maxFiles parameter. We try different settings as -maxFiles100 or -maxFiles 1000 or -maxFiles100L for example but there is still an error :
>
> java.lang.NumberFormatException: null
>
>                   at java.lang.Long.parseLong(Long.java:552)
>
>                   at java.lang.Long.parseLong(Long.java:631)
>
>                   at org.apache.tika.server.TikaServerCli.executeLegacy(TikaServerCli.java:261)
>
>                   at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:148)
>
>                   at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)
>
> java.io.EOFException
>
>                   at java.io.DataInputStream.readByte(DataInputStream.java:267)
>
>                   at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:153)
>
>                   at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:139)
>
>                   at org.apache.tika.server.TikaServerWatchDog.execute(TikaServerWatchDog.java:87)
>
>                   at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:136)
>
>                   at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)
>
>
>
> Am I wrong with the configuration ? Could you tell me how to pass the maxFiles parameter correctly to the CLI please ?
>
> Thanks,
> Best regards,
>
> Olivier
>
>
Reply | Threaded
Open this post in threaded view
|

Re: max files parameter question for Tika Server

Tim Allison
Bug.  https://issues.apache.org/jira/browse/TIKA-2748. Fixed.  Thank you!
On Wed, Oct 3, 2018 at 11:23 AM Tim Allison <[hidden email]> wrote:

>
> Probably a bug. I'll take a look.  We'll cut 1.19.1 in a few days,
> likely, so that fix should make it in.  Let me know what else you
> find, and thank you for reporting this!
> On Wed, Oct 3, 2018 at 11:03 AM Olivier Tavard
> <[hidden email]> wrote:
> >
> > Hello
> >
> > I have a question about starting Tika Server 1.19 and the spawnChild mode (which seems very promising !). We successfully activated it and we can add options like setting the child memory with -JXmx4G parameter for example or set the -taskTimeoutMillis parameter. But it is impossible to set the maxFiles parameter. We try different settings as -maxFiles100 or -maxFiles 1000 or -maxFiles100L for example but there is still an error :
> >
> > java.lang.NumberFormatException: null
> >
> >                   at java.lang.Long.parseLong(Long.java:552)
> >
> >                   at java.lang.Long.parseLong(Long.java:631)
> >
> >                   at org.apache.tika.server.TikaServerCli.executeLegacy(TikaServerCli.java:261)
> >
> >                   at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:148)
> >
> >                   at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)
> >
> > java.io.EOFException
> >
> >                   at java.io.DataInputStream.readByte(DataInputStream.java:267)
> >
> >                   at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:153)
> >
> >                   at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:139)
> >
> >                   at org.apache.tika.server.TikaServerWatchDog.execute(TikaServerWatchDog.java:87)
> >
> >                   at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:136)
> >
> >                   at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)
> >
> >
> >
> > Am I wrong with the configuration ? Could you tell me how to pass the maxFiles parameter correctly to the CLI please ?
> >
> > Thanks,
> > Best regards,
> >
> > Olivier
> >
> >
Reply | Threaded
Open this post in threaded view
|

Re: max files parameter question for Tika Server

Olivier Tavard
Hello,

Thanks for the quick fix !
I will do more tests tomorrow with Tika server. I will let you know if I find something else.  

Best regards,
Olivier

Le 3 oct. 2018 à 21:29, Tim Allison <[hidden email]> a écrit :

Bug.  https://issues.apache.org/jira/browse/TIKA-2748. Fixed.  Thank you!
On Wed, Oct 3, 2018 at 11:23 AM Tim Allison <[hidden email]> wrote:

Probably a bug. I'll take a look.  We'll cut 1.19.1 in a few days,
likely, so that fix should make it in.  Let me know what else you
find, and thank you for reporting this!
On Wed, Oct 3, 2018 at 11:03 AM Olivier Tavard
<[hidden email]> wrote:

Hello

I have a question about starting Tika Server 1.19 and the spawnChild mode (which seems very promising !). We successfully activated it and we can add options like setting the child memory with -JXmx4G parameter for example or set the -taskTimeoutMillis parameter. But it is impossible to set the maxFiles parameter. We try different settings as -maxFiles100 or -maxFiles 1000 or -maxFiles100L for example but there is still an error :

java.lang.NumberFormatException: null

                 at java.lang.Long.parseLong(Long.java:552)

                 at java.lang.Long.parseLong(Long.java:631)

                 at org.apache.tika.server.TikaServerCli.executeLegacy(TikaServerCli.java:261)

                 at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:148)

                 at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)

java.io.EOFException

                 at java.io.DataInputStream.readByte(DataInputStream.java:267)

                 at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:153)

                 at org.apache.tika.server.TikaServerWatchDog$ChildProcess.<init>(TikaServerWatchDog.java:139)

                 at org.apache.tika.server.TikaServerWatchDog.execute(TikaServerWatchDog.java:87)

                 at org.apache.tika.server.TikaServerCli.execute(TikaServerCli.java:136)

                 at org.apache.tika.server.TikaServerCli.main(TikaServerCli.java:117)



Am I wrong with the configuration ? Could you tell me how to pass the maxFiles parameter correctly to the CLI please ?

Thanks,
Best regards,

Olivier