SPACE IN url
By wktd | March 16, 2010
http://cartouche-epson.1000cartouches.com/imprimantes_EPSON_Stylus%20Color_670.html
or
http://cartouche-epson.1000cartouches.com/imprimantes_EPSON_Stylus Color_670.html
This is no probleme for google that index these pages without probleme but phpdig don't. Is there some thing to change to get it ?
Regards
JPS
In robot_functions.php add the following to the phpdigRewriteUrl function:
$eval = str_replace(" ","%20",$eval);
Also in robot_functions.php add the following to the phpdigUpdSpiderRow function:
$path = str_replace(" ","%20",$path);
$file = str_replace(" ","%20",$file);
try this$eval = str_replace(" ","",$eval);
Don't know if that will work, but it's worth a shot.
Also, and I have to ask this, how practical would it be for you to modify those URL's so there is no embedded space?
JPS
Regards :)
First, in phpdigExplore find:
while (eregi("(]*src[[:blank:]]*=href[[:blank:]]*=http-equiv=['"]refresh['"] *content=['"][0-9]+;url[[:blank:]]*=window[.]location[[:blank:]]*=window[.]open[[:blank:]]*[(])[[:blank:]]*[\\'"]?((([[a-z]{3,5}://)+(([.a-zA-Z0-9-])+(:[0-9]+)*))*([:%/?=&;\\,._a-zA-Z0-9\+-]*))(#[.a-zA-Z0-9-]*)?[\\'" ]?",$eval,$regs)) {
and replace with:
while (eregi("(]*src[[:blank:]]*=href[[:blank:]]*=http-equiv=['"]refresh['"] *content=['"][0-9]+;url[[:blank:]]*=window[.]location[[:blank:]]*=window[.]open[[:blank:]]*[(])[[:blank:]]*[\\'"]?((([[a-z]{3,5}://)+(([.a-zA-Z0-9-])+(:[0-9]+)*))*([:%/?=&;\\,._a-zA-Z0-9\+ ()-]*))(#[.a-zA-Z0-9-]*)?[\\'" ]?",$eval,$regs)) {
Second, in phpdigIndexFile find:
while (eregi("]*href[[:blank:]]*=[[:blank:]]*[\\'"]?(((http://)+(([.a-zA-Z0-9-])+(:[0-9]+)*))*([:%/?=&;\\,._a-zA-Z0-9-]*))[#\\'" ]?)",$line,$regs)) {
and replace with:
while (eregi("]*href[[:blank:]]*=[[:blank:]]*[\\'"]?(((http://)+(([.a-zA-Z0-9-])+(:[0-9]+)*))*([:%/?=&;\\,._a-zA-Z0-9\+ ()-]*))[#\\'" ]?)",$line,$regs)) {
Now try another reindex. What are the results?
Remember to remove any "word" wrapping in the above code.
Regards
JPS
Still not working
Thank's for your help
#If you have any other info about this subject , Please add it free.# |
Topics: enart.zjbr.com | edit