We build. You grow.

Get best community software here

Start a social network, a fan-site, an education project with oxwall - free opensource community software

chmod list and robot.txt[Answered] | Forum

Topic location: Forum home » Support » General Questions
Alix
Alix Nov 2 '14
Hello everyone,

Someone has a list of chmod, permissions for folders and files?
Someone has a best best robot.txt configuration?

Thank you all for your futur answer.
Best Regards
The Forum post is edited by ross Nov 6 '14
Alix
Alix Nov 2 '14
youpie, for the crawler, i just won't proctect  a sensitive files and things from seekers.
just a standart protect... Do you have any exemple, can i applicate?

thank for your screen image for chmod. But you forget, i'm a noob. I think i'm going to make a error if don't have a number rules like: 0777, 0755, 0644.

Thank for your helps. Realy i appreciate.

Danke

tammy harris
tammy harris Nov 2 '14
robots file will only stop the good bots  
ross Team
ross Nov 2 '14
Alix you need to set 777 permissions recursively to these folders:


ow_pluginfiles ow_userfiles, ow_static, ow_smarty/template_c

Alix
Alix Nov 3 '14
Coool!!!
@Chris_W
@ross
It's was realy was i need thank a lot for your help.

Euh!! @roos for chmod it was only that i need to put on 777?

ow_pluginfiles ow_userfiles, ow_static, ow_smarty/template_c


The rest i put in 644 or 755? Sorry :)

Tchuss!!
ross Team
ross Nov 3 '14
Oh, also ow_log folder should be 777 other folders should be 755 and all files 644
Alix
Alix Nov 6 '14
I understand now ross,
Thank you for your help. It's so nice!
Have a good day
The Forum post is edited by Alix Nov 6 '14
ross Team
ross Nov 6 '14
No problem. I'm going to mark this thread as Answered. 
Harry
Harry Dec 3 '14
If you want a good chance at blocking the bots, put this at the end of your htaccess file.  It does really well.



#RewriteBase /#RewriteCond %{HTTP_USER_AGENT} ADSARobot|ah-ha|almaden|aktuelles|Anarchie|amzn_assoc|ASPSeek|ASSORT|ATHENS|Atomz|attach|attache|autoemailspider|BackWeb|Bandit|BatchFTP|bdfetch|big.brother|BlackWidow|bmclient|Boston\ Project|BravoBrian\ SpiderEngine\ MarcoPolo|Bot\ mailto:craftbot@yahoo.com|Buddy|Bullseye|bumblebee|capture|CherryPicker|ChinaClaw|CICC|clipping|Collector|Copier|Crescent|Crescent\ Internet\ ToolPak|Custo|cyberalert|DA$|Deweb|diagem|Digger|Digimarc|DIIbot|DISCo|DISCo\ Pump|DISCoFinder|Download\ Demon|Download\ Wonder|Downloader|Drip|DSurf15a|DTS.Agent|EasyDL|eCatch|ecollector|efp@gmx\.net|Email\ Extractor|EirGrabber|email|EmailCollector|EmailSiphon|EmailWolf|Express\ WebPictures|ExtractorPro|EyeNetIE|FavOrg|fastlwspider|Favorites\ Sweeper|Fetch|FEZhead|FileHound|FlashGet\ WebWasher|FlickBot|fluffy|FrontPage|GalaxyBot|Generic|Getleft|GetRight|GetSmart|GetWeb!|GetWebPage|gigabaz|Girafabot|Go\!Zilla|Go!Zilla|Go-Ahead-Got-It|GornKer|gotit|Grabber|GrabNet|Grafula|Green\ Research|grub-client|Harvest|hhjhj@yahoo|hloader|HMView|HomePageSearch|http\ generic|HTTrack|httpdown|httrack|ia_archiver|IBM_Planetwide|Image\ Stripper|Image\ Sucker|imagefetch|IncyWincy|Indy*Library|Indy\ Library|informant|Ingelin|InterGET|Internet\ Ninja|InternetLinkagent|Internet\ Ninja|InternetSeer\.com|Iria|Irvine|JBH*agent|JetCar|JOC|JOC\ Web\ Spider|JustView|KWebGet|Lachesis|larbin|LeechFTP|LexiBot|lftp|libwww|likse|Link|Link*Sleuth|LINKS\ ARoMATIZED|LinkWalker|LWP|lwp-trivial|Mag-Net|Magnet|Mac\ Finder|Mag-Net|Mass\ Downloader|MCspider|Memo|Microsoft.URL|MIDown\ tool|Mirror|Missigua\ Locator|Mister\ PiX|MMMtoCrawl\/UrlDispatcherLLL|^Mozilla$|Mozilla.*Indy|Mozilla.*NEWT|Mozilla*MSIECrawler|MS\ FrontPage*|MSFrontPage|MSIECrawler|MSProxy|multithreaddb|nationaldirectory|Navroad|NearSite|NetAnts|NetCarta|NetMechanic|netprospector|NetResearchServer|NetSpider|Net\ Vampire|NetZIP|NetZip\ Downloader|NetZippy|NEWT|NICErsPRO|Ninja|NPBot|Octopus|Offline\ Explorer|Offline\ Navigator|OpaL|Openfind|OpenTextSiteCrawler|OrangeBot|PageGrabber|Papa\ Foto|PackRat|pavuk|pcBrowser|PersonaPilot|Ping|PingALink|Pockey|Proxy|psbot|PSurf|puf|Pump|PushSite|QRVA|RealDownload|Reaper|Recorder|ReGet|replacer|RepoMonkey|Robozilla|Rover|RPT-HTTPClient|Rsync|Scooter|SearchExpress|searchhippo|searchterms\.it|Second\ Street\ Research|Seeker|Shai|Siphon|sitecheck|sitecheck.internetseer.com|SiteSnagger|SlySearch|SmartDownload|snagger|Snake|SpaceBison|Spegla|SpiderBot|sproose|SqWorm|Stripper|Sucker|SuperBot|SuperHTTP|Surfbot|SurfWalker|Szukacz|tAkeOut|tarspider|Teleport\ Pro|Templeton|TrueRobot|TV33_Mercator|UIowaCrawler|UtilMind|URLSpiderPro|URL_Spider_Pro|Vacuum|vagabondo|vayala|visibilitygap|VoidEYE|vspider|Web\ Downloader|w3mir|Web\ Data\ Extractor|Web\ Image\ Collector|Web\ Sucker|Wweb|WebAuto|WebBandit|web\.by\.mail|Webclipping|webcollage|webcollector|WebCopier|webcraft@bea|webdevil|webdownloader|Webdup|WebEMailExtrac|WebFetch|WebGo\ IS|WebHook|Webinator|WebLeacher|WEBMASTERS|WebMiner|WebMirror|webmole|WebReaper|WebSauger|Website|Website\ eXtractor|Website\ Quester|WebSnake|Webster|WebStripper|websucker|webvac|webwalk|webweasel|WebWhacker|WebZIP|Wget|Whacker|whizbang|WhosTalking|Widow|WISEbot|WWWOFFLE|x-Tractor|^Xaldon\ WebSpider|WUMPUS|Xenu|XGET|Zeus.*Webster|Zeus [NC]

ross Team
Alix
Alix Dec 3 '14

thank Harry B, i thank now bad crawler must be carefull with your restriction text. :)

I'm going to test that.

Just a little question for all 777 permissions recursively. That mean also for files of folder must be set 777

Thank

ross Team
ross Dec 3 '14
Yes, Alix files for folders which I mentioned above will be set 777 as well when you do that recursively.